Sr. Platform Reliability Engineer

Overview

On Site

$180,000 - $200,000 per annum

Full Time

Job Details

About the Company

This is a well-funded, innovative technology company building advanced, AI-driven products for enterprise customers. With teams in San Francisco and Atlanta, the company is in a high-growth phase, actively scaling its platform into a secure, reliable, multi-tenant enterprise system. The engineering team is deeply technical, highly collaborative, and focused on building infrastructure the right way, with a strong emphasis on reliability, scalability, and long-term platform health.

The Role

The Senior Platform Reliability Engineer will help design, build, and own the infrastructure that powers the company's core platform. This role goes far beyond operational support: you'll be shaping the foundational systems that enable the platform to scale securely and reliably as the product and customer base grow. This is an ideal opportunity for a senior engineer who enjoys greenfield platform work, thrives on system ownership, and wants to influence infrastructure strategy at a meaningful level.

*This role does not offer sponsorship.*

What You'll Be Working On

Building and owning core platform infrastructure across cloud, containers, and hybrid environments

Improving system reliability, scalability, and resilience as the platform evolves

Partnering closely with engineering teams to enable safe, efficient deployments

Strengthening security, compliance, and disaster recovery practices across the platform

Key Responsibilities:

Infrastructure & Platform Ownership

Design, build, and maintain infrastructure across containers, virtual machines, and hybrid environments

Architect scalable and highly available systems on major cloud platforms

Drive operational excellence and reliability across core platform services

Security, Compliance & Resilience

Strengthen security across secrets management, RBAC, and network policies

Support and improve compliance checks, disaster recovery plans, and backup strategies

Proactively identify and address reliability and resilience gaps

CI/CD & Automation

Own CI/CD pipelines, container build workflows, and automated deployments

Improve release reliability and deployment confidence

Enforce secure, consistent, and efficient delivery standards

Technical Requirements:

Bachelor's Degree from a Top 20 University

Strong cloud experience with AWS, Google Cloud Platform, or Azure

Infrastructure as Code experience using Terraform

Hands-on experience with Docker and Kubernetes

Proven experience owning and operating production-grade systems at scale

Oscar Associates Limited (US) is acting as an Employment Agency in relation to this vacancy.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions, and AI may have been used to create this description. The position description has been reviewed for accuracy, and Dice believes it to correctly reflect the job opportunity.
