Sr. Platform Reliability Engineer

Overview

On Site
$180,000 - $200,000 per annum
Full Time

Job Details



About the Company


This is a well-funded, innovative technology company building advanced, AI-driven products for enterprise customers. With teams in San Francisco and Atlanta, the company is in a high-growth phase, actively scaling its platform into a secure, reliable, multi-tenant enterprise system. The engineering team is deeply technical, highly collaborative, and focused on building infrastructure the right way, with a strong emphasis on reliability, scalability, and long-term platform health.



The Role


The Senior Platform Reliability Engineer will help design, build, and own the infrastructure that powers the company's core platform. This role goes far beyond operational support: you'll be shaping the foundational systems that enable the platform to scale securely and reliably as the product and customer base grow. This is an ideal opportunity for a senior engineer who enjoys greenfield platform work, thrives on system ownership, and wants to influence infrastructure strategy at a meaningful level.



*This role does not offer sponsorship.*



What You'll Be Working On



  • Building and owning core platform infrastructure across cloud, containers, and hybrid environments

  • Improving system reliability, scalability, and resilience as the platform evolves

  • Partnering closely with engineering teams to enable safe, efficient deployments

  • Strengthening security, compliance, and disaster recovery practices across the platform



Key Responsibilities:


Infrastructure & Platform Ownership



  • Design, build, and maintain infrastructure across containers, virtual machines, and hybrid environments

  • Architect scalable and highly available systems on major cloud platforms

  • Drive operational excellence and reliability across core platform services


Security, Compliance & Resilience



  • Strengthen security across secrets management, RBAC, and network policies

  • Support and improve compliance checks, disaster recovery plans, and backup strategies

  • Proactively identify and address reliability and resilience gaps


CI/CD & Automation



  • Own CI/CD pipelines, container build workflows, and automated deployments

  • Improve release reliability and deployment confidence

  • Enforce secure, consistent, and efficient delivery standards



Technical Requirements:



  • Bachelor's Degree from a Top 20 University

  • Strong cloud experience with AWS, Google Cloud Platform, or Azure

  • Infrastructure as Code experience using Terraform

  • Hands-on experience with Docker and Kubernetes

  • Proven experience owning and operating production-grade systems at scale



Oscar Associates Limited (US) is acting as an Employment Agency in relation to this vacancy.
