Site Reliability Engineer - Platform Infrastructure Engineering

Mountain View, CA, US • Posted 6 days ago • Updated 1 hour ago
Full Time
On-site
USD $168,926.00 - 192,500.00 per year
Fitment

Dice Job Match Score™

🛠️ Calibrating flux capacitors...

Job Details

Skills

  • Health Care
  • Authentication
  • NIST SP 800 Series
  • Pivotal
  • Operational Excellence
  • Grafana
  • Incident Management
  • Knowledge Sharing
  • Computer Science
  • Software Engineering
  • Reliability Engineering
  • DevOps
  • Management
  • Cloud Computing
  • Amazon Web Services
  • Google Cloud
  • Google Cloud Platform
  • Microsoft Azure
  • Java
  • Python
  • Ruby
  • JavaScript
  • Orchestration
  • Docker
  • Kubernetes
  • Continuous Integration
  • Continuous Delivery
  • Recovery
  • Failover
  • CHAOS
  • Systems Design
  • Load Balancing
  • Performance Tuning
  • Terraform
  • Ansible
  • Regulatory Compliance
  • FedRAMP
  • System On A Chip
  • NIST 800-53
  • Analytical Skill
  • Network
  • Communication
  • Documentation
  • FOCUS
  • Collaboration
  • Continuous Improvement
  • Artificial Intelligence
  • Workflow
  • Insurance
  • Training And Development
  • Sales
  • SAP BASIS
  • Recruiting
  • Training
  • Promotions
  • Privacy

Summary

Company Overview

ID.me is the next-generation digital identity wallet that simplifies how individuals securely prove their identity online. Consumers can verify their identity with ID.me once and seamlessly login across websites without having to create a new login and verify their identity again. Over 152 million users experience streamlined login and identity verification with ID.me at 20 federal agencies, 45 state government agencies, and 70+ healthcare organizations. More than 600+ consumer brands use ID.me to verify communities and user segments to honor service and build more authentic relationships. ID.me's technology meets the federal standards for consumer authentication set by the Commerce Department and is approved as a NIST 800-63-3 IAL2 / AAL2 credential service provider by the Kantara Initiative. ID.me is committed to "No Identity Left Behind" to enable all people to have a secure digital identity. To learn more, visit ;br>
Company OverviewID.me is the next-generation digital identity wallet that simplifies how individuals securely prove their identity online. Consumers can verify their identity with ID.me once and seamlessly log in across websites without needing to create a new login and re-verify. Over 140 million users experience streamlined login and identity verification with ID.me at 20 federal agencies, 44 state government agencies, and 66 healthcare organizations. More than 600 consumer brands use ID.me to verify communities and user segments to honor service and build more authentic relationships. ID.me's technology meets the federal standards for consumer authentication set by the Commerce Department and is approved as a NIST 800-63-3 IAL2 / AAL2 credential service provider by the Kantara Initiative. ID.me is committed to "No Identity Left Behind" to enable all people to have a secure digital identity. To learn more, visit ;br>
Role OverviewWe are seeking a Site Reliability Engineer to join our Core Platform Engineering organization. The SRE team builds the automation, observability, and operational foundations that ensure ID.me's services are reliable, scalable, and secure.

As an SRE, you will play a pivotal role in building the platform and governance processes required to safely scale, deploy, and operate a high volume of machine-generated applications and features. You will design and implement the automated guardrails that maintain our high standards for resilience and security in an AI-accelerated development environment. You'll focus on infrastructure automation, observability, performance optimization, and incident response, partnering closely with Software Engineering teams to foster a culture of reliability and operational excellence.

This role is based out of our Mountain View, CA or McLean, VA offices and requires full-time in-office attendance, 5 days per week.

Responsibilities
  • Build and maintain automated reliability tooling, infrastructure as code, and observability systems that enhance uptime and service performance.
  • Develop monitoring, logging, and alerting frameworks (e.g., Prometheus, Grafana, OpenTelemetry) to detect and remediate issues proactively.
  • Implement automated architectural reviews and reliability guardrails for agent-developed applications to ensure machine-generated code meets long-term maintainability and performance standards.
  • Partner with engineering teams to design and implement scalable, fault-tolerant systems that meet defined SLIs and SLOs.
  • Automate repetitive operational tasks and develop self-healing and auto-remediation mechanisms to minimize human intervention.
  • Participate in on-call rotations and lead incident response efforts, performing post-incident reviews and driving systemic improvements.
  • Improve the deployment and release process using CI/CD pipelines and progressive delivery techniques to ensure stability and safety.
  • Champion observability, reliability, and operational readiness reviews as part of the development process.
  • Collaborate with Security and Compliance teams to ensure production systems meet FedRAMP, NIST, and internal policy requirements.
  • Contribute to documentation, runbooks, and internal tooling to enhance knowledge sharing and operational maturity across teams.

Minimum Qualifications
  • Bachelor's degree in Computer Science, Software Engineering, or a related technical field.
  • 3-5 years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering.
  • 2+ years of hands-on experience managing and scaling services in cloud environments such as AWS, Google Cloud Platform, or Azure.
  • 1+ years proficiency in at least one modern programming language (e.g., Java, Go, Python, Ruby, JavaScript).

Preferred Qualifications
  • Strong understanding of containerization and orchestration technologies (Docker, Kubernetes).
  • Experience implementing and maintaining CI/CD pipelines and automation frameworks.
  • Working knowledge of observability systems-metrics, tracing, logging, and alerting.
  • Experience building automated recovery, failover, or chaos-engineering systems to validate reliability.
  • Familiarity with event-driven architecture and asynchronous processing systems.
  • Knowledge of distributed systems design, load balancing, and performance optimization.
  • Exposure to infrastructure-as-code tools (Terraform, Pulumi, Ansible) and GitOps practices.
  • Understanding of security and compliance frameworks (FedRAMP, SOC2, or NIST 800-53).
  • Strong analytical and troubleshooting skills across the stack-from network to application layer.
  • Excellent communication and documentation skills, with a focus on cross-team collaboration and continuous improvement.
  • Experience using AI agentic coding assistants and deploying custom AI agents or automated workflows into production environments.

The annual base salary listed does not include a company bonus, incentive for sales roles, equity and benefits which will be determined based on experience, skills, education, relevant training, geographic location and role.

ID.me offers comprehensive medical, dental, vision, health savings account, flexible spending accounts (medical, limited purpose, dependent care, commuter benefit accounts), basic and voluntary life and AD&D insurance, 401(k) with company match, parental leave, ability to participate in unlimited paid time off subject to the terms and conditions of the PTO policy, including 8 company wide holidays, short and long-term disability insurance, accident and critical illness insurance, referral bonus policy, employee assistance program, pet insurance, travel assistant program, wellbeing and childcare discounts, benefit advocates, and a learning and development benefit.

The above represents the anticipated total rewards package for this job requisition. Final offers may vary from the amount listed based on qualifications, professional experiences, skills, education, relevant training, geographic location, and other job related factors.

Mountain View, CA Pay Range

$168,926-$192,500 USD

ID.me is a full-time, in-office culture. Unless a specific job description explicitly states otherwise, all roles are on-site five days per week at one of our offices in McLean, VA; Mountain View, CA; New York City, NY; or Tampa, FL. Certain roles - such as field-based sales or other remote-by-design positions - may have different work arrangements as noted in their individual postings.

ID.me maintains a work environment free from discrimination, where employees are treated with dignity and respect. All ID.me employees share in the responsibility for fulfilling our commitment to equal employment opportunity. ID.me does not discriminate against any employee or applicant on the basis of age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. ID.me adheres to these principles in all aspects of employment, including recruitment, hiring, training, compensation, promotion, benefits, social and recreational programs, and discipline. In addition, ID.me's policy is to provide reasonable accommodation to qualified employees who have protected disabilities to the extent required by applicable laws, regulations and ordinances where a particular employee works. Upon request we will provide you with more information about such accommodations.

Please review our Privacy Policy, including our CCPA policy, at id.me/privacy. If you provide ID.me with any personally identifiable information you confirm that you have read and agree to be bound by the terms and conditions set out in our Privacy Policy.

ID.me participates in E-Verify.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 80183556
  • Position Id: fdf53c14ec03d1fe0d7d04ae21555519
  • Posted 6 days ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Sunnyvale, California

Today

Full-time

Cupertino, California

Today

Full-time

Sunnyvale, California

Today

Full-time

USD 110,000.00 - 150,000.00 per year

San Mateo, California

Today

Full-time

USD 130,000.00 - 200,000.00 per year

Search all similar jobs