Senior Site Reliability Engineer

Lehi, UT, US • Posted 1 day ago • Updated 13 hours ago
Full Time
On-site
USD $125,000.00 - 145,000.00 per year
Fitment

Dice Job Match Score™

🔢 Crunching numbers...

Job Details

Skills

  • Privacy
  • PKI
  • Dragon NaturallySpeaking
  • DNS
  • Lifecycle Management
  • Artificial Intelligence
  • Software Development
  • Bridging
  • Software Engineering
  • Service Level
  • Real-time
  • Issue Resolution
  • Root Cause Analysis
  • Continuous Integration and Development
  • SAFE
  • Quality Assurance
  • Reliability Engineering
  • Software Architecture
  • Forecasting
  • Scalability
  • IaaS
  • Budget
  • Apache Velocity
  • Continuous Improvement
  • CHAOS
  • Testing
  • Cloud Computing
  • Amazon Web Services
  • Google Cloud Platform
  • Google Cloud
  • Microsoft Azure
  • DevOps
  • Kubernetes
  • Terraform
  • Continuous Integration
  • Continuous Delivery
  • Scripting
  • Python
  • Bash
  • Grafana
  • Splunk
  • New Relic
  • Debugging
  • Software Testing
  • Professional Development
  • LinkedIn
  • WINS
  • Insurance
  • Regulatory Compliance
  • SAP BASIS
  • Recruiting

Summary

Who we are

DigiCert is a global leader in intelligent trust. We protect the digital world by ensuring the security, privacy, and authenticity of every interaction. Our AI-powered DigiCert ONE platform unifies PKI, DNS, and certificate lifecycle management, to secure infrastructure, software, devices, messages, AI content and agents. Learn why more than 100,000 organizations, including 90% of the Fortune 500, choose DigiCert to stop today's threats and prepare for a quantum-safe future at ;br>
Job summary

The Site Reliability Engineer (SRE) collaborates with development teams to embed reliability, scalability, and performance best practices throughout the software development lifecycle. This role bridges software engineering and cloud operations, ensuring mission-critical systems remain highly available and resilient. By integrating reliability early, the SRE fosters a culture of shared responsibility while enabling rapid and safe feature delivery.

What you will do
  • Design and build fault-tolerant, high-performing systems that meet Service Level Objectives (SLOs) and Service Level Agreements (SLAs).
  • Implement monitoring, alerting, distributed tracing, and logging to ensure real-time system health visibility and proactive issue resolution.
  • Act as a first responder for production incidents, conduct blameless postmortems, and drive root cause analysis (RCA) and corrective actions.
  • Develop self-healing, automated deployments, and scaling solutions to minimize toil and improve system efficiency.
  • Improve continuous integration and deployment pipelines to enable safe, rapid, and reliable feature rollouts.
  • Review code, debug issues, and perform quality assurance (QA) on software components to enhance system reliability and performance.
  • Work closely with development teams to ensure best practices in software architecture, coding standards, and operational readiness.
  • Forecast scalability needs and optimize cloud infrastructure costs while balancing performance and efficiency.
  • Ensure production environments meet security and compliance requirements, collaborating with teams to mitigate vulnerabilities and enforce best practices.
  • Work closely with development teams to embed reliability at every stage rather than treating it as an afterthought.
  • Use error budgets to balance feature velocity with system stability.
  • Implement observability and automation-first principles to measure system health and drive continuous improvement.
  • Leverage game days, chaos engineering, and resilience testing to validate system robustness and refine operational processes.

What you will have
  • Extensive experience in distributed systems, cloud-native architectures (AWS, Google Cloud Platform, Azure), and DevOps practices.
  • Proficiency in Kubernetes, Terraform, CI/CD pipelines, and Infrastructure as Code (IaC).
  • Strong scripting and automation skills in Python, Go, Bash, or similar languages.
  • Expertise in observability tools such as Prometheus, Grafana, Datadog, Splunk, New Relic, and OpenTelemetry.
  • Ability to troubleshoot complex production issues and drive scalable, resilient solutions.
  • Experience reviewing code, debugging applications, and conducting software testing to ensure high reliability and quality.

Benefits
  • Competitive compensation and comprehensive health, dental, and vision coverage
  • Retirement savings programs with company matching (401(k) or RRSP)
  • Generous paid time off, including holidays, and vacation
  • Paid parental leave and family support benefits
  • Life and disability coverage
  • Flexible spending and health savings options (where applicable)
  • Health and wellness support, including gym reimbursement and wellness programs
  • Employee Assistance Program with 24/7confidential support for employees and families
  • Education assistance and professional development opportunities
  • Access to LinkedIn Learning and continuous learning resources
  • Employee referral bonus program and additional company perks and discounts
  • Internal rewards and recognition platform (Motivosity) to celebrate and acknowledge project wins, milestone achievements, and the outstanding contributions of our colleagues
  • Business travel insurance and global employee support programs

DigiCert is an Equal Opportunity employer and is committed to diversity in its workforce. In compliance with applicable federal and state laws, DigiCert prohibits discrimination on the basis of race or ethnicity, religion, color, national origin, sex, age, sexual orientation, gender identity/expression, veteran's status, status as a qualified person with a disability, or genetic information. Individuals from historically underrepresented groups, such as minorities, women, qualified person with disabilities, and protected veterans are strongly encouraged to apply.

#LI-RR1

Compensation Transparency:

The annualized base salary range for this position is outlined below.

Each candidate's compensation offer will be determined based on factors including experience, skills, qualifications, job duties, business needs, and location. For roles that include additional compensation components, total compensation may include base pay, bonus, equity, or other incentives.

This role may also be eligible for benefits, which will be discussed during the hiring process. We are committed to fair and transparent pay practices and comply with all applicable pay transparency requirements. If you would like more information about compensation or benefits, we are happy to provide additional details during the hiring process.

For more information regarding our comprehensive benefits, see the benefits section.

Base Salary

$125,000-$145,000 USD
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 80184173
  • Position Id: c62ab307b18e185bf5ae1dd4aee049e1
  • Posted 1 day ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Remote or California

Today

Full-time

USD 160,000.00 - 240,000.00 per year

No location provided

Today

Full-time

California

Today

Full-time

USD 120,300.00 - 194,525.00 per year

Remote or California

Today

Full-time

USD 140,000.00 - 180,000.00 per year

Search all similar jobs