job summary:
An enterprise healthcare client has an immediate opening for a highly motivated Lead Site Reliability Engineer to join its growing team. All qualified candidates are encouraged to apply!
location: Telecommute
job type: Contract
salary: $71.25 - $81.25 per hour
work hours: 8am to 5pm
education: Bachelor's
responsibilities:
- What is the purpose of this team?
- Describe the surrounding team (team culture, work environment, etc.) & key projects.
- Do you have any additional upcoming hiring needs or is this request part of a larger hiring initiative?
  I am forming new teams, focusing on Adobe Stack to enhance the scalability of the Adobe platform. This initiative aims to align with a unified technology strategy that supports evolving business needs.
Typical Day in the Role
- Walk me through the day-to-day responsibilities and a description of the project (Outside of the Workday JD).
- What are the performance expectations/metrics?
- What makes this role unique?
  Lead SRE to drive reliability, scalability, observability (monitoring and alerts), and performance across the production platforms. The role will own the SLO/SLI strategy, modernize observability and incident response, and partner with application teams to deliver resilient systems. It blends hands-on engineering with technical leadership, guiding standards for operational excellence.
Key Responsibilities:
Reliability Strategy & Ownership
- Define and govern SLOs/SLIs/Error Budgets for critical services; enforce guardrails and drive reliability roadmaps.
- Lead performance tuning collaboration with application teams to ensure high availability and low latency.
- Define and own infrastructure tuning to ensure scalability and high availability.
- Lead metrics- and automation-driven reliability.
- Debug systems across layers.
Production Engineering
- Architect and evolve CI/CD and infrastructure-as-code (IaC with Terraform).
- Design and build serverless APIs (Lambda, API Gateway, SQS, SNS, DynamoDB, etc.).
- Build scalable Kubernetes/container platforms, service meshes, and developer self-service workflows.
Observability & Operations
- Mature observability (metrics, logs, traces, RUM, synthetic checks) and AIOps/alert hygiene to reduce noise and MTTR.
- Produce actionable dashboards at team and exec levels.
- Lead incident management (on-call rotations, triage, comms, postmortems).
Security & Compliance
- Partner with Security to embed shift-left practices, secure defaults, and policy-as-code (RBAC, secrets).
- Ensure compliance with SOC 2 / HIPAA / PCI (as applicable) in production operations.
Leadership & Enablement
- Mentor partner teams; establish runbooks, standards, and golden paths.
- Influence architecture decisions, participate in design reviews, and evangelize reliability best practices.
Cost & Efficiency
- Optimize cloud spend via rightsizing, autoscaling, workload placement, and utilization insights.
qualifications:
Bachelor's
Equal Opportunity Employer: Race, Color, Religion, Sex, Sexual Orientation, Gender Identity, National Origin, Age, Genetic Information, Disability, Protected Veteran Status, or any other legally protected group status.
At Randstad Digital, we welcome people of all abilities and want to ensure that our hiring and interview process meets the needs of all applicants. If you require a reasonable accommodation to make your application or interview experience a great one, please contact
Pay offered to a successful candidate will be based on several factors including the candidate's education, work experience, work location, specific job duties, certifications, etc. In addition, Randstad Digital offers a comprehensive benefits package, including: medical, prescription, dental, vision, AD&D, and life insurance offerings, short-term disability, and a 401K plan (all benefits are based on eligibility).
This posting is open for thirty (30) days.
It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.