Job Title: SRE Engineer
Location: Remote/ EST hours
Contract length: 6 months, likely to extend
Schedule: Fulltime, M-F 8a to 5p Eastern
w2 Contract
Job Overview:
We re looking for a Site Reliability Engineer (SRE) to join our Global SRE team. In this role, you ll blend software engineering and systems engineering to help ensure our large-scale, distributed digital products are reliable, scalable, and efficient. You ll work closely with software, platform, and product teams to design, build, and operate systems that support customers worldwide.
Job Responsibilities:
Ensure the reliability, availability, and resiliency of digital products by designing and operating fault-tolerant systems
Partner with product and platform teams to define and improve service health using operational and customer-experience metrics
Design, implement, and maintain monitoring, alerting, logging, and tracing solutions that provide real-time visibility into system behavior and customer experience
Analyze system performance, scalability, and capacity, and drive optimizations to improve efficiency and stability in cloud environments
Build automation and tooling to support deployments, scaling, incident response, and operational workflows
Participate in an on-call rotation as part of a globally distributed team, lead incident response efforts, troubleshoot production issues, conduct postmortems, and drive continuous improvement initiatives
Collaborate with security and compliance partners to support secure, privacy-aware, and compliant operations
Work closely with engineering teams to improve developer experience, operational maturity, and overall customer experience
Job Qualifications:
Experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering roles
Experience operating Kubernetes-based production systems
Hands-on experience with AWS and infrastructure-as-code tools such as Terraform
Experience designing and supporting CI/CD pipelines and automated deployments
Proficiency in Python for automation, tooling, or backend services
Solid understanding of distributed systems and networking concepts
Experience with monitoring and observability platforms such as Datadog and CloudWatch
The Planet Group and our companies are equal opportunity employers. It is our practice not to discriminate against any employee or applicant based on any criteria, condition or basis protected by laws or regulations in the locations where we do business. All qualified applicants are encouraged to apply. We celebrate diversity and are committed to providing an environment of mutual respect. We believe that diversity, equity and inclusion enable us to better meet our mission and values while serving our clients across the globe. If you have a disability or handicap and would like us to accommodate you in any reasonable way, please inform your recruiter, or contact us, so that we can discuss the appropriate alternatives available.