Urgently need a Senior SRE Engineer with Azure experience for a contract role.
- Remote in US
- Through December 31, 2026, with possible extension
- Must-have skills: current Senior SRE position, Azure Log Analytics, Azure Monitor, Kubernetes, SLA/SLO adherence, Terraform. Previous DevOps roles preferred
- Interview: general interview with me (30 minutes), technical interview with EPAM (90 minutes), project interview (1 hour), client interview (1 hour)
- Candidate should be on your W2 payroll
**About the Role:**
We recently launched services for our client in Azure, and maintaining service health is a key focus. As we grow, we are looking for a Senior SRE to help support incident response, troubleshooting, and the improvement of our cloud reliability. This is a hands-on role for someone eager to contribute to a reliable environment and deepen their skills in Azure and SRE practices.
**Key Responsibilities:**
**Reliability Engineering:**
- Assist in automating operational processes to improve system reliability and performance.
- Work with development and operations teams to learn and apply reliability best practices.
**Incident Response & Troubleshooting:**
- Support the team in responding to service incidents in our Azure environment.
- Participate in root cause analysis and post-incident reviews to help drive improvements.
**Service Health Monitoring:**
- Help implement and maintain monitoring and alerting solutions for critical services.
- Learn to identify and address reliability risks.
**Process & Culture Building:**
- Contribute to establishing SRE practices, including incident management and postmortems.
- Participate in team learning sessions about SRE principles and Azure best practices.
**Continuous Improvement:**
- Assist in analyzing incident trends to support long-term improvements.
- Help promote a culture of reliability and continuous learning.
**Required Skills & Experience:**
- 7+ years in SRE, DevOps, or related roles
- Azure experience
- Experience troubleshooting distributed systems and networking.
- Exposure to Azure monitoring and automation tools (e.g., Azure Monitor, Log Analytics, Application Insights).
- Experience with at least one scripting or programming language (Python, PowerShell, Bash, etc.).
- Familiarity with incident management and observability concepts.
- Good communication skills and ability to work in a team.
**Preferred Qualifications:**
- Azure certifications (e.g., Azure Fundamentals) are a plus.
- Experience with CI/CD pipelines and infrastructure as code.