We are looking for an AWS-focused Agile and DevOps SRE engineer. The ideal candidate will be well versed in AWS foundations and service enablement, along with Infrastructure as Code and operations as code via Terraform and Harness.
What you will be responsible for
As an AWS Platform Operations SRE engineer
· Proficient in AWS platform foundations and services.
· Proficient in Terraform and Harness.
· Incident, change and problem management (ITSM).
· Devise solutions and methodologies to reduce MTTRs and increase uptime.
· Continuous monitoring and observability refinements and enhancements.
· Design and maintain a patch management solution.
· Release management for pre-production and production environments.
· Participate in regular DR activities along with applications.
· Standardize and maintain metrics and reporting.
· Continuous vigilance for cost optimization opportunities.
· Showback / chargeback reporting.
· Maintain operations as code practices.
Education & Preferred Qualifications
· Bachelor’s degree in any Engineering discipline
· 10+ years of IT experience in a hands-on role, with 3–5 years of experience designing and supporting AWS environments.
· Understanding of Agile and DevOps.
· Deep understanding of Microsoft Azure IaaS, PaaS and SaaS solutions along with cloud-native design patterns and DevOps principles (Infrastructure as Code).
· Appropriate certifications are strongly preferred, demonstrating an understanding of core infrastructure concepts as well as data and/or DevOps principles.
· Deep understanding of cloud computing technologies with demonstrated hands-on experience in one or more of the following domains:
o Core IaaS: Compute, Storage, Networking, High Availability
o Azure PaaS Services: Azure Kubernetes Service, Event Hub, etc.
o Data Platform and Big Data: SQL Server, Azure SQL DB, Azure Stream Analytics, Azure Data Factory / Databricks
· Strong programming and scripting background in languages such as shell, Python, Java, etc.
· Experience in designing, coding and deploying Terraform templates / modules with Harness.
· Ability to handle critical issues and deliver timely remediation in high-pressure situations.
· Ability to develop and document deliverables compliant with established company design controls and regulatory statutes.
· Experience with automated testing tools is preferred.
· Knowledge of ServiceNow, Splunk and Grafana is preferred.