Job Summary:
We are seeking a highly skilled Senior Cloud Operations Engineer to drive cloud infrastructure operations, implement SRE practices, and deliver innovative DevSecOps and FinOps solutions. This is an individual contributor role requiring deep expertise in cloud technologies, infrastructure automation, and operational AI implementation.
Key Responsibilities
Cloud Operations & Infrastructure
Design and manage enterprise-scale cloud infrastructure with hands-on backend application knowledge
Optimize cloud environments for performance, reliability, and cost-effectiveness
Collaborate with development teams on infrastructure and application architecture
Site Reliability Engineering (SRE)
Implement SRE frameworks and comprehensive observability solutions
Establish application-level SLOs for .NET-based applications using monitoring tools (Prometheus, Grafana,
Datadog)
Execute incident response and drive continuous improvement initiatives
AI & Machine Learning Operations
Build and deploy AI/ML models for operational use cases (predictive maintenance, anomaly detection)
Ensure model governance, data security, and compliance with data residency requirements
DevSecOps & Security
Assess and improve existing CI/CD practices for security and operational excellence
Implement secure build/deployment pipelines with integrated security scanning and automated testing
AI & Machine Learning Operations
Build and deploy AI/ML models for operational use cases (predictive maintenance, anomaly detection)
Ensure model governance, data security, and compliance with data residency requirements
DevSecOps & Security
Assess and improve existing CI/CD practices for security and operational excellence
Implement secure build/deployment pipelines with integrated security scanning and automated testing
Infrastructure as Code
Develop Terraform modules and Cloud Management Platform (CMP) architecture
Maintain infrastructure automation, version control, and deployment practices
Required Qualifications
Extensive hands-on experience with public cloud platforms (AWS, Azure, Google Cloud Platform)
Strong background in infrastructure management and backend application architectures
Proven experience with SRE frameworks and observability tools
Deep understanding of .NET applications and performance optimization
Senior Cloud Operations Engineer
Hands-on AI/ML experience building operational models with focus on data security.
DevSecOps expertise in secure CI/CD practices and security automation.
FinOps experience in cloud cost management and optimization.
Advanced Terraform proficiency and CMP architecture experience.
Bachelor’s degree in computer science/engineering or equivalent experience.
Strong programming skills (Python, Go, PowerShell).
Experience with containerization (Docker, Kubernetes)
Cloud certifications (AWS, Azure, Google Cloud Platform)
SRE/DevOps certifications
Multi-cloud and hybrid cloud experience
Enterprise-scale, high-availability environment experience