Job title - SRE / DevOps Engineer with Development Experience
Work location - Tampa, US (Hybrid, 3 days a week onsite)
Domain - Cloud & DevOps Engineering
Any certification required - Azure/AWS/Google Cloud Platform Architect
Job Description:
We are seeking a highly skilled SRE / DevOps Engineer with strong development experience to build, automate, and operate scalable cloud-native systems. This role blends software engineering, system reliability, and DevOps practices to improve system performance, enhance automation, and ensure seamless delivery of applications.
The ideal candidate will write production-quality code, design robust infrastructure, and implement automation to improve system reliability and developer productivity.
Key Responsibilities
SRE & Reliability Engineering
- Define and manage SLIs, SLOs, and SLAs
- Ensure high availability, scalability, and fault tolerance of systems
- Lead incident response, troubleshooting, and root cause analysis (RCA)
- Implement self-healing systems and automated recovery mechanisms
- Conduct performance tuning and capacity planning
DevOps & CI/CD Engineering
- Design and implement CI/CD pipelines (Azure DevOps, GitHub Actions, Jenkins)
- Automate build, test, and deployment workflows
- Implement canary, blue-green, and rolling deployment strategies
- Improve release quality and reduce deployment risks
Software Development & Automation
- Develop tools, scripts, and microservices to automate operations
- Write clean, efficient code in Python, Go, Java, or .NET
- Build reusable frameworks for deployment, monitoring, and scaling
- Integrate APIs and backend services
Cloud & Infrastructure
- Architect and manage infrastructure on Azure (preferred), AWS, or Google Cloud Platform
- Use Infrastructure as Code (Terraform, Bicep, CloudFormation)
- Design secure, scalable, and cost-effective cloud architectures
Kubernetes & Containers
- Deploy and manage applications using Docker and Kubernetes (AKS preferred)
- Maintain cluster health, scaling, and upgrades
- Develop Helm charts or Kubernetes manifests
- Optimize resource utilization and performance
Observability & Monitoring
- Implement monitoring using Prometheus, Grafana, ELK, OpenTelemetry, Azure Monitor
- Build dashboards, alerts, and automated incident detection systems
- Enable distributed tracing and log aggregation
Security & DevSecOps
- Integrate security practices into CI/CD pipelines
- Manage secrets using Azure Key Vault or similar vault solutions
- Ensure compliance with security standards and policies
Collaboration & Engineering Excellence
- Work closely with developers, QA, and product teams
- Promote a DevOps culture and an automation-first mindset
- Contribute to architecture discussions and design reviews
- Mentor junior engineers
Required Skills & Qualifications
Core Technical Skills
- 5-10 years of experience in DevOps / SRE / Software Engineering
- Strong experience in CI/CD tools (Azure DevOps preferred)
- Hands-on experience with Kubernetes and Docker
- Expertise in Terraform / Infrastructure as Code
- Strong understanding of cloud platforms (Azure preferred)
Programming & Development
- Proficiency in Python, Go, Java, .NET, or similar languages
- Strong coding skills with a focus on automation and backend services
- Experience in API development and system integration
Systems & Networking
- Strong understanding of Linux systems
- Networking fundamentals (DNS, TCP/IP, load balancing, firewalls)
Observability
- Experience with monitoring, logging, and tracing tools
- Understanding of metrics, logs, and distributed tracing
Soft Skills
- Strong problem-solving skills and an analytical mindset
- Ability to handle production issues under pressure
- Excellent communication and collaboration skills
- Ownership-driven approach
Nice to Have
- Experience with GitOps tools (ArgoCD, Flux)
- Knowledge of service mesh (Istio, Linkerd)
- Exposure to event-driven architecture (Kafka, Service Bus)
- Experience with chaos engineering
- Certifications:
  - Azure DevOps Engineer (AZ-400)
  - Kubernetes (CKA/CKAD)