Overview
Work arrangement: Hybrid
Compensation: Depends on experience
Employment type: Full-time
Skills
CI/CD, AWS, Azure, Docker, Terraform, DevOps, Cloud, SRE, Kubernetes, Linux, IaC, CloudFormation, Prometheus, Grafana, ELK, Splunk, CloudWatch, Azure Monitor
Job Details
Job Title: DevOps / SRE Engineer
Location: New Jersey (Hybrid/On-site)
Type of Employment: Full-time
About the Role:
We are seeking an experienced DevOps / Site Reliability Engineer (SRE) with 5–7 years of hands-on experience in cloud infrastructure, automation, CI/CD, and container orchestration. The ideal candidate has strong expertise with Docker, Kubernetes, Jenkins, Maven, and cloud platforms such as AWS or Azure. You will play a critical role in building, scaling, and maintaining highly available, secure, and reliable systems that support enterprise applications.
Key Responsibilities:
- Design, implement, and maintain CI/CD pipelines using Jenkins, Maven, GitHub Actions, or Azure DevOps.
- Build and manage containerized applications using Docker and orchestrate workloads with Kubernetes (EKS, AKS, or self-managed clusters).
- Automate infrastructure provisioning and configuration using IaC tools such as Terraform, CloudFormation, or ARM templates.
- Manage cloud infrastructure on AWS or Azure, including compute, networking, storage, security, and monitoring services.
- Ensure high availability, reliability, scalability, and resilience of production systems following SRE principles.
- Implement monitoring, log management, and alerting using tools like Prometheus, Grafana, ELK, CloudWatch, or Azure Monitor.
- Collaborate with development teams to improve deployment processes and application performance.
- Troubleshoot infrastructure, networking, and build pipeline issues across the stack.
- Participate in the on-call rotation and incident response, resolving incidents quickly and documenting root cause analyses (RCAs).
- Drive automation across operations to reduce manual processes and operational toil.
Required Qualifications:
- 5–7 years of hands-on experience as a DevOps Engineer, SRE, or Cloud Engineer, or in a similar role.
- Strong experience with Docker and containerization best practices.
- Hands-on expertise with Kubernetes (EKS, AKS, GKE, or on-prem k8s).
- Proven experience building CI/CD pipelines using Jenkins, Maven, and Git-based workflows.
- Proficiency with AWS or Azure cloud environments and cloud-native services.
- Strong background in Linux system administration, security, networking fundamentals, and shell scripting.
- Experience implementing IaC using Terraform, CloudFormation, or ARM/Bicep.
- Familiarity with logging/monitoring tools: Prometheus, Grafana, ELK, Splunk, CloudWatch, Azure Monitor, etc.
- Strong analytical, troubleshooting, and performance tuning skills.
- Experience working in Agile/Scrum environments.
Preferred Skills:
- Experience with service mesh technologies (Istio, Linkerd).
- Knowledge of Kubernetes Operators, Helm charts, and GitOps tools (ArgoCD, Flux).
- Familiarity with secrets management tools such as HashiCorp Vault or AWS Secrets Manager.
- Experience with incident management and SRE best practices (SLIs, SLOs, error budgets).
- Knowledge of security best practices for CI/CD, cloud, and containerized environments.
Education:
- Bachelor’s or Master’s degree in Computer Science, Information Systems, Engineering, or a related field (or equivalent practical experience).