Overview
Skills
Job Details
Position Title: SRE + DevOps Engineer
Location: Dallas, TX & NYC/Jersey City, NJ - Onsite
Type of Hire: Full-Time Employee (FTE) / Direct W2
Visa: Independent only
Role Summary
We are seeking an experienced SRE + DevOps Engineer with 10+ years of experience in managing large-scale cloud infrastructure, implementing CI/CD pipelines, ensuring service reliability, and championing DevOps culture. This role requires deep expertise in cloud platforms, automation, observability, and secure infrastructure operations.
Key Responsibilities
- Cloud Infrastructure Management
Use AWS, Azure, Google Cloud Platform, or OCI for provisioning, scaling, and securing infrastructure via IaC tools like Terraform, Ansible, or CloudFormation. - Containerization & Orchestration
Build and manage scalable containerized applications using Docker and Kubernetes. - Monitoring & Incident Response
Use observability tools (Prometheus, Grafana, ELK) to monitor system health, respond to incidents, and perform root cause analysis. - CI/CD Automation
Build, maintain, and enhance pipelines using Jenkins, GitLab CI, or CircleCI for automated testing and deployment. - Security & Compliance
Implement security best practices: IAM, secrets management, VPC/NACLs, auditing, and vulnerability scanning. - Scripting & Automation
Write scripts in Python, Bash, or Java for task automation, performance optimization, and system health checks. - Performance & Scalability Engineering
Analyze system bottlenecks, perform resource tuning, and ensure uptime and resilience during peak traffic. - Release Management
Manage release cycles and change management processes to enable zero-downtime deployments. - Collaboration
Partner with development, QA, and product teams to align DevOps/SRE practices with business goals and release schedules.
Technical Skill Matrix
Skill Category | Technology/Tool | Proficiency |
Cloud Platforms | AWS, Azure, Google Cloud Platform, OCI | Must Have |
Infrastructure as Code | Terraform, Ansible, CloudFormation | Must Have |
Containers & Orchestration | Docker, Kubernetes | Must Have |
CI/CD Tools | Jenkins, GitLab CI, CircleCI | Must Have |
Scripting Languages | Python, Bash, Java | Must Have |
Monitoring & Logging | Prometheus, Grafana, ELK Stack, Datadog | Must Have |
Security Practices | IAM, Secrets Mgmt, Vulnerability Scanning | Must Have |
Networking | VPCs, DNS, firewalls, load balancers | Must Have |
DevOps Processes | Release Management, Change Control | Must Have |
Collaboration Tools | Jira, Confluence, Slack, Git | Good to Have |
Nice to Have
- Certifications in AWS, Azure, Google Cloud Platform, Kubernetes (CKA/CKAD)
- Familiarity with microservices, service mesh (Istio, Linkerd)
- Exposure to DevSecOps methodologies
- Experience with serverless frameworks or hybrid cloud architecture
Education
- Bachelor s degree in Computer Science, Engineering, or related field (or equivalent experience)
Behavioral Skills
- Strong verbal and written communication skills
- High ownership mindset and collaborative spirit
- Analytical thinker with a proactive approach to problem-solving
- Ability to thrive in a fast-paced, high-performance team