SRE - DevOps

Overview

On Site
Full Time
Part Time
Accepts corp to corp applications
Contract - W2
Contract - Independent

Skills

Reliability Engineering
Problem Solving
Conflict Resolution
Communication
Amazon EC2
Amazon S3
Remote Desktop Services
Amazon RDS
Management
Collaboration
Cloud Computing
Microsoft Azure
DevOps
Amazon Web Services
Kubernetes
Docker
Scripting
Python
Windows PowerShell
Ansible
Java
Continuous Integration
Continuous Delivery
Version Control
Optimization
AppDynamics
Grafana
Zabbix
Dynatrace
Splunk
JFrog
Root Cause Analysis
Incident Management
Capacity Management
Scalability

Job Details

About the Role

We are seeking a skilled DevOps Engineer with strong Site Reliability Engineering (SRE) capabilities to design, build, and maintain scalable infrastructure, optimize CI/CD pipelines, and ensure reliability across critical systems. This role requires both hands-on technical expertise and strong problem-solving, collaboration, and communication skills.

Responsibilities

  • Design, implement, and manage CI/CD pipelines.
  • Develop and maintain infrastructure on Azure DevOps, AWS (EC2, S3, Lambdas, RDS, IAM), and Kubernetes.
  • Automate system management using Python, PowerShell, Ansible.
  • Manage containerized environments (Docker) and optimize cluster operations.
  • Implement and monitor application performance using AppDynamics, Grafana, Zabbix, Datadog, or Dynatrace.
  • Configure and monitor logging and observability tools (ELK, Splunk, Prometheus, CloudWatch).
  • Ensure secure software delivery via SonarQube, JFrog Artifactory.
  • Collaborate with developers to review code, troubleshoot performance issues, and enforce best practices.
  • Proactively identify bottlenecks, scalability issues, and reliability risks.
  • Document systems, processes, and post-mortem learnings.

Required Skills

  • Infrastructure & Cloud: Azure DevOps, AWS (E2+), Kubernetes, Docker.
  • Automation & Scripting: Python, PowerShell, Ansible, Core Java.
  • CI/CD & Version Control: End-to-end pipeline design & optimization.
  • Monitoring & Observability: AppDynamics, Grafana, Zabbix, Datadog, Dynatrace, ELK, Splunk, Prometheus.
  • Security & Quality Tools: JFrog Artifactory, SonarQube.

Professional Competencies

  • Strong root cause analysis and incident response skills.
  • Capacity planning and system scalability expertise.
  • Effective communicator with both technical and non-technical stakeholders.
  • Self-motivated, proactive, and resourceful.
  • Quality-focused, delivering work to high standards with minimal rework.
  • Continuous learner who shares knowledge and mentors others.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About Purple Drive Technologies LLC