Lead Platform Site Reliability Engineer

Red Hat Linux, Ansible, Python
Contract W2, Contract Corp-To-Corp, 12 Months
Up to $90
Work from home available Travel not required

Job Description

LEAD PLATFORM SITE RELIABILITY ENGINEER (SRE) RHEL, Ansible and Python, Cloud Native Architecture


ALTA IT Services has a 12 month ++ contract opening for a Lead Platform Site Reliability Engineer with strong Red Hat Linux, Ansible and Python experience to support a leading, Washington DC-based health insurance customer.

Candidates must be highly engaged with a proven technical record related to enterprise-scale Cloud transformations.

Work is being conducted remotely for the remainder of COVID safety measures with eventual return to onsite work in Washington DC once pandemic conditions have lifted.


  • 10+ years of overall experience in IT including, with a hands-on development and systems engineering background
  • 3-5 years of experience in a Site Reliability Engineering role
  • Experience with Enterprise Cloud transformation efforts
  • Experience with SRE principles and transformation
  • 3+ years of experience with implementation of Containerization (Kubernetes), Cloud technologies (AWS, Azure, or Google, etc.), DevOps tool chain (Ansible, Jenkins, Artifactory, BitBucket, etc.), and technical patterns (IaC, Automated Provisioning/Release, CI/CD, etc.)
  • Solid understanding of Software coding techniques and experience with full spectrum of Software engineering (Build, Integration, Test, Releasing and Deployment) leveraging Python.
  • Experience in Developing and/or challenging engineering solutions/practices and collaborating with peers within and outside of immediate team, including customers (Dev, Architects, Engineers)



  • Communicates Architectural decisions, plans, goals, and strategies, while highlighting short-term trade-offs vs. long-term commitments and costs
  • Engage in and improve the end to end Lifecycle of services, starting from Inception & design, deployment, operations, and refinement
  • Support activities, including system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews
  • Systems Scalability and sustainability leveraging automation and strive to improve our systems with changes that improve reliability and velocity
  • Experience with Enterprise Cloud transformation efforts
  • Be part of the Cloud journey and as a member of the team to help lead Software automation and reliability for the platform
  • Actively participate and help guide customers on using Cloud-native design and architecture patterns.
  • Provide consultation on technology infrastructure planning and engineering for assigned systems; Assesses the implications of technology strategies on infrastructure capabilities
  • Build strategy to migrate Legacy applications by conversion to multiple Microservices and hosting on AWS Cloud platform.
  • Leverage Cloud-native architecture components including Containers, immutable infrastructure, Microservices, Service Mesh etc., to build highly available and Fault tolerant applications.
  • Research on the global technology trends and their applicability to FEPOC products in support of our internal development teams and business.
  • Promotes and ensures Modern application design, applies engineering best practices in the development and operations life cycle and mitigates vulnerabilities.
  • Monitors and manages the Stability, Availability, and Performance of enterprise systems and platforms across IT domains: e.g., Systems, Network, Storage, Security) by analyzing systems to identify problems, trends, and opportunities for improvement.
  • Maintain and continually improve (patch and upgrade) our Cloud-based infrastructure.
  • Makes data-driven recommendations and decisions, and continuously improves the overall efficacy and efficiency of our software delivery capability.
  • Mentoring peers as well as engaging with others across teams and socializing solutions.

Requirements: Strong skills are required in each of the following areas:

  • Development: Experience programming with one or more languages: Python, Java ,Groovy, Go, etc. –
  • IAC Tools for Platform Automation: Strong hands-on skills and experience in at least one: Ansible, Terraform, etc.
  • Containers: Docker or other OCI-certified containers
  • Container Orchestration Platform : Experience with Kubernetes, AWS EKS, Red Hat OpenShift, Platform 9, or VMware Tanzu
  • CNI Plugins: Calico, Flannel, Weave Net etc.
  • Service Mesh: Istio, AWS App Mesh, OpenShift Service Mesh etc.
  • Container Security Tools: Twistlock, Sysdig, Aqua etc.
  • Platform Monitoring, Observability, & Performance Tools: Nginx, New Relic, AppDynamics, DataDog, Thanos, Jaeger, LogDNA, etc.
  • DevOps Tools: Git/BitBucket, Jira, Ansible/Puppet, Jenkins/Circleci/Bamboo, Maven/Artifactory/nexus, etc. – some combination of these tools is needed


  • Understanding of Cloud Native Architecture –this is a given!
  • Linux, Shell scripting, and general admin skills
  • Network, Security, Plugins, & Storage Skills
  • AWS skills: EC2, S3, EBS, EFS, IAM, VPC, Lambda etc. – or related Azure experience if they’re Azure


Nice to have:

  • Cloud and DevOps certifications, e.g., AWS or Azure Solutions Architect and CKA


HOURLY RATE:  $90/Hr. range. Benefits available. C2C OK


For consideration please contact Melissa McNally via


ALTA IT Services, is an established leader in IT Staffing and Services, specializing in Agile Transformation Services, Program & Project Management, Application Development, Cybersecurity, and Data & Advanced Analytics.   We are an equal opportunity/affirmative action employer and considers qualified applicants for employment without regard to race, gender, age, color, religion, disability, veteran status, sexual orientation, or any other factor.

Position Id : MMNet
Originally Posted : 4 years ago
Have a Job? Post it

Similar Positions

Sr.Java Developer - Direct client Hire
  • InfoGravity LLC.
  • Herndon, VA