Job#: 3016819
Job Description: For direct inquiries, please send resumes to
FULLY REMOTE POSITION
Overview:
Apex Systems is seeking an experienced AWS Cloud Site Reliability Engineer (SRE) to join a high-performing team supporting critical cloud infrastructure within a secure federal environment. The ideal candidate will have deep AWS expertise, strong Infrastructure-as-Code (IaC) skills, and a passion for automation, observability, and continuous improvement. This role will collaborate closely with development, QA, and operations teams to ensure reliable, scalable, and efficient cloud operations.
Key Responsibilities:
- Design, implement, and manage Infrastructure-as-Code (IaC) using tools such as AWS CloudFormation, Terraform, or Helm.
- Automate deployment, scaling, and configuration of cloud resources.
- Develop and maintain CI/CD pipelines (AWS CI/CD, GitLab CI/CD, Jenkins).
- Implement robust monitoring and alerting solutions using CloudWatch, Datadog, Prometheus, Grafana, Dynatrace, or similar tools.
- Analyze logs, metrics, and system performance to proactively resolve issues and optimize reliability.
- Support incident response, participate in on-call rotations, and conduct post-incident reviews.
- Ensure AWS environments meet security standards and compliance requirements.
- Coordinate release planning and communication between development, QA, and operations.
- Create and submit change records and participate in Technical Change Advisory/Review Boards as required.
- Continuously evaluate and improve release processes, tooling, and cloud infrastructure.
- Collaborate with QA teams to validate releases and support quality assurance practices.
Qualifications
Education & Experience:
- Bachelor's degree and 5+ years of relevant experience
- OR 9 years of experience in lieu of a degree.
- Proven experience as a Site Reliability Engineer or similar cloud operations role.
Technical Skills:
- Expertise with AWS services and cloud architecture.
- Advanced programming/scripting skills in at least three of the following: Python, Ansible, Helm, Playwright, Bash, JavaScript, Terraform, Java.
- Strong understanding of DevOps principles and CI/CD pipelines.
- Hands-on experience with Terraform, CloudFormation, Helm, or similar IaC tools.
- Experience creating configuration standards and automating workflows with Ansible.
- Proficiency with GitLab, AWS CodeCommit, or SVN and modern branching strategies.
- Experience with containers and orchestration tools: ECS, EKS, Docker, Kubernetes.
- Familiarity with monitoring/logging: Datadog, CloudWatch, Prometheus, Grafana, Dynatrace.
- Understanding of Agile methodologies and release management practices.
Soft Skills:
- Strong verbal and written communication skills.
- Excellent problem-solving and troubleshooting abilities.
- Ability to collaborate across teams and manage competing priorities.
Additional Requirements:
- Must be able to obtain and maintain a 6C Public Trust clearance.
- No dual citizenship.
Preferred Qualifications:
- Relevant DevOps/SRE certifications.
- Existing High Risk Public Trust or Secret clearance.
- 3+ years supporting highly available, mission-critical platforms.
- Experience managing distributed container platforms (capacity, provisioning, workload management).
- Experience leading major incident response across multiple vendors using tools such as Datadog and ServiceNow.
EEO Employer
Apex Systems is an equal opportunity employer. We do not discriminate or allow discrimination on the basis of race, color, religion, creed, sex (including pregnancy, childbirth, breastfeeding, or related medical conditions), age, sexual orientation, gender identity, national origin, ancestry, citizenship, genetic information, registered domestic partner status, marital status, disability, status as a crime victim, protected veteran status, political affiliation, union membership, or any other characteristic protected by law. Apex will consider qualified applicants with criminal histories in a manner consistent with the requirements of applicable law. If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation in using our website for a search or application, please contact our Employee Services Department.
Apex Systems is a world-class IT services company that serves thousands of clients across the globe. When you join Apex, you become part of a team that values innovation, collaboration, and continuous learning. We offer quality career resources, training, certifications, development opportunities, and a comprehensive benefits package. Our commitment to excellence is reflected in many awards, including ClearlyRated's Best of Staffing in Talent Satisfaction in the United States and Great Place to Work in the United Kingdom and Mexico. Apex uses a virtual recruiter as part of the application process.
Apex Benefits Overview: Apex offers a range of supplemental benefits, including medical, dental, vision, life, disability, and other insurance plans that offer an optional layer of financial protection. We offer an ESPP (employee stock purchase program) and a 401(k) program, which typically allows you to begin contributing within 30 days of starting, with a company match after 12 months of tenure. Apex also offers an HSA (Health Savings Account on the HDHP plan), a SupportLinc Employee Assistance Program (EAP) with up to 8 free counseling sessions, a corporate discount savings program, and other discounts. For professional development, Apex hosts an on-demand training program and, once you have 6+ months of tenure, provides access to certification prep and a library of technical and leadership courses, books, and seminars, as well as certification discounts and other perks with associations that include CompTIA and IIBA. Apex has a dedicated customer service team for our Consultants that can address questions about benefits and other resources, as well as a certified Career Coach. A full list of our benefits, programs, support teams, and resources is also available in our 'Welcome Packet', which an Apex team member can provide.