Job#: 3026106 Job Description: Client:Financial ServicesTeam: Platform Engineering / SRE (Harness CD)
Job Title: Systems Operations Engineer 4 - Harness CD / SRE
Location: Zone 2 - Approved Sites:
- USA-TX-IRVING - 401 Las Colinas Blvd W, Bldg A-111432
- 2222 W Rose Garden Ln, Phoenix, AZ 85027
- Charlotte, NC 28202
Contract Length: 12 Months (Conversion to FTE after 12 months)
Work Model: Hybrid (RTO 3 Days Onsite)
Pay Rate: $61 - $65
Top Requirements:5-7+ years in DevOps / SRE / Platform / Cloud EngineeringHands-on experience with Harness CD (enterprise operations and integrations)Strong experience with Kubernetes / OpenShift, Linux, cloud services, and deployment best practicesSolid understanding of CI/CD workflows and release automationSRE concepts: SLIs/SLOs, error budgets, incident response, operational maturity improvementsAutomation & IaC: Python/Bash/PowerShell and Terraform, Ansible, HelmObservability: Prometheus, Grafana, Splunk/ELK, AppDynamics (dashboards, alerts, RCA)Plusses (Preferred Qualifications):- Operating CD platforms at enterprise scale (hundreds of teams, multi-region)
- Experience in Azure and/or Google Cloud Platform, hybrid cloud
- DevSecOps controls, policy enforcement, governance pipelines
- Experience with platform upgrades, migrations, and modernization projects
- Proven contributions to BCP validation, backup verification, resiliency improvements
Job Summary:The Systems Operations Engineer 4 serves as the
Harness CD platform SRE/Owner, responsible for end-to-end reliability, performance, and modernization across non-prod, prod, and BCP environments. The role drives
automation-first operations, implements
observability and alerting, integrates with CI/CD ecosystems (GitHub, Jenkins, Azure DevOps, Kubernetes/OpenShift, cloud providers), and partners with Security to embed
DevSecOps controls. This engineer leads incidents and RCAs, manages SLIs/SLOs/error budgets, and continuously improves scalability, resiliency, and developer experience through hardened pipelines and self-service workflows.
Day-to-Day Responsibilities:Platform Ownership & Reliability (SRE) Operate the Harness CD platform across non-prod, prod, and BCP; maintain
SLIs, SLOs, error budgets, success rates, platform health Lead incident response, troubleshooting, and
RCA for deployment failures, delegate outages, or performance issues
Identify/remediate scaling & capacity constraints across
delegates, pipelines, clusters, and cloud integrationsAutomation & Engineering Excellence Build
automation for provisioning, configuration, scaling, upgrades, and maintenance of Harness components
Implement
IaC using
Terraform, Ansible, Helm; automate delegate lifecycle, cluster onboarding, secret rotation, and pipeline validation
Reduce toil via
resilient, repeatable, self-service workflows
DevOps & CI/CD Integration Maintain/enhance integrations with
GitHub, Jenkins, Azure DevOps, Kubernetes/OpenShift, and cloud providers
Optimize deployment strategies (
blue/green, canary, rolling) for speed and reliability
Embed
DevSecOps controls (policy enforcement, governance pipelines, security checks)
Observability & Monitoring Implement
monitoring, logging, dashboards, and alerting for all Harness components
Use
Splunk, Prometheus, Grafana, AppDynamics to deliver actionable alerts and reduce
MTTD/MTTR Detect/escalate issues (delegate saturation, pipeline slowdowns, API failures, K8s resource constraints)
Modernization & Continuous Improvement Execute
upgrades, hotfixes, patching; evaluate new Harness features & modules
Drive
containerization, cloud-native deployments, multi-cloud expansion Support
BCP readiness and resiliency validation
Technical Leadership Act as
SME for Harness platform operations; produce
architecture docs, runbooks, and standards Mentor and partner with senior engineers to improve patterns and operational excellence
Required Qualifications:- 5-7+ years in DevOps, SRE, Platform, or Cloud Engineering
- Hands-on Harness CD experience
- Strong Kubernetes/OpenShift, Linux, cloud services, deployment best practices
- Solid grasp of CI/CD workflows and release automation
- SRE practices (SLIs/SLOs, error budgets) and operational maturity
- Automation/scripting (Python, Bash, PowerShell)
- IaC with Terraform, Ansible, Helm (or equivalent)
- Observability tools (Prometheus, Grafana, Splunk/ELK, AppDynamics) and full-stack troubleshooting
Job Expectations:- Hybrid work schedule (3 days onsite at an approved location)
- On-call support as needed; flexibility for ad-hoc shifts
- 12-month contract with conversion to FTE target after 12 months
Skills Matrix (optional to paste into Beeline/TAP):- Harness CD - platform administration, pipelines, delegates, integrations
- Kubernetes / OpenShift - cluster ops, workloads, scaling, networking
- CI/CD - GitHub, Jenkins, Azure DevOps; strategies: blue/green, canary, rolling
- IaC & Automation - Terraform, Ansible, Helm; Python/Bash/PowerShell
- Observability - Prometheus, Grafana, Splunk/ELK, AppDynamics; SLO dashboards
- SRE - SLIs/SLOs, error budgets, incident response, RCA, operational maturity
- Security/DevSecOps - policy enforcement, governance pipelines, secrets mgmt
- BCP/Resiliency - backups, DR, failover testing, capacity planning
EEO Employer
Apex Systems is an equal opportunity employer. We do not discriminate or allow discrimination on the basis of race, color, religion, creed, sex (including pregnancy, childbirth, breastfeeding, or related medical conditions), age, sexual orientation, gender identity, national origin, ancestry, citizenship, genetic information, registered domestic partner status, marital status, disability, status as a crime victim, protected veteran status, political affiliation, union membership, or any other characteristic protected by law. Apex will consider qualified applicants with criminal histories in a manner consistent with the requirements of applicable law. If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation in using our website for a search or application, please contact our Employee Services Department at or .
Apex Systems is a world-class IT services company that serves thousands of clients across the globe. When you join Apex, you become part of a team that values innovation, collaboration, and continuous learning. We offer quality career resources, training, certifications, development opportunities, and a comprehensive benefits package. Our commitment to excellence is reflected in many awards, including ClearlyRated's Best of Staffing in Talent Satisfaction in the United States and Great Place to Work in the United Kingdom and Mexico. Apex uses a virtual recruiter as part of the application process. Click for more details.
Apex Benefits Overview: Apex offers a range of supplemental benefits, including medical, dental, vision, life, disability, and other insurance plans that offer an optional layer of financial protection. We offer an ESPP (employee stock purchase program) and a 401K program which allows you to contribute typically within 30 days of starting, with a company match after 12 months of tenure. Apex also offers a HSA (Health Savings Account on the HDHP plan), a SupportLinc Employee Assistance Program (EAP) with up to 8 free counseling sessions, a corporate discount savings program and other discounts. In terms of professional development, Apex hosts an on-demand training program, provides access to certification prep and a library of technical and leadership courses/books/seminars once you have 6+ months of tenure, and certification discounts and other perks to associations that include CompTIA and IIBA. Apex has a dedicated customer service team for our Consultants that can address questions around benefits and other resources, as well as a certified Career Coach. You can access a full list of our benefits, programs, support teams and resources within our 'Welcome Packet' as well, which an Apex team member can provide.