RESPONSIBILITIES:
Kforce's client in Greenwood Village, CO is seeking a Site Reliability Engineer (SRE) to strengthen the reliability, scalability, and performance of enterprise systems and applications. This role bridges software engineering and infrastructure operations, focusing on automation, observability, and continuous improvement. The ideal candidate brings hands-on experience with monitoring platforms such as Splunk and Datadog, strong scripting capabilities in Python, and practical expertise with YAML-based configurations and containerization using Docker.
Responsibilities:
* Design, implement, and maintain monitoring, alerting, and observability solutions using Splunk and/or Datadog
* Develop and maintain automation scripts and tools using Python to improve operational efficiency and reduce manual intervention
* Create and manage infrastructure and application configurations using YAML (e.g., CI/CD pipelines, Kubernetes manifests, configuration management)
* Build, deploy, and manage containerized applications using Docker
* Define and enforce SLAs, SLOs, and SLIs to ensure high system reliability and availability
* Perform root cause analysis for production incidents and drive permanent resolutions
* Collaborate with development and infrastructure teams to improve system resiliency, scalability, and performance
* Implement proactive monitoring strategies to detect and prevent issues before customer impact
* Document processes, runbooks, and operational standards
REQUIREMENTS:
* 4+ years of experience in Site Reliability Engineering, DevOps, or related production support roles
* Hands-on experience with monitoring and observability tools such as Splunk and/or Datadog
* Strong scripting/programming skills in Python
* Experience working with YAML for configuration management, CI/CD pipelines, or infrastructure definitions
* Practical experience building and managing Docker containers
* Experience supporting production systems in a cloud or enterprise environment
* Strong troubleshooting skills in distributed systems and application environments
* Experience working in Agile environments
Preferred Skills:
* Experience with Kubernetes and container orchestration
* Familiarity with Infrastructure as Code tools (e.g., Terraform, CloudFormation)
* Experience with CI/CD tools such as Jenkins, GitHub Actions, or GitLab CI
* Knowledge of cloud platforms (AWS, Azure, or Google Cloud Platform)
* Experience defining and implementing reliability metrics (SLIs/SLOs)
The pay range is the lowest to highest compensation we reasonably in good faith believe we would pay at posting for this role. We may ultimately pay more or less than this range. Employee pay is based on factors like relevant education, qualifications, certifications, experience, skills, seniority, location, performance, union contract and business needs. This range may be modified in the future.
We offer comprehensive benefits including medical/dental/vision insurance, HSA, FSA, 401(k), and life, disability & ADD insurance to eligible employees. Salaried personnel receive paid time off. Hourly employees are not eligible for paid time off unless required by law. Hourly employees on a Service Contract Act project are eligible for paid sick leave.
Note: Pay is not considered compensation until it is earned, vested and determinable. The amount and availability of any compensation remains in Kforce's sole discretion unless and until paid and may be modified in its discretion consistent with the law.
This job is not eligible for bonuses, incentives or commissions.
Kforce is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, gender identity, national origin, age, protected veteran status, or disability status.
By clicking ?Apply Today? you agree to receive calls, AI-generated calls, text messages or emails from Kforce and its affiliates, and service providers. Note that if you choose to communicate with Kforce via text messaging the frequency may vary, and message and data rates may apply. Carriers are not liable for delayed or undelivered messages. You will always have the right to cease communicating via text by using key words such as STOP.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
- Dice Id: kforcecx
- Position Id: ITTND2169444
- Posted 1 day ago