Site Reliability Engineer - Linux, Kubernetes, Terrraform

Hybrid in Chicago, IL, US • Posted 60+ days ago • Updated 8 days ago

Full Time

Hybrid

$150,000 - $160,000/yr

Fitment

Dice Job Match Score™

📋 Comparing job requirements...

Job Details

Skills

Ansible
Apache Kafka
Jenkins
Kubernetes
Docker
Reliability Engineering
Terraform

Summary

NO SPONSORSHIP - NO OPT

Site Reliability Engineering

This is all about Linux, Kubernetes, Docker, Terraform, Jenkins, Ansible, Harness

Observation, logging and Capacity planning

AI is a huge preference

Kafka is a huge preference, but not a must

SELLING POINTS: AWS Cloud application profiling monitoring logging metrics collection analysis AI ops splunk app dynamics data dog stack driver sys dig scripting bash python go messaging kafka rabbit mq active mq kubernetes docker swarm rancher Jenkins travis harness AI power tools linux

Qualifications:

Experience with maintaining and troubleshooting large-scale distributed systems
Experience with Agile / Scrum methodology
Able to succeed in fast-paced environment with frequent changes

Experience managing infrastructure in public cloud environments like AWS (preferred), Azure or Google Cloud Platform
Experience with AIOps and predictive analysis for anomaly detection, forecasting system capacity using monitoring and alerting tools like Splunk, AppDynamics, Datadog, StackDriver, Sysdig, Prometheus or Grafana
Programming/scripting experience in languages like Java, Bash, Python or Go
Experience with Kafka, RabbitMQ, or ActiveMQ
Experience with container orchestration systems like Kubernetes, Mesos, Docker Swarm or Rancher
Experience with using Continuous Integration and Continuous Delivery (CI/CD) tools like Jenkins, Travis, Harness, Appveyor, CodeBuild or CodePipeline
Familiarity with leveraging large language models (LLMs) to automate and optimize SRE workflows. This may include using AI-powered tools to perform tasks such as, writing scripts, summarizing incident reports, or even creating and maintaining AI workloads.
Familiarity with leveraging large language models (LLMs) to automate and optimize SRE workflows. This may include using AI-powered tools to perform tasks such as, writing scripts, summarizing incident reports, data analysis or even creating and maintaining AI workloads.
Basic exposure to Chaos Engineering tools like, Gremlin, Chaos Monkey, Harness Chaos Engineering, or cloud-native fault injection services like AWS FIS.
Minimum of 4+ years of experience in Site Reliability Engineering / DevOps

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: napil006
Position Id: 8828095
Posted 30+ days ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Chicago, Illinois

•

5d ago

Exciting opportunity at one of the fastest growing financial services firms around the world. They offer prime brokerage, clearing and financing across traditional and digital assets, and are now looking to hire world-class engineers to help build on their success. Responsibilities will include: Automate infrastructure and operational workflows using IaC with Terraform and AWS CDK. Develop & optimize CI/CD pipelines to improve software delivery for large-scale distributed systems using Amazon Co

Full-time

Senior Kafka Platform Engineer (Automation & Kubernetes)

Chicago, Illinois

•

Today

We're seeking a seasoned Kafka engineer to design, operate, and scale our event streaming platform. You'll own the Kafka core (brokers, storage, security, observability) and the automation that powers it-building infrastructure-as-code, operators/Helm charts, and CI/CD to enable safe, self-service provisioning. You'll run Kafka on Kubernetes and/or cloud-managed offerings, ensure reliability and performance, and partner with application teams on best practices. What you'll do Architect, deploy,

Full-time

Lead Site Reliability Engineer (SRE)

Illinois

•

18d ago

Lead Site Reliability Engineer (SRE) Do you love building and pioneering in the technology space? Do you enjoy solving complex technical problems in a fast-paced, collaborative, inclusive, and iterative delivery environment? At Capital One, you'll be part of a big group of makers, breakers, doers and disruptors, who love to solve real problems and meet real customer needs. As a Site Reliability Engineer (SRE), you'll tap into your passion for proactively finding and fixing inefficiencies to solv

Full-time

USD 149,800.00 - 171,000.00 per year

Lead Software Engineer - Java, Spring, API, Kubernetes

Chicago, Illinois

•

10d ago

Job Description We have an opportunity to impact your career and provide an adventure where you can push the limits of what's possible. As a Lead Software Engineer at JPMorganChase within the Community & Consumer Bank - Digital Communications team, you are an integral part of an agile team that works to enhance, build, and deliver trusted market-leading technology products in a secure, stable, and scalable way. As a core technical contributor, you are responsible for conducting critical technolo

Full-time

Search all similar jobs

Site Reliability Engineer - Linux, Kubernetes, Terrraform

Dice Job Match Score™

Job Details

Skills

Summary

Similar Jobs