Overview
On Site
Compensation information provided in the description
Full Time
Skills
Evaluation
Leadership
Microservices
GRID
Apache Ignite
Load Balancing
Distribution
High Availability
Splunk
Provisioning
Ansible
Software Deployment
Programming Languages
Java
Python
Docker
Continuous Integration
Continuous Delivery
Jenkins
GitHub
Grafana
Computer Networking
Cloud Computing
Training
Military
Kubernetes
Caching
Software Development
Reliability Engineering
Scalability
Innovation
Analytical Skill
Privacy
Marketing
Job Details
Location: Irving, TX
Salary: $96.00 USD Hourly - $103.00 USD Hourly
Description: Senior Site Reliability and Operations Engineer (SRE)
About the Role: As a Senior Site Reliability and Operations Engineer (SRE), you will consult as an expert to develop and influence initiatives and resources for highly complex business and technical needs across Engineering. You will strategize and resolve highly complex and unique challenges requiring in-depth evaluation across multiple areas, delivering solutions that are long-term, large-scale, and require vision, creativity, innovation, and advanced analytical and inductive thinking. You will provide expertise to client senior leadership on innovative Engineering business solutions and strategically engage with client personnel.
Key Responsibilities:
Development & Implementation:
- Design, develop, and optimize distributed caching and compute grid solutions on Kubernetes/OpenShift.
- Understand microservices and containerized workloads using Kubernetes, Docker, and Helm.
- Implement high-throughput compute grid solutions using Apache Ignite, GridGain, Coherence, or similar technologies.
- Optimize application performance by leveraging caching strategies, load balancing, and efficient data distribution.
Site Reliability Engineering (SRE):
- Ensure high availability, scalability, and reliability of distributed systems.
- Implement observability, logging, and monitoring using tools like Splunk, Prometheus, Grafana, ELK, or OpenTelemetry.
- Automate infrastructure provisioning and deployments using Ansible and Helm Charts.
- Understand CI/CD pipelines for seamless software deployment.
- Troubleshoot and resolve incidents related to platform, infrastructure, and distributed caching and compute grids, ensuring minimal downtime.
Required Skills & Qualifications:
- Strong experience in Kubernetes (OpenShift and on-prem/cloud clusters).
- Proficiency in programming languages like Java, Go, or Python.
- Experience with containerization technologies (Docker, Helm, etc.).
- Strong knowledge of CI/CD pipelines (Jenkins, ArgoCD, GitHub Actions).
- Hands-on experience with observability tools (Prometheus, Grafana, Loki, Jaeger).
- Understanding of networking, service meshes (Istio/Linkerd), and security best practices in Kubernetes.
- Experience with multi-cluster and hybrid cloud Kubernetes deployments.
Required Qualifications:
- 7+ years of Engineering experience, or equivalent demonstrated through one or a combination of the following: work or consulting experience, training, military experience, education.
Additional Skills:
- Extensive experience implementing Kubernetes-based distributed caching and compute grid solutions.
- Strong foundation in software development, infrastructure automation, reliability engineering, and large enterprise-scale implementations.
- Ability to design, implement, and maintain high-performance distributed systems, ensuring reliability, scalability, and efficiency.
Join us to leverage your expertise in engineering and reliability to drive innovative solutions and ensure the robustness of our distributed systems. Apply now to be part of a dynamic team that values creativity, innovation, and advanced analytical thinking.
By providing your phone number, you consent to: (1) receive automated text messages and calls from the Judge Group, Inc. and its affiliates (collectively "Judge") to such phone number regarding job opportunities, your job application, and for other related purposes. Message & data rates apply and message frequency may vary. Consistent with Judge's Privacy Policy, information obtained from your consent will not be shared with third parties for marketing/promotional purposes. Reply STOP to opt out of receiving telephone calls and text messages from Judge and HELP for help.
This job and many more are available through The Judge Group. Please apply with us today!
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.