Site Reliability Engineer Jobs in Dallas, TX

1 - 20 of 184 Jobs

Site Reliability Engineer

Federal Soft Systems Inc.

Plano, Texas, USA

Contract

8-10 years overall experience. Hands-on in at least one language: Java (must), Python (3-4 yrs). Hands-on experience with automated testing tools (JMeter, JUnit, Mockito, Postman). Hands-on experience with a source code management system like Git or SVN, including pull, push, branch, commit and merge functions. Hands-on experience creating, configuring and maintaining cloud-based applications and infrastructure for the rapid development and monitoring of applications and services: AWS, EC2, Fargate, C

Site Reliability Engineer

Innova Solutions, Inc

Dallas, Texas, USA

Full-time

Innova Solutions has a client that is immediately hiring for a Site Reliability Engineer. Position Type: Full-time (Contract-W2). Duration: 12+ Months. Location: Dallas, TX (Hybrid). As a Site Reliability Engineer, you will: Design, develop, and optimize distributed caching and compute grid solutions on Kubernetes/OpenShift. Understanding of microservices and containerized workloads using Kubernetes, Docker, and Helm. Implement high-throughput compute grid solutions using Apache Ignite, GridGain,

W2 Only and SRE / Observability Specialist

GSS Infotech

Dallas, Texas, USA

Contract

SRE / Observability Specialist. Chandler, AZ; Dallas, TX. W2 candidates only. SRE: 7+ years of exp; observability; Cisco, AppDynamics, Python, Ansible & Terraform; AI/ML part, gen AI stuff; AppDynamics / Splunk, Grafana; Python is a plus (but looks like a mandate); OpenShift. Description: Lead complex technology initiatives including those that are companywide with broad impact. Act as a key participant in developing standards and companywide best practices for engineering complex and large-scale technology solut

SRE Support Engineer / Production support -10+ years

ClifyX

Dallas, Texas, USA

Full-time

ClifyX group is an award-winning IT consultancy formed in 1998. Our mission is to provide our clients with optimal technology solutions that are effective and within budget. We specialize in helping organizations review their strategic SOW projects/talent needs and implement high-value, cost-effective solutions to increase profitability and efficiency. Our consulting capabilities include expertise in Cloud, Artificial Intelligence, Data Analytics and compliance aspects of Cyber Security d

Support / SRE Lead – Digital Platforms (Web & Mobile)

Optimize Search Group

Irving, Texas, USA

Contract

Job Title: Support / Site Reliability Engineering (SRE) Lead – Digital Platforms (Web & Mobile). Location: Irving, TX (Hybrid, Monday - Wednesday On-Site). Duration: Contract with option to hire. Position Summary: We're looking for a dynamic and experienced Support / SRE Lead to oversee the stability, performance, and operational excellence of our digital platforms, spanning both web and mobile applications. This role is ideal for a hands-on leader with a deep technical foundation, a passion for reliable

Site Reliability Engineer

Photon

Remote or Mexico City, Mexico City, Mexico

Full-time

About the Role: We are seeking a highly skilled and passionate Site Reliability Engineer (SRE) with deep expertise in Azure Cloud to join our dynamic engineering team. In this role, you will be responsible for ensuring the reliability, availability, and performance of our critical applications and infrastructure hosted on Microsoft Azure. You will leverage your technical expertise and problem-solving skills to build and maintain scalable, resilient, and automated systems. Responsibilities: Relia

Senior Dev Operations Engineer SRE

Buxton Consulting

Remote

Contract

Senior Dev Operations Engineer SRE. Remote (Pleasanton, CA). 12+ Months. Top 3 Must Haves: Experience setting up alerts / alarms / notifications in AWS cloud: CloudWatch / Dynatrace. Experience with AWS solutions using AWS services including Kafka, ECS, EKS. Experience with IaC (Infrastructure as Code): CDK or Terraform. Thanks and Regards, Ajeet Singh, Buxton Consulting, 2010 Crow Canyon Place, STE 100, San Ramon, CA 94583

SRE Consultant with Java Development

System Soft Technologies

Remote

Contract

System Soft Technologies is widely recognized for its professionalism, strong corporate morals, customer satisfaction, and effective business practices. We provide a full spectrum of business and IT services and solutions, including custom application development, enterprise solutions, systems integration, mobility solutions, and business information management. System Soft Technologies combines business domain knowledge with industry-specific practices and methodologies to offer unique solution

Site Reliability Engineer

Ranger Technical Resources

Remote

Full-time

Site Reliability Engineer #2493. Position Summary: Our partner, an innovative PaaS company specializing in remote monitoring and network management solutions, is looking for a Site Reliability Engineer to help ensure the reliability, scalability, and performance of critical infrastructure and applications. In this role, you'll build and maintain highly available systems, support and optimize CI/CD pipelines, and determine optimal solutions for the company's products. You'll collaborate closely wi

Site Reliability Engineer

Iceberg

Remote

Full-time

Some roles are about keeping the lights on. This isn't one of them. This is about stepping into a high-growth SaaS company serving some of the most security-conscious industries in the world - financial services, healthcare, and insurance - and helping build the backbone of a mission-critical DevSecOps platform. I'm looking for a Senior Site Reliability / DevSecOps Engineer who knows what it takes to build secure, scalable, and resilient systems - someone who thrives where development, security,

Site Reliability Engineer

Splunk Inc.

Remote or San Jose, California, USA

Full-time

Description: Splunk, a Cisco company, is building a safer and more resilient digital world with an end-to-end full stack platform made for a hybrid, multi-cloud world. Leading enterprises use our unified security and observability platform to keep their digital systems secure and reliable. Our customers love our technology, but it's our caring employees that make Splunk stand out as an amazing career destination. No matter where in the world or what level of the organization, we approach our wor

SRE Consultant - Remote

Mudrasys

Remote

Contract, Third Party

Position: SRE Consultant. Location: Remote. Duration: 6-12 months. Job Description: Self-Healing/Automated Repair framework to automatically repair Batch Abnormal Ends due to non-ASCII values in Demographics data (Manual Fix 60 minutes, Automated Repair 2 minutes). Self-Healing for Consumer Alerting system preventing service blackout (Manual Fix 135 minutes, Automated Repair 2 minutes). Create Observability Aggregator framework and streamed Batch metrics into AppDynamics to show Real-Time Batch Ex

Senior Java Developer with Devops/ SRE

Byteware Inc.

Remote

Contract

Senior / Lead Java Developer with DevOps / SRE. Remote. Required Skills: SRE with 10+ years of experience and a software development background in Java / Spring Boot, Python, GitHub, AWS (EMR, EC2, Postgres) and Grafana. Experience with all pillars of SRE: Observability Eng., Chaos Eng., Ops-Automation, Incident-Remediation-Automation, Alerting and Notification.

Automation Developer / Site Reliability Engineer

Princeton IT Services

MX

Contract

Job Title: Platform SRE Automation Developer / Site Reliability Engineer. Job Location: Remote in Mexico. Job Type: Full-time contract. Job Summary: This team's engineers support the growing consumer credit card business. The platform is built on a microservice architecture on a modern technology stack hosted in AWS public cloud and uses state-of-the-art development practices and tooling for SDLC, with observability tools such as Datadog, Prometheus, Splunk, etc. Our engineers are responsible

Site Reliability Engineer

Leidos

Remote

Full-time

Come put your Site Reliability Engineer (SRE) skills into action! Leidos has openings for talented SREs to join our team and develop reusable solutions that support our customers in any environment. You will have the opportunity to contribute to the design and implementation of Continuous Integration and Continuous Delivery (CI/CD) pipelines that accelerate the secure delivery of software to production. You will automate the buildout of infrastructure in cloud and on-premises environments to ope

DevOps/SRE Engineer

Judge Group, Inc.

Irving, Texas, USA

Full-time

Location: Irving, TX. Salary: $67.00 USD Hourly - $72.00 USD Hourly. Job Description: Site Reliability and Operations Engineer (SRE). Location: Irving, TX. About the Role: As a Site Reliability and Operations Engineer (SRE) at our company, you will consult on complex initiatives with broad impact and large-scale planning for Systems Operations Engineering. You will review and analyze multifaceted, larger-scale, or longer-term Systems Operations Engineering challenges that require

Associate Site Reliability Engineer

S&P Consultants

Dallas, Texas, USA

Full-time

About the Role: Grade Level (for internal use): 08. The Team: Our SRE team is at the forefront of driving best practices and implementing new solutions across the organization for observability, reliability and stability within our products. We're a tight-knit team that values collaboration, always looking for ways to improve and solve problems before they happen. Responsibilities and Impact: Learn and apply reliability best practices: Understand and apply SRE principles to ensure systems are

Site Reliability and operations Engineer (SRE)

Genesis10

Irving, Texas, USA

Full-time

Genesis10 is currently seeking a Site Reliability and Operations Engineer (SRE) with our client in the financial industry located in Irving, TX. This is a 12+ month contract position. Responsibilities: Design, develop, and optimize distributed caching and compute grid solutions on Kubernetes/OpenShift. Understanding of microservices and containerized workloads using Kubernetes, Docker, and Helm. Implement high-throughput compute grid solutions using IBM Spectrum Symphony, TIBCO GridServer or simi

Associate Site Reliability Engineer

S&P Global

Dallas, Texas, USA

Full-time

About the Role: Grade Level (for internal use): 08. The Team: Our SRE team is at the forefront of driving best practices and implementing new solutions across the organization for observability, reliability and stability within our products. We're a tight-knit team that values collaboration, always looking for ways to improve and solve problems before they happen. Responsibilities and Impact: Learn and apply reliability best practices: Understand and apply SRE principles to ensure systems ar

Site Reliability Engineer TS Clearance

Connexions Data Inc

Remote

Full-time

Site Reliability Engineer. Start: Immediate. Location: Remote. Type: Full-Time Hire. Top Secret Clearance with SCI eligibility. Objectives of this role: Run the production environment by monitoring availability and taking a holistic view of system health. Build software and systems to manage platform infrastructure and applications. Improve reliability, quality, and time-to-market of our suite of software solutions. Measure and optimize system performance, with an eye toward pushing our capabilities fo