site reliability engineer sre Jobs

Refine Results
141 - 160 of 180 Jobs

Lead SRE - Dynatrace Expert

Judge Group, Inc.

Plano, Texas, USA

Full-time

Location: Plano, TX Salary: $170,000.00 USD Hourly - $180,000.00 USD Hourly Description: As the Observability Technical Lead, you'll play a key role in enabling and enhancing the observability capabilities across the Technology organization. You'll be responsible for implementing and managing tools that support logging, tracing, alerting, visualization, and AIOps. You'll help drive operational excellence and system reliability through robust monitoring solutions. Key Responsibilities Design

Tech Lead, SRE - Recommendation Infrastructure

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A206446 Apply to this job Share this listing: Responsibilities Our Recommendation Infrastructure Team is responsible for building up and optimizing the architecture for our recommendation system to provide the most stable and best experience for our TikTok users. SREs in our team keep the systems up and running with the highest level of availability, and create highly automated systems and pipelines. What You'll Do Engage in and imp

Tech Lead Machine Learning Ops Engineer, Global SRE

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A181865 Apply to this job Share this listing: Responsibilities MLOps - Global SRE team is responsible for the stability of machine learning systems under the Global Monetization Products and Technology organization, to ensure the stable and efficient operations of machine learning models from data preparation, development, training, deployment, serving and so on. Responsibilities 1) Responsible for setting SLOs of online machine lea

Senior Machine Learning Ops Engineer, Global SRE

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A04380 Apply to this job Share this listing: Responsibilities MLOps - Global SRE team is responsible for the stability of machine learning systems under the Global Monetization Products and Technology organization, to ensure the stable and efficient operations of machine learning models from data preparation, development, training, deployment, serving and so on. Responsibilities 1) Responsible for setting SLOs of online machine lear

Machine Learning Ops Engineer, Global SRE

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A108064 Apply to this job Share this listing: Responsibilities MLOps - Global SRE team is responsible for the stability of machine learning systems under the Global Monetization Products and Technology organization, to ensure the stable and efficient operations of machine learning models from data preparation, development, training, deployment, serving and so on. Responsibilities 1) Responsible for setting SLOs of online machine lea

Lead Site Reliability Engineer

General Dynamics

Texas, USA

Full-time

Type of Requisition: Regular Clearance Level Must Currently Possess: None Clearance Level Must Be Able to Obtain: None Public Trust/Other Required: Other Job Family: Cloud Job Qualifications: Skills: AWS Devops, Cloud Infrastructure, Cloud Service Automation, Cloud Testing, IT Monitoring Certifications: None Experience: 10 + years of related experience ship Required: No Job Description: GDIT is looking to hire a lead Site Reliability Engineer (SRE) to help take a cloud team to the next l

FRFS Cloud Specialist (AWS)/ SRE

Federal Reserve Bank

Richmond, Virginia, USA

Full-time

Company Federal Reserve Bank of Chicago About Us Federal Reserve Financial Services (FRFS) delivers a suite of payments services to financial institutions via FedLine Solutions, Fedwire , National Settlement Service (NSS), FedNowSM, FedCash , FedACH , and Check Services. We are currently leading a strategic effort to transform FRFS to a national, enterprise-focused organization. Through our evolved structure, we will meet the needs of the marketplace for new products and services more quickl

Software Engineer, SRE III (US Federal)

Workday, Inc.

Boulder, Colorado, USA

Full-time

Your work days are brighter here. At Workday, it all began with a conversation over breakfast. When our founders met at a sunny California diner, they came up with an idea to revolutionize the enterprise software market. And when we began to rise, one thing that really set us apart was our culture. A culture which was driven by our value of putting our people first. And ever since, the happiness, development, and contribution of every Workmate is central to who we are. Our Workmates believe a h

Senior applications solution architect - SRE orchestration

Pepsico

Plano, Texas, USA

Full-time

Overview PepsiCo's Sustain & Operations team, part of the Digital Products and Applications (DPA) organization, delivers and sustains digital products across Strategy and Transformation's core priorities to accelerate PepsiCo's digital transformation. PepsiCo's Digital Transformation requires new business processes, new digital products and new operations outcomes. To drive higher order outcomes for the business, all applications & underlying infrastructure/services independently operating mu

Site Reliability Engineer, AppBank Engineering, Salt Lake City

Goldman Sachs & Co.

Salt Lake City, Utah, USA

Full-time

Job Description What We Do At Goldman Sachs, our Engineers don't just make things - we make things possible. Change the world by connecting people and capital with ideas. Solve the most challenging and pressing engineering problems for our clients. Join our engineering teams that build massively scalable software and systems, architect low latency infrastructure solutions, proactively guard against cyber threats, and leverage machine learning alongside financial engineering to continuously tur

Platform Engineer

InfiCare Technologies

Phoenix, Arizona, USA

Full-time, Contract

Role- Platform Engineer Location- -Phoenix, AZ (1st), Charlotte (Xnd) Mode Of work-onsite from day X Mode for Hire- Contract to hire(WX work) Mandatory Experience: X+ years hands-on admin experience at the platform and application tiers supporting critical Customer Facing applications preferably in the Financial Services Industry X+ years of experience troubleshooting environments across the entire architecture (i.e., applications to infrastructure) X+ years of hands-on Linux administration ex

Lead Site Reliability Engineer

UKG Careers

Lowell, Massachusetts, USA

Full-time

Company Overview With 80,000 customers across 150 countries, UKG is the largest U.S.-based private software company in the world. And we're only getting started. Ready to bring your bold ideas and collaborative mindset to an organization that still has so much more to build and achieve? Read on. At UKG, you get more than just a job. You get to work with purpose. Our team of U Krewers are on a mission to inspire every organization to become a great place to work through our award-winning HR techn

Principal Site Reliability Engineer

UKG Careers

Lowell, Massachusetts, USA

Full-time

Company Overview With 80,000 customers across 150 countries, UKG is the largest U.S.-based private software company in the world. And we're only getting started. Ready to bring your bold ideas and collaborative mindset to an organization that still has so much more to build and achieve? Read on. At UKG, you get more than just a job. You get to work with purpose. Our team of U Krewers are on a mission to inspire every organization to become a great place to work through our award-winning HR techn

Senior Engineering Program Manager, iCloud SRE, Apple Services Engineering

Apple, Inc.

No location provided

Full-time

The Apple Services Engineering team is one of the most exciting examples of Apple's long-held passion for combining art and technology. These are the people who power the App Store, Apple TV, Apple Music, Apple Podcasts, and Apple Books. And they do it on an extensive scale, meeting Apple's high expectations with dedication to deliver a huge variety of entertainment in over 35 languages to more than 150 countries. Our Program Managers partner with engineers who build secure, end-to-end solutions

Messaging SRE Manager, Apple Services Engineering

Apple, Inc.

No location provided

Full-time

People at Apple don't just build products - they craft experiences our customers love and depend on. Apple Services Engineering (ASE) builds and supports the systems that make many of these daily experiences possible. The Messaging SRE team is directly responsible for server reliability for APNs, iMessage, and FaceTime, and other messaging use cases that are critical for millions of our users. We focus on availability and automation of key services that back these experiences every minute of eve

Sr. Site Reliability Engineer, Dojo

Tesla Motors

Palo Alto, California, USA

Full-time

We are seeking an experienced Site Reliability Engineer (SRE) to join our team responsible for ensuring the reliability, performance of our Dojo cluster infrastructure. The successful candidate will be responsible for providing exceptional customer response and support, managing third-party systems, and collaborating with various teams to ensure seamless operations. If you have a passion for troubleshooting, automation, and collaboration, we encourage you to apply. Responsibilities Respond to c

Sr. Site Reliability Engineer, Compute SRE

Roblox

San Mateo, California, USA

Full-time

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences- all created by our global community of developers and creators. At Roblox, we're building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device.We're on a mission to connect a billion people with op

SRE Specialist

Fortinet

Sunnyvale, California, USA

Full-time

Job Description Fortinet has an exciting opportunity for an intermediate SRE Specialist to join our MIS operation team. We are managing consumer-facing services with high traffic volumes around the world. Service Reliability and Security is our top priority. This is a unique opportunity to join an established team of experienced professionals to work on some of the most innovative technology and network security products on the market. Job Responsibilities Linux System Administration: Admini

SRE Specialist - System

Fortinet

Sunnyvale, California, USA

Full-time

Job Description Fortinet has an exciting opportunity for an intermediate SRE Specialist to join our FortiGuard operation team. We are managing consumer-facing services with high traffic volumes around the world. Service Reliability and Security is our top priority. This is a unique opportunity to join an established team of experienced professionals to work on some of the most innovative technology and network security products on the market. Job Responsibilities Linux System Administration:

SRE Specialist - Infrastructure

Fortinet

Sunnyvale, California, USA

Full-time

Job Description Fortinet has an exciting opportunity for an intermediate SRE Specialist to join our FortiGuard operation team. We are managing consumer-facing services with high traffic volumes around the world. Service Reliability and Security is our top priority. This is a unique opportunity to join an established team of experienced professionals to work on some of the most innovative technology and network security products on the market. Job Responsibilities Design, deployment, and manage