Lead Site Reliability Engineer Jobs in California

Refine Results
1 - 20 of 79 Jobs

AWS EKS Lead (SRE / AWS Elastic Kubernetes Search) | Alameda, CA | Contract

SecureKloud Technologies Inc

Alameda, California, USA

Contract, Third Party

Hi , Greetings from Securekloud We do have opening for our client, Role : AWS EKS Lead Consultant Location : Alameda, CA Duration : Long-Term Contract Job Description : We are seeking a highly experienced AWS EKS Lead Consultant to lead end-to-end cloud native platform design and DevOps automation using Kubernetes and AWS. The ideal candidate will combine technical excellence in cloud infrastructure with leadership, strategic thinking, and hands-on DevOps experience in enterprise-grade envi

Lead Site Reliability Engineer

Centene Corporation

Missouri, USA

Full-time

You could be the one who changes everything for our 28 million members by using technology to improve health outcomes around the world. As a diversified, national organization, Centene's technology professionals have access to competitive benefits including a fresh perspective on workplace flexibility. Position Purpose: We are seeking a highly skilled and experienced M365 Lead Site Reliability Engineer to join our team. The ideal candidate will be responsible for developing and creating monitor

Lead Site Reliability Engineer - Remote

UnitedHealth Group

Remote or Minnetonka, Minnesota, USA

Full-time

Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health op

Tech lead Site Reliability Engineer, Edge - USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A104498 Apply to this job Share this listing: Responsibilities Site Reliability Engineering combines software and system engineering with system operations to build and run large-scale, massively distributed infrastructure. Our Edge SREs ensure infrastructure services are reliable, fault-tolerant, efficiently scalable and cost-effective. We dive deep into the stack, including network, hardware, OS, and applications, to quickly resol

Tech Lead, SRE - Recommendation Infrastructure

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A206446 Apply to this job Share this listing: Responsibilities Our Recommendation Infrastructure Team is responsible for building up and optimizing the architecture for our recommendation system to provide the most stable and best experience for our TikTok users. SREs in our team keep the systems up and running with the highest level of availability, and create highly automated systems and pipelines. What You'll Do Engage in and imp

Lead Site Reliability Engineer, Observability - Remote

Cisco Systems, Inc.

Remote

Full-time

Application window is open until further notice. The Meraki cloud supports millions of customer devices from 10 data centers around the world. Meraki's customer base has grown by a factor of 2-3 every year, serving billions of HTTP requests per day globally. Our customers depend on our products to run their critical infrastructure of network switches, security appliances, wireless APs and security cameras. As SREs at Meraki, we are responsible for building and growing the cloud that supports t

Lead Site Reliability Engineer II, Production Engineering

Cisco Systems, Inc.

San Francisco, California, USA

Full-time

Who We Are Cisco ThousandEyes is a Digital Experience Assurance platform that empowers organizations to deliver flawless digital experiences across every network - even the ones they don't own. Powered by AI and an unmatched set of cloud, internet and enterprise network telemetry data, ThousandEyes enables IT teams to proactively detect, diagnose, and remediate issues - before they impact end- user experiences. ThousandEyes is deeply integrated across the entire Cisco technology portfolio and

Senior Lead Site Reliability Engineer - Remote

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Would you enjoy improving stability and safety of one of the largest global networks? \n Would you enjoy hands-on network operations work on a global scale to improve our operational efficiency? \n Join our Platform Security Engineering Team \n The Platform Security Engineering team is a group of engineers that support and secure Akamai's global network and Linode cloud systems. Our systems provide data security, server integrity, network access, and secure communications infrastructure. This is

Site Reliability Engineer. Senior Lead

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Job Title: Site Reliability Engineer. Senior Lead Work Location: 145 Broadway, Cambridge, MA 02142 \n Job Description: \n Akamai Technologies, Inc. is hiring for the following role in Cambridge, MA (multiple openings): Site Reliability Engineer. Senior Lead. Working on analytical projects related to the metadata systems which support fast and reliable configuration of the company's global network. Leading the effort in working closely with the development teams in designing/implementing performa

Azure SRE Architect

Stanley David and Associates

Remote

Full-time

Role :: SRE Architect Location :: Marlborough, MA /Remote Type :: Fulltime Job Description Technical ExpertiseDeep understanding of SRE principles, SRE model, and DevOps methodologies.Experience designing highly available, scalable, and resilient distributed systems.Proficient in architectural design (Microservices, Cloud-native, Event-driven architecture).Skilled in cloud platforms: Azure, Google Cloud Platform.Strong knowledge of observability tools: UIM, Prometheus, Grafana, Datadog, New Re

Tech Lead Machine Learning Ops Engineer, Global SRE

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A181865 Apply to this job Share this listing: Responsibilities MLOps - Global SRE team is responsible for the stability of machine learning systems under the Global Monetization Products and Technology organization, to ensure the stable and efficient operations of machine learning models from data preparation, development, training, deployment, serving and so on. Responsibilities 1) Responsible for setting SLOs of online machine lea

Sr. Site Reliability Engineer (Python)

Ledgent Technology

Irvine, California, USA

Full-time

No Corp-to-Corp, No 3rd party firms . Job Title: Sr Site Reliability Engineer SRE Location: 100% onsite in Irvine, CA Employment Type: Direct-hire Compensation: $140,000 to $180,000 (based on level of experience). . Partnered with a client who is at the forefront of the future innovation hub of next-generation networking, IoT smart home products, and software services. Be a part of a pivotal time in propelling the global ventures. Join their mission in shaping a technology-driven future. Th

SRE Architect

Stanley David and Associates

Remote

Full-time

1. Technical Expertise Deep understanding of SRE principles, SRE model, and DevOps methodologies. Experience designing highly available, scalable, and resilient distributed systems. Proficient in architectural design (Microservices, Cloud-native, Event-driven architecture). Skilled in cloud platforms: Azure, Google Cloud Platform. Strong knowledge of observability tools: UIM, Prometheus, Grafana, Datadog, New Relic, Splunk, AppDynamics. 2. Framework Design & Governance Define and validate SLOs,

Senior Dev Operations Engineer SRE

Buxton Consulting

Remote

Contract

Senior Dev Operations Engineer SRE Remote (Pleasanton, CA) 12+ Months Top 3 Must Haves Experience setting up alerts / alarms / notifications in AWS cloud. CloudWatch / Dynatrace Experience with AWS solutions using AWS services including Kafka, ECS, EKS. Experience with IaC (Infrastructure as code) CDK or Terraform. Thanks and Regards, Ajeet Singh Buxton Consulting 2010 Crow Canyon Place STE 100 San Ramon, CA 94583 Direct: Email:

Sr. DevOps/Site Reliability Engineer (SRE)

JKV International

Mountain View, California, USA

Contract

Job Title: Sr. DevOps/Site Reliability Engineer (SRE)Location: Mountain View, CA (Onsite)Position Type: Fulltime | Independent | H1B TransferInterview Process: Final In-Person (F2F) Interview Required About the Role:We are looking for a passionate and experienced Sr. DevOps/Site Reliability Engineer (SRE) to join our dynamic Platform Engineering team. You will work on cutting-edge cloud platforms like Azure, AWS, or Google Cloud Platform, leveraging state-of-the-art CI/CD tools to support modern

Senior Site Reliability Engineer

FIS

Georgia, USA

Full-time

Job Description Are you curious, motivated, and forward-thinking? At FIS you'll have the opportunity to work on some of the most challenging and relevant issues in financial services and technology. Our talented people empower us, and we believe in being part of a team that is open, collaborative, entrepreneurial, passionate and above all fun. FIS is hiring a SRE for our innovative Platform Service Delivery team As an SRE team member, you will participate in all of the day to day activities o

DevOps Engineer

AbleForce

Poway, California, USA

Contract

Please, no third parties. Permanent residents only. This position will be three (3) days per week onsite in Poway, CA, and there is no relocation assistance available. Main Duties & Responsibilities: - Administer and enhance a primarily Azure & Windows-based environment with limited Linux system management, ensuring optimal performance, reliability, and security. - Configure and maintain application hosting and delivery using IIS and Nginx reverse proxy. - Develop and support CI/CD pipelines uti

Senior Site Reliability Engineer

Salesforce

San Francisco, California, USA

Full-time

To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts. Job Category Software Engineering Job Details About Salesforce We're Salesforce, the Customer Company, inspiring the future of business with AI+ Data +CRM. Leading with our core values, we help companies across every industry blaze new trails and connect with customers in a whole new way. And, we empower you to be a Trailblazer, too - driving you

Senior Site Reliability Engineer

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Do you like collaborating across teams to solve complex problems? Do you enjoy solving large scale distributed content delivery challenges? Join our Compute Site Reliability team! Our team is responsible for monitoring and measuring the reliability of our suite of products and platforms. In collaboration with Engineering Product teams, we focus on improving performance and reliability of products we support Partner with the best As a Senior Site Reliability Engineer II, you enhance Akamai's

(USA) Staff, Site Reliability Engineer

Walmart Inc.

Remote or Denver, Colorado, USA

Full-time

Position Summary Are you ready to lead and innovate in the world of cloud databases? Join Walmart/VIZIO as a Staff, Site Reliability Engineer and drive the future of database reliability and scalability. With your expertise, you'll ensure seamless operations and cutting-edge solutions, making a significant impact on our technology landscape. What you'll do About the Team: The Database Reliability Engineering Team is dedicated to ensuring the dependability and performance of our database infras