Site Reliability Engineer Jobs in Los Angeles, CA

1 - 20 of 152 Jobs

Site Reliability Engineer

iPeople Infosystems LLC

Remote

Contract

Role: Site Reliability Engineer Location: 100% Remote Type: Contract Position Job description: Production support expertise with SRE Observability experience: Proactive issue identification using observability tools. Skills in using different monitoring & observability tools to track system performance. Production support activities including proactive identification of issues leveraging observability tools, correlating inputs from various dashboards & tools to drive resolution. Experience in swiftl

Lead Observability Engineer Sumo Logic & SRE Location: Remote

NeoTech Solutions

US

Third Party, Contract

Role: Lead Observability Engineer Sumo Logic & SRE Location: Remote Hire type: Contract JD: Experience: 10+ years (with 3+ years in Sumo Logic & Cloud-native observability) Job Summary: We are seeking a highly skilled Lead Observability Engineer to lead a critical implementation of Sumo Logic for a client migrating from Dynatrace. This role requires deep expertise in Sumo Logic, Site Reliability Engineering (SRE) practices, and Kubernetes (EKS) observability. The ideal candidate will de

Site Reliability Engineer

Madison-Davis, LLC

Remote

Contract

Role: Drive the technical implementation of monitoring and alerting strategies across enterprise-scale applications and infrastructure. Collaborate directly with development teams to ensure each new initiative includes the correct telemetry, log tagging, and alert payloads. Act as a liaison to Level 2 and Level 3 support teams to maintain and enhance monitoring dashboards used by the enterprise command center (EMC). Standardize alert formats to ensure consistency with SRE policies and support downs

Site Reliability Engineer

Zachary Piper Solutions, LLC

Remote

Full-time

Piper Companies is seeking a Remote Site Reliability Engineer to join a leading cybersecurity and cloud consulting firm. The Site Reliability Engineer will play a key role in building and maintaining secure, scalable infrastructure while supporting automation, compliance, and operational excellence across client environments. Responsibilities of the Site Reliability Engineer include: Develop and deploy automation scripts, tooling, and infrastructure to meet client needs. Manage patching processes

Senior Software Engineer - SRE

Veeva Systems

Los Angeles, California, USA

Full-time

Veeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster. As one of the fastest-growing SaaS companies in history, we surpassed $2B in revenue in our last fiscal year with extensive growth potential ahead. At the heart of Veeva are our values: Do the Right Thing, Customer Success, Employee Success, and Speed. We're not just any public company - we made history in 2021 by becoming a public benefit corporation

SRE / Python Developer

Artech, LLC

Remote

Contract

Summary Our organization builds and provides systems and infrastructure that fuel our core services. We are the foundation on which our software developers build the products that our customers love. We are looking for passionate and dedicated Site Reliability Engineers to continue our focus on providing our customers the highest quality Services experience. Our services have to scale globally, stay highly available, and "just work." If you love designing, engineering and running systems and inf

Site Reliability Engineer

General Dynamics

Remote or Aurora, Colorado, USA

Full-time

Basic Qualifications Bachelor's degree in Computer Science, a related field or equivalent experience is required plus a minimum of 5 years of relevant experience; or Master's degree plus 3 years of relevant experience. CLEARANCE REQUIREMENTS: Department of Defense TS/SCI security clearance is required at time of hire. Applicants selected will be subject to a U.S. Government security investigation and must meet eligibility requirements for access to classified information. Due to the nature of

Site Reliability Engineer

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Do you enjoy collaborating with teams to solve complex challenges? Do you have a passion for cutting edge technologies and tackling system problems? Join our highly skilled Site Reliability team. Our team monitors and measures the reliability of our suite of Compute products and platform. In collaboration with Engineering and Product teams, we improve the performance and reliability of the products we support. Partner with the best: You will apply statistical data analysis and an understandin

Site Reliability Engineer

McKesson Corporation

Remote or Columbus, Ohio, USA

Full-time

McKesson is an impact-driven, Fortune 10 company that touches virtually every aspect of healthcare. We are known for delivering insights, products, and services that make quality care more accessible and affordable. Here, we focus on the health, happiness, and well-being of you and those we serve - we care. What you do at McKesson matters. We foster a culture where you can grow, make an impact, and are empowered to bring new ideas. Together, we thrive as we shape the future of health for patien

Site Reliability Engineer, Hardware and Infrastructure (Starshield)

SpaceX

Hawthorne, California, USA

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SITE RELIABILITY ENGINEER, HARDWARE AND INFRASTRUCTURE (STARSHIELD) At SpaceX we're leveraging our experience in building rockets and spacecraft to deploy the Starshield constellation. Starshield is the world's larges

Site Reliability Engineer, Kubernetes Platform (Starshield)

SpaceX

Hawthorne, California, USA

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SITE RELIABILITY ENGINEER, KUBERNETES PLATFORM (STARSHIELD) At SpaceX we're leveraging our experience in building rockets and spacecraft to deploy the Starshield constellation. Starshield is the world's largest US gov

Senior Dev Operations Engineer SRE

Buxton Consulting

Remote

Contract

Senior Dev Operations Engineer SRE Remote (Pleasanton, CA) 12+ Months Top 3 Must Haves: Experience setting up alerts / alarms / notifications in AWS cloud (CloudWatch / Dynatrace). Experience with AWS solutions using AWS services including Kafka, ECS, EKS. Experience with IaC (Infrastructure as Code): CDK or Terraform. Thanks and Regards, Ajeet Singh, Buxton Consulting, 2010 Crow Canyon Place STE 100, San Ramon, CA 94583

Site Reliability Engineer, GNC (Falcon)

SpaceX

Hawthorne, California, USA

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SITE RELIABILITY ENGINEER, GNC (FALCON) SpaceX is looking for a Site Reliability Engineer, GNC to operate and scale custom-built mission-critical products for Guidance, Navigation and Control (GNC). The GNC team per

Site Reliability Engineer, Eng Support - USDS

TikTok

Los Angeles, California, USA

Full-time

Location: Los Angeles Employment Type: Regular Job Code: A36899A Responsibilities About the Team USDS Tech and Product at TikTok provides core product platforms and services with leading infrastructure and applications. Our Data Exchange System team provides support to various business-critical applications. You'll be part of a critical SRE team managing these applications, which control data based on compliance and security policies. In this role,

Site Reliability Engineer, Data Platform - USDS

TikTok

Los Angeles, California, USA

Full-time

Location: Los Angeles Employment Type: Regular Job Code: 262 Responsibilities Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed services and infrastructures. As a site reliability engineer in the data platform area, you will have the opportunity to manage the services and infrastructures in one of the largest data platforms in the world that directly supports the TikTok a

Site Reliability Engineer, Edge Services - USDS

TikTok

Los Angeles, California, USA

Full-time

Location: Los Angeles Employment Type: Regular Job Code: A187656A Responsibilities Team Insight: CDN Site Reliability Engineering combines software and network engineering with system operations to build and run large-scale, massively distributed infrastructure. Our Edge SREs ensure infrastructure services are reliable, fault-tolerant, efficiently scalable and cost-effective. We dive deep into the stack, including network, OS, and applications, to

Sr. Site Reliability Engineer - U.S. Citizen - This role sits within Optum Serves Technology Product organization

Widescope Consulting and Contracting Services

Remote

Full-time

Job Title: Sr. Site Reliability Engineer Location: Headquarters / Telecommute Classification (HR only): Exempt Non-Exempt Reports To (Title): COO Widescope Consulting and Contracting JOB SUMMARY The statements below are not intended to be all-inclusive of the duties and responsibilities of the position. Based on leadership decisions and business needs, all other duties as assigned will be expected for each position. Grafana Widescope Consulting and Contracting is proud to serve our nation's mi

ClickHouse SRE, Data Platform - USDS

TikTok

Los Angeles, California, USA

Full-time

Location: Los Angeles Employment Type: Regular Job Code: A30614 Responsibilities Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed services and infrastructures. As a site reliability engineer in the data platform area, you will have the opportunity to manage the services and infrastructures in one of the largest data platforms in the world that directly supports the TikTo

Data Ingestion SRE, Data Platform - USDS

TikTok

Los Angeles, California, USA

Full-time

Location: Los Angeles Employment Type: Regular Job Code: A259491 Responsibilities Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed services and infrastructures. As a site reliability engineer in the data platform area, you will have the opportunity to manage the services and infrastructures in one of the largest data platforms in the world that directly supports the TikT

Apigee SRE Automation Lead

Nityo Infotech Corporation

Remote

Contract

Role: Apigee SRE / Automation Lead Remote Contract Job Job Summary: We are seeking a highly skilled and experienced Apigee SRE / Automation Lead to oversee the reliability, scalability, and automation of our API infrastructure. This role demands deep expertise in Apigee platform operations, infrastructure automation, and system reliability engineering. The ideal candidate will have hands-on experience with tools like Terraform, Ansible, DoJo, and scripting, along with a strong background in Lin