site reliability engineer Jobs in california

Refine Results
1 - 20 of 323 Jobs

Site Reliability Engineer

Aduril Industries

Costa Mesa, California, USA

Full-time

Anduril Industries is a defense technology company with a mission to transform U.S. and allied military capabilities with advanced technology. By bringing the expertise, technology, and business model of the 21st century's most innovative companies to the defense industry, Anduril is changing how military systems are designed, built and sold. Anduril's family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data streams into a realtime, 3D command and c

Site Reliability Engineer

Peritus Inc.

San Jose, California, USA

Full-time

-Experience with Site Reliability Engineering (SRE) concepts and practices -Strong understanding of monitoring/observability tools (e.g., Grafana) -Debugging experience across Java and React-based applications -Strong troubleshooting and incident management skills -Kubernetes not required

Site Reliability Engineer only w2

Symphony Corporation

Remote

Contract

Site Reliability Engineer 6 Months Remote only W-2 The client is looking for a site reliability engineer.

SRE Engineer (L3 Support)

Stanley David and Associates

San Jose, California, USA

Full-time

Role :: SRE Engineer (L3 Support) Location :: San Jose, CA / RTP, NC Type :: Fulltime Job Description Must Have Technical/Functional Skills SRE, NetApp Storage, Linux Certified, Kubernetes Certified, DevOps, Docker, etc.Roles & Responsibilities Experienced Senior SRE working on Kubernetes, On-Premises experienceCandidate should work independently with little guidance from the leads.Experience in working with AWS.Experience in DB technologies in PostGres and MongoDB.Experience in working with th

Site Reliability Engineer

TekVivid

Cupertino, California, USA

Third Party, Contract

Key Qualifications 4+ years of running services in a large scale *nix environment.Understanding of SRE principles and goals along with good Oncall experienceExperience and understanding on Scaling, Capacity Planning and Disaster RecoveryFast learner with excellent analytical problem solving and communication skillsThe ability to design, author, and release code in any language (Python, Java would be a plus)Deep understanding and experience in administration & usage of Apache Druid at scale.Deep

Site Reliability Engineer

Kforce Technology Staffing

Remote or Redwood City, California, USA

Contract

RESPONSIBILITIES: Kforce has a client that is seeking a Reliability Engineer in Redwood City, CA. The client is looking for a consultant who can help our client with the following deliverables: * High-Level Deliverables for Contract Duration with Estimated Sizing * (Small) Remediate all known policy violations within existing Docker build images * Build image: 2 high, 4 medium, 32 low severity issues related to dependencies * Requires knowledge of Docker, tangential comfort with Rust software

Druid SRE - Remote / Telecommute

Cynet Systems

Remote or San Francisco, California, USA

Contract

Job Description: Pay Range: $55hr - $60hr The Technical Lead will be responsible for overseeing and leading projects related to Druid (Deployment, monitoring, and troubleshooting) SRE / Big Data. Skill Requirements: Proficiency in Big Data and Azure Data Bricks for building and managing data pipelines. Strong experience with Druid and Deployment, monitoring and troubleshooting. Bachelor's degree in computer science, Engineering, or related field. Proficiency in Linux system administration, sh

Site Reliability Engineer (Amdocs)

Highbrow

Remote

Full-time

Key Responsibilities Design, build, and maintain scalable, reliable, and secure infrastructure across production and staging environments. Automate operational tasks and processes using code (Python, Go, Bash, etc.). Drive infrastructure as code (IaC) practices using tools like Terraform, Ansible, or similar. Monitor, troubleshoot, and improve system availability, latency, and performance. Collaborate closely with development, QA, and product teams to design scalable system architecture. Conduct

Sr. Site Reliability Engineer - W2 only

Nasscomm, Inc.

Remote

Contract

Site Reliability Engineer - At least 6+ years of experience defining and implementing Monitoring solutions - alerts, Telemetry, and instrumentation for on-premises and cloud platforms for large enterprises - Site Reliability Engineer will be playing a key role in building Observability and Resilience capabilities on cloud platform (Azure).Responsibilities of the SRE will be: - Build and configure alerts, tracing, telemetry, and instrumentation required for Infrastructure Monitoring and Applicati

SRE (Linux / Golang Automation)

Bayside Solutions

Remote

Contract

Site Reliability Engineer (Linux / Golang Automation) W2 Contract Salary Range: $124,800 - $145,600 per year Location: Remote Role - PST Job Summary: We require a Site Reliability Engineer with a strong background and experience supporting extensive virtualization and Linux compute platforms. Requirements and Qualifications: Experience automating with Golang Experience with Infrastructure as a Service orchestration tools (OpenStack, CloudStack, etc.) Strong experience supporting Linux and

Site Reliability Engineer

Zachary Piper Solutions, LLC

Remote

Full-time

Piper Companies is seeking a Remote Site Reliability Engineer to join a leading cybersecurity and cloud consulting firm. The Site Reliability Engineer will play a key role in building and maintaining secure, scalable infrastructure while supporting automation, compliance, and operational excellence across client environments. Responsibilities of the Site Reliability Engineer include: Develop and deploy automation scripts, tooling, and infrastructure to meet client needsManage patching processes

Site Reliability Engineer

Madison-Davis, LLC

Remote

Contract

Role: Drive the technical implementation of monitoring and alerting strategies across enterprise-scale applications and infrastructure.Collaborate directly with development teams to ensure each new initiative includes the correct telemetry, log tagging, and alert payloads.Act as a liaison to Level 2 and Level 3 support teams to maintain and enhance monitoring dashboards used by the enterprise command center (EMC).Standardize alert formats to ensure consistency with SRE policies and support downs

Sr. DevOps/Site Reliability Engineer (SRE)

JKV International

Mountain View, California, USA

Contract

Job Title: Sr. DevOps/Site Reliability Engineer (SRE)Location: Mountain View, CA (Onsite)Position Type: Fulltime | Independent | H1B TransferInterview Process: Final In-Person (F2F) Interview Required About the Role:We are looking for a passionate and experienced Sr. DevOps/Site Reliability Engineer (SRE) to join our dynamic Platform Engineering team. You will work on cutting-edge cloud platforms like Azure, AWS, or Google Cloud Platform, leveraging state-of-the-art CI/CD tools to support modern

Sr. Site Reliability Engineer - U.S. Citizen - This role sits within Optum Serves Technology Product organization

Widescope Consulting and Contracting Services

Remote

Full-time

Job Title: Sr. Site Reliability Engineer Location: Headquarters / Telecommute Classification (HR only): Exempt Non-Exempt Reports To (Title): COO Widescope Consulting and Contracting JOB SUMMARY The statements below are not intended to be all-inclusive of the duties and responsibilities of the position. Based on leadership decisions and business needs, all other duties as assigned will be expected for each position.Grafana Widescope Consulting and Contracting is proud to serve our nation's mi

Site Reliability Engineer

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Do you enjoy collaborating with teams to solve complex challenges? Do you have a passion for cutting edge technologies and tackling system problems? Join our highly skilled Site Reliability team Our team monitors and measures the reliability of our suite of Compute products and platform. In collaboration with Engineering and Product teams, we improve the performance and reliability of the products we support. Partner with the best You will apply statistical data analysis and an understandin

Site Reliability Engineer

Viasat, Inc.

Carlsbad, California, USA

Full-time

About us One team. Global challenges. Infinite opportunities. At Viasat, we're on a mission to deliver connections with the capacity to change the world. For more than 35 years, Viasat has helped shape how consumers, businesses, governments and militaries around the globe communicate. We're looking for people who think big, act fearlessly, and create an inclusive environment that drives positive impact to join our team. What you'll do The Customer Engineering team is a group of highly technica

Site Reliability Engineer

General Dynamics

Remote or Aurora, Colorado, USA

Full-time

Basic Qualifications Bachelor's degree in Computer Science, a related field or equivalent experience is required plus a minimum of 5 years of relevant experience; or Master's degree plus 3 years of relevant experience. CLEARANCE REQUIREMENTS: Department of Defense TS/SCI security clearance is required at time of hire. Applicants selected will be subject to a U.S. Government security investigation and must meet eligibility requirements for access to classified information. Due to the nature of

Site Reliability Engineer

McKesson Corporation

Remote or Columbus, Ohio, USA

Full-time

McKesson is an impact-driven, Fortune 10 company that touches virtually every aspect of healthcare. We are known for delivering insights, products, and services that make quality care more accessible and affordable. Here, we focus on the health, happiness, and well-being of you and those we serve - we care. What you do at McKesson matters. We foster a culture where you can grow, make an impact, and are empowered to bring new ideas. Together, we thrive as we shape the future of health for patien

Site Reliability Engineer

Talent Space, Inc.

Westlake Village, California, USA

Full-time

Talent Space, Inc. is seeking a Site Reliability Engineer for a remote full time opportunity with our Financial Services client! Responsible for ensuring the stability, reliability, and scalability of our production systems. Design and implement solutions that improve system performance, reduce downtime, and automate repetitive tasks. Combining systems engineering and operations engineering, you'll enhance operational processes, monitoring systems, and tooling to provide a seamless experience fo

Site Reliability Engineer

LiveRamp

San Francisco, California, USA

Full-time

LiveRamp is the data collaboration platform of choice for the world's most innovative companies. A groundbreaking leader in consumer privacy, data ethics, and foundational identity, LiveRamp is setting the new standard for building a connected customer view with unmatched clarity and context while protecting precious brand and consumer trust. LiveRamp offers complete flexibility to collaborate wherever data lives to support the widest range of data collaboration use cases-within organizations, b