Reliability Engineer Jobs in San Jose, CA

Refine Results
101 - 120 of 149 Jobs

Site Reliability Engineer

Madison-Davis, LLC

Remote

Contract

Role: Drive the technical implementation of monitoring and alerting strategies across enterprise-scale applications and infrastructure.Collaborate directly with development teams to ensure each new initiative includes the correct telemetry, log tagging, and alert payloads.Act as a liaison to Level 2 and Level 3 support teams to maintain and enhance monitoring dashboards used by the enterprise command center (EMC).Standardize alert formats to ensure consistency with SRE policies and support downs

Site Reliability Engineer only w2

Symphony Corporation

Remote

Contract

Site Reliability Engineer 6 Months Remote only W-2 The client is looking for a site reliability engineer.

Senior Site Reliability Engineer

Randstad Digital

Remote or St. Louis, Missouri, USA

Contract

job summary: Story Behind the Need Who is Resiliency Engineering Enablement? Partner with application and infrastructure teams to define Disaster Recovery (DR) standardsDesign, deploy and manage Tier 1 DR capabilities.Standardize and evangelize DR implementation patternsDefine and evangelize observability and ops excellence standards as related to DRDefine and maintain failover criteriaDefine, maintain and test Technical Recovery Guides (TRG) location: Saint Louis, Missouri job type: Contract

Site Reliability Engineer II

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Are you excited by the opportunity to monitor and produce internet solutions? Would being free to innovate Would being free to innovate excite you? Join our highly skilled security Join our highly skilled Security team Our Team develops and sells Akamai's carrier network security products to fixed and mobile network service providers. We specialize in delivering highly scalable network infrastructure and access-based security products to our customers. We collaborate to enable our customers t

Senior Site Reliability Engineer

Generac Power Systems Inc

Remote or Denver, Colorado, USA

Full-time

We are Generac, a leading energy technology company committed to powering a smarter world. Over the 60 plus years of Generac's history, we've been dedicated to energy innovation. From creating the home standby generator market category, to our current evolution into an energy technology solutions company, we continue to push new boundaries. Over the 60 plus years of Generac's history, we've been dedicated to energy innovation. From creating the home standby generator market category, to our cu

Site Reliability Engineer

General Dynamics

Remote or Aurora, Colorado, USA

Full-time

Basic Qualifications Bachelor's degree in Computer Science, a related field or equivalent experience is required plus a minimum of 5 years of relevant experience; or Master's degree plus 3 years of relevant experience. CLEARANCE REQUIREMENTS: Department of Defense TS/SCI security clearance is required at time of hire. Applicants selected will be subject to a U.S. Government security investigation and must meet eligibility requirements for access to classified information. Due to the nature of

Site Reliability Engineer (Amdocs)

Highbrow

Remote

Full-time

Key Responsibilities Design, build, and maintain scalable, reliable, and secure infrastructure across production and staging environments. Automate operational tasks and processes using code (Python, Go, Bash, etc.). Drive infrastructure as code (IaC) practices using tools like Terraform, Ansible, or similar. Monitor, troubleshoot, and improve system availability, latency, and performance. Collaborate closely with development, QA, and product teams to design scalable system architecture. Conduct

Site Reliability Engineer III

Kforce Technology Staffing

Remote or Boca Raton, Florida, USA

Contract

RESPONSIBILITIES: Kforce has a client that is seeking a Site Reliability Engineer in Boca Raton, FL. Main Responsibilities: * Delivery of resilient application stacks via "Infrastructure as Code" and other DevOps practices * Monitoring and on-going support of critical, high revenue business applications * Diagnosis and resolution of complex system and application issues * Working with diverse technical and non-technical teams, including Development, QA, IT Operations, Customer Operations and P

Principal Site Reliability Engineer

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Do you enjoy collaborating with teams to solve complex challenges? Do you have a passion for cutting edge technologies? Join our Compute Team! Our team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We do this while maintaining Akamai's mission at the forefront of what we do: make life better for billions of people, billions of times a day. Partner with the best As a Principal Site Reliability Engineer in the Virtualizatio

Sr. Site Reliability Engineer, Compute SRE

Roblox

San Mateo, California, USA

Full-time

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences- all created by our global community of developers and creators. At Roblox, we're building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device.We're on a mission to connect a billion people with op

Senior Site Reliability Engineer

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Are you excited by the prospect of working with innovative security products? Do you enjoy creating innovative and strategic solutions to solve complex problems? Join Guardicore (now Akamai Enterprise Security Group) Guardicore (now Akamai Enterprise Security Group!) is changing the way organizations protect their data centers and clouds. Our team boasts some of the most talented and experienced cyber security and data center. We're always looking for new people to inspire us and make us bett

Senior Site Reliability Engineer

McKesson Corporation

Remote or Columbus, Ohio, USA

Full-time

McKesson is an impact-driven, Fortune 10 company that touches virtually every aspect of healthcare. We are known for delivering insights, products, and services that make quality care more accessible and affordable. Here, we focus on the health, happiness, and well-being of you and those we serve - we care. What you do at McKesson matters. We foster a culture where you can grow, make an impact, and are empowered to bring new ideas. Together, we thrive as we shape the future of health for patien

ClickHouse SRE, Data Platform -USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A56552 Apply to this job Share this listing: Responsibilities Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed services and infrastructures. As a site reliability engineer in the data platform area, you will have the opportunity to manage the services and infrastructures in one of the largest data plaforms in the world that directly supports the TikTok a

Site Reliability Engineer - Remote / Telecommute

Cynet Systems

Remote or Cary, North Carolina, USA

Contract

Job Description: Responsibilities: Use "Golden Images" for VM provisioning, ensuring OS patches and updates are tested in dev/test before production rollout. Oversee Active Directory and DNS updates for large-scale or critical deployments. Coordinate system refreshes, disaster recovery (DR) tests, and failovers for mission-critical applications such as Epic. Lead resolution of P1/P2 incidents, managing escalations when Level 2 cannot resolve, and facilitate major incident war rooms. Enforce Azur

Sr. Site Reliability Engineer - W2 only

Nasscomm, Inc.

Remote

Contract

Site Reliability Engineer - At least 6+ years of experience defining and implementing Monitoring solutions - alerts, Telemetry, and instrumentation for on-premises and cloud platforms for large enterprises - Site Reliability Engineer will be playing a key role in building Observability and Resilience capabilities on cloud platform (Azure).Responsibilities of the SRE will be: - Build and configure alerts, tracing, telemetry, and instrumentation required for Infrastructure Monitoring and Applicati

SRE (Linux / Golang Automation)

Bayside Solutions

Remote

Contract

Site Reliability Engineer (Linux / Golang Automation) W2 Contract Salary Range: $124,800 - $145,600 per year Location: Remote Role - PST Job Summary: We require a Site Reliability Engineer with a strong background and experience supporting extensive virtualization and Linux compute platforms. Requirements and Qualifications: Experience automating with Golang Experience with Infrastructure as a Service orchestration tools (OpenStack, CloudStack, etc.) Strong experience supporting Linux and

Sr Site Reliability Engineer - Remote

SitusAMC

Remote

Full-time

SitusAMC is where the best and most passionate people come to transform our client's businesses and their own careers. Whether you're a real estate veteran, a passionate technologist, or looking to get your start, join us as we work together to realize opportunities for everyone, we proudly serve. At SitusAMC, we are looking to match your unique experience with one of our amazing careers, so that we can help you realize your potential and career growth within the Real Estate Industry. If you ar

Site Reliability Engineer - Splunk Cloud Services

Splunk Inc.

Colorado, USA

Full-time

Description Splunk, a Cisco company, is building a safer and more resilient digital world with an end-to-end full stack platform made for a hybrid, multi-cloud world. Leading enterprises use our unified security and observability platform to keep their digital systems secure and reliable. Our customers love our technology, but it's our caring employees that make Splunk stand out as an amazing career destination. No matter where in the world or what level of the organization, we approach our wor

Site Reliability Engineer: Splunk Cloud Services

Splunk Inc.

Colorado, USA

Full-time

Description Splunk, a Cisco company, is building a safer and more resilient digital world with an end-to-end full stack platform made for a hybrid, multi-cloud world. Leading enterprises use our unified security and observability platform to keep their digital systems secure and reliable. Our customers love our technology, but it's our caring employees that make Splunk stand out as an amazing career destination. No matter where in the world or what level of the organization, we approach our wor

Site Reliability Engineer (Temp to Perm)

Leidos

Remote or Brownsville, Texas, USA

Full-time

This position will require up to 75% travel Come put your Site Reliability Engineer (SRE) skills into action! Leidos has openings for talented SREs to join our team and work real-time hands-on fielding challenges and develop reusable solutions that support our customers in any environment. You will have the opportunity to contribute to the design requirements and implementation of improvements that accelerate the secure delivery, implementation, and sustainment of software in the field. You wi