lead site reliability engineer Jobs

Refine Results
181 - 200 of 208 Jobs

Sr Edge Platforms SRE - Term Appointment

Northwestern University

Evanston, Illinois, USA

Full-time

Department: NAISE - NU ANL Inst Sci Eng Salary/Grade: ITS/82 Job Summary: This will be an SRE role with a focus on maintaining and improving operations of the edge fleet, cloud infrastructure, and data pipeline associated with multiple NSF and DOE funded projects. At this time, the projects collectively operate nearly 200 remote edge devices, each running Linux and a local Kubernetes cluster to host user applications. We expect this number to grow by around 300 devices over the next 5 years as p

Sr. Linux Site Reliability Engineer, IT Manufacturing Site Reliability Engineering

Tesla Motors

Fremont, California, USA

Full-time

We are seeking an enthusiastic SRE to join our dynamic IT Manufacturing Site Reliability Engineering (ITMFG-SRE) team at Tesla. Our team is responsible for building and managing an ecosystem of applications and platforms essential to manufacturing. As a Linux SRE, this role requires experience with hardware, software, networking, and automation to implement scalable solutions for manufacturing sites globally. You'll play a key role in maintaining, optimizing and scaling our infrastructure to sup

Site Reliability Engineer, Connected Warfare

Aduril Industries

Seattle, Washington, USA

Full-time

Anduril Industries is a defense technology company with a mission to transform U.S. and allied military capabilities with advanced technology. By bringing the expertise, technology, and business model of the 21st century's most innovative companies to the defense industry, Anduril is changing how military systems are designed, built and sold. Anduril's family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data streams into a realtime, 3D command and c

Site Reliability Engineer, Connected Warfare

Aduril Industries

Costa Mesa, California, USA

Full-time

Anduril Industries is a defense technology company with a mission to transform U.S. and allied military capabilities with advanced technology. By bringing the expertise, technology, and business model of the 21st century's most innovative companies to the defense industry, Anduril is changing how military systems are designed, built and sold. Anduril's family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data streams into a realtime, 3D command and c

Sr. Kubernetes Platform Site Reliability Engineer (Starlink)

SpaceX

Redmond, Washington, USA

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SR. KUBERNETES PLATFORM SITE RELIABILITY ENGINEER (STARLINK) At SpaceX we're leveraging our experience in building rockets and spacecraft to deploy Starlink, the world's most advanced broadband internet system. Starli

Sr. Hardware / Infrastructure Site Reliability Engineer (Starlink)

SpaceX

Redmond, Washington, USA

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SR. HARDWARE / INFRASTRUCTURE SITE RELIABILITY ENGINEER (STARLINK) At SpaceX we're leveraging our experience in building rockets and spacecraft to deploy Starlink, the world's most advanced broadband internet system.

Staff Platform Engineer / SRE / NYC or SF

Motion Recruitment Partners, LLC

New York, New York, USA

Full-time

A fast-growing health-tech company leveraging AI is hiring a Staff Platform Engineer to scale their cloud infrastructure and developer experience. With offices in SF and NYC this hybrid opportunity allows you to work in-office 3 days and remotely 2 days per week. As a Staff Platform Engineer, you'll drive scale, speed, and security across a growing engineering organization. This is a high-impact role focused on building multi-tenant, multi-cloud infrastructure and enabling fast, reliable deploym

Senior System Engineer - Engineering Operations (SRE)

AT&T Inc.

Plano, Texas, USA

Full-time

Job Description: Join AT&T and reimagine the communications and technologies that connect the world. Our Consumer Technology experience team is delivering innovative and reliable technology solutions to power differentiated, simplified customer experiences. Bring your bold ideas and fearless risk-taking to redefine connectivity and transform how the world shares stories and experiences that matter. When you step into a career with AT&T, you won't just imagine the future-you'll create it. As a

Senior Staff Machine Learning Engineer - DevOps/Site Reliability Engineer

ServiceNow, Inc.

Santa Clara, California, USA

Full-time

Company Description It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today - ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500 . Our intelligent cloud-based platform seamlessly connects people, systems, and processes to empower organizations to find smarter, faster, and better ways to work. But thi

Senior Staff Software Engineer, Reliability Engineering

Airbnb

Remote

Full-time

Airbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals in almost every country across the globe. Every day, hosts offer unique stays and experiences that make it possible for guests to connect with communities in a more authentic way. The Community You Will Join: We are a community based on connection and belonging - a community that was born in 2007 when two hosts welc

Senior Machine Learning Ops Engineer, Global SRE

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A04380 Apply to this job Share this listing: Responsibilities MLOps - Global SRE team is responsible for the stability of machine learning systems under the Global Monetization Products and Technology organization, to ensure the stable and efficient operations of machine learning models from data preparation, development, training, deployment, serving and so on. Responsibilities 1) Responsible for setting SLOs of online machine lear

Senior Engineer - Data Warehouse Site Reliability Engineering (SRE) (ship required)

Oracle Corporation

Pleasanton, California, USA

Full-time

Job Description The candidate for this position must qualify the US-Gov requirements - should be a and resident in the US. We are looking for senior engineers with experience in supporting data warehousing products. As a member of the Product development organization, focus will be on working with development teams, providing timely support to customers and identify/implementing process automation, for cloud BI product. BS or higher degree in Computer Science / Engineering or equivalent 3+ y

Lead Software Engineer, Site Reliability

Capital One

Plano, Texas, USA

Full-time

Lead Software Engineer, Site Reliability Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast-paced, collaborative, inclusive and iterative delivery environment? At Capital One, you'll be part of a big group of makers, breakers, doers and disruptors, who love to solve real problems and meet real customer needs. We are seeking DevOps Engineers who are passionate about marrying data with emerging technologies to join our team. As a

Lead Software Engineer, Site Reliability

Capital One

New York, New York, USA

Full-time

Lead Software Engineer, Site Reliability Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast-paced, collaborative, inclusive and iterative delivery environment? At Capital One, you'll be part of a big group of makers, breakers, doers and disruptors, who love to solve real problems and meet real customer needs. We are seeking DevOps Engineers who are passionate about marrying data with emerging technologies to join our team. As a

Lead Software Engineer, Site Reliability

Capital One

Richmond, Virginia, USA

Full-time

Lead Software Engineer, Site Reliability Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast-paced, collaborative, inclusive and iterative delivery environment? At Capital One, you'll be part of a big group of makers, breakers, doers and disruptors, who love to solve real problems and meet real customer needs. We are seeking DevOps Engineers who are passionate about marrying data with emerging technologies to join our team. As a

Lead Software Engineer, Site Reliability

Capital One

McLean, Virginia, USA

Full-time

Lead Software Engineer, Site Reliability Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast-paced, collaborative, inclusive and iterative delivery environment? At Capital One, you'll be part of a big group of makers, breakers, doers and disruptors, who love to solve real problems and meet real customer needs. We are seeking DevOps Engineers who are passionate about marrying data with emerging technologies to join our team. As a

Senior Engineering Program Manager, iCloud SRE, Apple Services Engineering

Apple, Inc.

No location provided

Full-time

The Apple Services Engineering team is one of the most exciting examples of Apple's long-held passion for combining art and technology. These are the people who power the App Store, Apple TV, Apple Music, Apple Podcasts, and Apple Books. And they do it on an extensive scale, meeting Apple's high expectations with dedication to deliver a huge variety of entertainment in over 35 languages to more than 150 countries. Our Program Managers partner with engineers who build secure, end-to-end solutions

COMSEC Account Manager

Chenega MIOS

Oakton, Virginia, USA

Full-time

Req ID: 36914 Summary COMSEC Account Manager Oakton, VA Are you ready to enhance your skills and build your career in a rapidly evolving business climate? Are you looking for a career where professional development is embedded in your employer's core culture? If so, Chenega Military, Intelligence & Operations Support (MIOS) could be the place for you! Join our team of professionals who support large-scale government operations by leveraging cutting-edge technology and take your career to the

Senior, Software Engineer

Walmart Inc.

Remote or Bentonville, Arkansas, USA

Full-time

Position Summary What you'll do As a Senior Software Engineer, you will be part of the Replenishment within Walmart Global Tech. You'll develop technologies, products, or services that could revolutionize the way buyers and suppliers communicate and handle the product inventory at stores. This could include developing new methods of tracking inventory, forecasting or creating more efficient ways to manage fulfillment. You'll support and build applications and processes to help better experienc

Group Director, Software Engineering

Walmart Inc.

Bentonville, Arkansas, USA

Full-time

Position Summary What you'll do AEDT - Associate Experience and Digital Transformation The AEDT Technology team is part of Walmart Global Tech. We are a group of engineers and software leaders. We build technology to connect and enable Walmart associates to accomplish their job functions with ease. We use cutting-edge technology, with a focus on AI. We are continually reinventing and improving processes and productivity software to enable enhanced associate productivity. What you'll do: The