HPC/AI - Kubernetes Engineer Jobs

Principal Software Engineer, Kubernetes Networking

Roblox

San Mateo, California, USA

Full-time

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators. At Roblox, we're building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We're on a mission to connect a billion people with op

Storage Engineer - New York - W2 - Direct Client - Hybrid

SANS

New York, New York, USA

Third Party, Contract

This position is hybrid. Position Summary: Research Technology Services (RTS) is looking for a Senior Storage Engineer to assist with the implementation and operation of an on-premise Ceph storage cluster. Duties will include configuring services such as the Ceph file system and RADOS object gateway, investigating performance issues, and planning future hardware acquisition to grow the existing cluster. Principal Duties: The University's High Speed Research Network (HSRN) has provisioned an on-pr

MLOps Engineer

IDR, Inc.

Remote

Contract

IDR is seeking an MLOps Engineer to join one of our top clients in the Healthtech-startup space (fully remote). If you are looking for an opportunity to join a large organization and work within an ever-growing team-oriented culture, please apply today! Position Overview: We are looking for an experienced MLOps Engineer to support the development and scaling of ML infrastructure at a cutting-edge, AI-focused startup. This role will focus on automating, monitoring, and optimizing machine learning

Senior Software Engineer, Kubernetes Fleet

Roblox

San Mateo, California, USA

Full-time

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators. At Roblox, we're building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We're on a mission to connect a billion people with op

Founding Backend Engineer

Jobot

San Francisco, California, USA

Full-time

Well-Funded Seed Stage Startup / Generative AI / Hybrid Remote Flexibility / Rust This Jobot Job is hosted by: Caitlyn Hardy Are you a fit? Easy Apply now by clicking the "Apply Now" button and sending us your resume. Salary: $200,000 - $215,000 per year A bit about us: We are a well-funded Seed stage startup that has plans to double our team size in the next 6 months. Our product is public and already generating revenue. This product is unlike anything on the market, created for developers b

Lead AI Data Scientist/Engineer (DoD Focus)

Arch Systems, LLC

Remote

Full-time

Role: Lead AI Scientist/Engineer Company: Arch Systems Location: Remote Employment Type: Full-Time About Arch Systems Arch Systems, LLC is a dynamic and fast-growing technology company specializing in delivering data science, AI/ML solutions, and advanced systems engineering to government and defense customers. We thrive at the forefront of innovation, solving the most complex data challenges with precision, efficiency, and mission impact. Position Overview We are seeking an accomplished Lead

AI/ML Engineer

Avance Consulting

Arlington, Texas, USA

Full-time

Key Responsibilities AI Model Development: Design, develop, and deploy machine learning models and AI solutions to solve complex problems. Data Analysis and Preprocessing: Collect, preprocess, and analyze large datasets to train and evaluate AI models. Performance Monitoring: Monitor the performance of AI models, troubleshoot issues, and optimize algorithms for efficiency and accuracy. Collaboration and Training: Work with cross-functional teams to understand AI requirements and provide training

Senior CloudOps / Kubernetes Engineer

Tyler Technologies, Inc

Herndon, Virginia, USA

Full-time

Description Tyler Technologies is looking for an experienced Cloud and DevOps Engineer to join a team directly supporting Augmented Field Platform. AFO (Augmented Field Operations) is a leading platform for AI-driven Field Services Work in the Public Sector. You will work closely with our development and operations teams to automate processes, monitor system performance, and implement best practices for infrastructure as code. If you have a passion for AWS and want to be part of a collaborative

Staff Machine Learning Engineer, AI Enablement

Airbnb

No location provided

Full-time

Airbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals in almost every country across the globe. Every day, hosts offer unique stays and experiences that make it possible for guests to connect with communities in a more authentic way. The Community You Will Join: As a Staff Software Engineer on the AI Enablement team, you will play a critical role in accelerating the de

Backend Software Engineer, Machine Learning Platform, AI Infrastructure

Tesla Motors

Palo Alto, California, USA

Full-time

As a Software Engineer within the Autopilot AI Infrastructure team, you will work on reinforcing, optimizing, and scaling our infrastructure components supporting AI research activities for Autopilot and the Optimus. At the core of our autonomy capabilities are neural networks that the research team is designing to train on very large amounts of data, across large-scale GPU clusters and our supercomputer Dojo. Robustly training these models at scale and in the shortest amount of time is critica

Senior Member of Technical Staff - AI/ML Infrastructure Engineer 3

Oracle Corporation

No location provided

Full-time

Job Description As an AI/ML Infrastructure Engineer on the GPU Strategic Customers Engineering team, you will play a critical role in designing, implementing, and maintaining the infrastructure that supports our AI and machine learning initiatives. You will work closely with data scientists, software engineers, and IT professionals to ensure that our AI/ML models are deployed efficiently, securely, and at scale. Your expertise will be crucial in optimizing our infrastructure for performance, re

Principal Machine Learning Engineer, AI (FULLY REMOTE IN USA)

Splunk Inc.

Remote or San Jose, California, USA

Full-time

Description Join us as we pursue our disruptive new vision to make machine data accessible, usable and valuable to everyone. We are a company filled with people who are passionate about our product and seek to deliver the best experience for our customers. At Splunk, we're committed to our work, customers, having fun and most importantly to each other's success. Learn more about Splunk careers and how you can become a part of our journey! Principal Machine Learning Engineer (MLE), Artificial I

Fullstack Software Engineer, Machine Learning Platform, AI Infrastructure

Tesla Motors

Palo Alto, California, USA

Full-time

As a Software Engineer within the Autopilot AI Infrastructure team, you will work on reinforcing, optimizing, and scaling our infrastructure components supporting AI research activities for Autopilot and the Optimus. At the core of our autonomy capabilities are neural networks that the research team is designing to train on very large amounts of data, across large-scale GPU clusters and our supercomputer Dojo. Robustly training these models at scale and in the shortest amount of time is critical

Artificial Intelligence Engineer

KLA

Milpitas, California, USA

Full-time

Company Overview KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA invents systems and solutions for the manufacturing of wafers and reticles, integrated circuits, packaging, printed circuit boards and flat panel d

Lead AI/ML Engineer - On-Prem & Cloud AI Infrastructure/Downtown Brooklyn, NY (Onsite)

Suncap Technology

Brooklyn, New York, USA

Contract

Position: Lead AI/ML Engineer - On-Prem & Cloud AI Infrastructure. Location: Onsite in Downtown Brooklyn, NY. Long-term contract. In addition to playing the role of an AI/ML engineer, the candidate will work with the network and DevOps teams to build out the necessary AI/ML infrastructure on-prem and in the cloud. They should know how to do this on their own at a smaller scale. Certifications like the following are nice to have: Azure DevOps Expert, Azure AI Engineer Associate, OCI DevOps Professional, Oracle Machine Lea

Founding Product Engineer (AI/ML)

Jobot

San Francisco, California, USA

Full-time

Well-Backed AI Health SaaS Startup is looking for Full-Stack Product Engineers to join their growing team! This Jobot Job is hosted by: Sydney Weaver Are you a fit? Easy Apply now by clicking the "Apply Now" button and sending us your resume. Salary: $140,000 - $220,000 per year A bit about us: A rapidly growing AI healthcare startup is about to announce its 20M Series A and is on the hunt for product-minded software engineers to help scale a generative AI-powered care platform! Backed by le

Senior Developer Technology Engineer - AI

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

We're currently seeking a Senior Developer Technology Engineer, Artificial Intelligence! Would you enjoy researching parallel algorithms to accelerate AI workloads on advanced computer architectures? Is it rewarding to investigate, find, and eliminate system bottlenecks to achieve the best possible performance of computer hardware? Could you be thrilled about an opportunity to partner with the Developer community, working at the forefront of technology breakthroughs that contribute to the succes

Principal AI/ML Engineer

DBSI Services

Plano, Texas, USA

Full-time

Principal AI/ML Engineer Plano, TX (Onsite) NLP, LLM, RAG, GenAI, Python, 13-15 years Role Description We are seeking a highly skilled and experienced AI/ML Principal Engineer to join our team. The ideal candidate will have a strong background in software development, with extensive experience in leading and developing state-of-the-art solutions. You will be responsible for architecting and implementing AI/ML models and systems, providing analysis and guidance to improve overall performance,

AI System Research and Development Engineer - Frameworks

Snowflake Inc.

Menlo Park, California, USA

Full-time

Build the future of the AI Data Cloud. Join the Snowflake team. We are looking for talented System Developers and Researchers to join the Snowflake AI Research team and contribute to LLM inference and training system development, optimizations, and agentic systems. Our mission is to build the most efficient and scalable generative AI systems. Recent releases from our team include SwiftKV, an advanced inference optimization, and Arctic LLM, one of the largest open-source MoE foundation models.

Senior HPC Infrastructure Engineer

St. Jude Children's Research Hospital

Tennessee, USA

Full-time

Join a cutting-edge team dedicated to pushing the boundaries of high-performance computing (HPC) and artificial intelligence (AI) infrastructure! As a Senior HPC Infrastructure Engineer, you'll play a pivotal role in designing, implementing, and optimizing our state-of-the-art HPC clusters and servers. Your expertise will ensure that our research computing environment excels in scalability, redundancy, and performance. Key Responsibilities: Lead the architecture, design, and implementation of a