Site Reliability Engineer - AI Cloud

SUPERMICRO COMPUTER INC

San Jose, California, USA

Full-time

Job Req ID: 26861 About Supermicro: Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop/Big Data, Hyperscale, HPC and IoT/Embedded customers worldwide. We are the #5 fastest growing company among the Silicon Valley Top 50 technology firms. Our unprecedented global expansion has provided us with the opportunity to offer a large number of new positions to the technology community. We seek talented, pass

Principal Software Developer - AI Infra Compute

Oracle Corporation

No location provided

Full-time

Job Description OCI (Oracle Cloud Infrastructure) AI Infrastructure is at the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI/ML/HPC workloads. This is your chance to be part of the AI revolution, creating systems that allow customers to scale from tens to thousands of GPUs without compromising performance. Our team is responsible for designing and developing fundamental architectural changes for GPU delivery, health monitoring, triage automatio

Senior Cloud Administrator

AMD (Advanced Micro Devices)

San Jose, California, USA

Full-time

WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellenc

Hardware Engineering Internships

Apple, Inc.

No location provided

Full-time

Imagine what you could do here! At Apple, new ideas have a way of becoming extraordinary products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what can be accomplished. Dynamic, smart people and inspiring, innovative technologies are the norm here. The people who work here have reinvented entire industries with Apple products. The same passion for innovation that goes into our products also applies to our practices strengthenin

Strategic Program Manager, Data Center Infrastructure & NVIDIA Deployments

World Wide Technology

No location provided

Full-time

Qualifications Bachelor's degree; advanced degree or MBA preferred. PMP or PgMP, Agile, SAFe, or ITIL certifications a plus. Strong understanding of data center environments including space/power/cabling/fiber and ethernet. Experience managing and leading large multi-site deployments for infrastructure projects scaling at least 1000 sites for external customers. 10+ years in program/project management, with 5+ years leading large, complex, cross-functional IT or infrastructure programs in a Pr

Principal Network Operation Engineer - GNOC (Remote)

Oracle Corporation

Texas, USA

Full-time

Job Description We are seeking skilled and proactive engineers with 6+ years of experience to join our Global Network Operations Center (GNOC) team. The GNOC is our front line for addressing physical network issues and operates 24x7x365 to ensure the reliability and efficiency of our physical network infrastructure. The team is responsible for performing data collection, triage, technical analysis, incident mitigation, and redirection as necessary to maintain and optimize operations. This ons

Consulting Member of Technical Staff - AI/ML Infrastructure Engineer

Oracle Corporation

Santa Clara, California, USA

Full-time

Job Description Design, develop, troubleshoot and debug software programs for databases, applications, tools, networks, etc. As an AI/ML Infrastructure Engineer on the GPU Strategic Customers Engineering team, you will play a critical role in designing, implementing, and maintaining the infrastructure that supports our AI and machine learning initiatives. You will work closely with data scientists, software engineers, and IT professionals to ensure that our AI/ML models are deployed efficiently,

HPC Engineer

American Systems Corporation

Arlington, Virginia, USA

Full-time

Overview AMERICAN SYSTEMS is an employee-owned federal government contractor supporting national priority programs through our strategic solutions in the areas of Information Technology, Test & Evaluation, Program Mission Support, Engineering & Analysis, and Training. Responsibilities THIS POSITION COMES WITH A 10K SIGNING BONUS! As an HPC Engineer with AMERICAN SYSTEMS you will have an opportunity to do the following: Apply comprehensive knowledge of High Performance Computing (HPC) systems

(USA) Senior, Software Engineer (iOS)

Walmart Inc.

Bentonville, Arkansas, USA

Full-time

Position Summary Join Walmart and your work could help over 275 million global customers live better every week. Yes, we are the Fortune #1 company. But you'll quickly find we're a company who wants you to feel comfortable bringing your whole self to work. A career at Walmart is where the world's most complex challenges meet a kinder way of life. Our mission spreads far beyond the walls of our stores. Join us and you'll discover why we are a world leader in belonging, sustainability, and community

HPC Software Engineer (C++/Linux)

KLA

Milpitas, California, USA

Full-time

Company Overview KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA invents systems and solutions for the manufacturing of wafers and reticles, integrated circuits, packaging, printed circuit boards and flat panel d

Backend Software Engineer, Machine Learning Platform, AI Infrastructure

Tesla Motors

Palo Alto, California, USA

Full-time

As a Software Engineer within the Autopilot AI Infrastructure team, you will work on reinforcing, optimizing, and scaling our infrastructure components supporting AI research activities for Autopilot and the Optimus. At the core of our autonomy capabilities are neural networks that the research team is designing to train on very large amounts of data, across large-scale GPU clusters and our supercomputer Dojo. Robustly training these models at scale and in the shortest amount of time is critica

Consulting Member of Technical Staff - AI/ML Infrastructure Engineer

Oracle Corporation

Nashville, Tennessee, USA

Full-time

Job Description As an AI/ML Infrastructure Engineer on the GPU Strategic Customers Engineering team, you will play a critical role in designing, implementing, and maintaining the infrastructure that supports our AI and machine learning initiatives. You will work closely with data scientists, software engineers, and IT professionals to ensure that our AI/ML models are deployed efficiently, securely, and at scale. Your expertise will be crucial in optimizing our infrastructure for per

Data Scientist-Costa Rica

World Wide Technology

No location provided

Full-time

Required Qualifications 4+ years of relevant work experience in data analysis or a related field (e.g., as a statistician / data scientist / scientific researcher). Fundamental knowledge in fields such as statistics, machine learning, deep learning. 4+ years of practical experience developing and deploying production machine learning applications such as supervised learning, unsupervised learning. 4+ years of programming skills in Python. Interest in researching cutting-edge methods in AI, particularl

Principal Member of Technical Staff - AI/ML Infrastructure Engineer

Oracle Corporation

Santa Clara, California, USA

Full-time

Job Description As an AI/ML Infrastructure Engineer on the GPU Strategic Customers Engineering team, you will play a critical role in designing, implementing, and maintaining the infrastructure that supports our AI and machine learning initiatives. You will work closely with data scientists, software engineers, and IT professionals to ensure that our AI/ML models are deployed efficiently, securely, and at scale. Your expertise will be crucial in optimizing our infrastructure for performance, re

Senior Math Libraries Engineers - Python APIs

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA is looking for a self-motivated, specialist software engineer for the design and development of high-performance Python APIs for our math libraries. In the last decade, Python has become the de facto leading programming language for engineers in AI and data science, and more recently in HPC and scientific computing. NVIDIA has been at the forefront of providing GPU accelerated Deep Learning frameworks. These frameworks provide an efficient high-level programming interface allowing their

Fullstack Software Engineer, Machine Learning Platform, AI Infrastructure

Tesla Motors

Palo Alto, California, USA

Full-time

As a Software Engineer within the Autopilot AI Infrastructure team, you will work on reinforcing, optimizing, and scaling our infrastructure components supporting AI research activities for Autopilot and the Optimus. At the core of our autonomy capabilities are neural networks that the research team is designing to train on very large amounts of data, across large-scale GPU clusters and our supercomputer Dojo. Robustly training these models at scale and in the shortest amount of time is critical

Linux Kernel Software Engineer

AMD (Advanced Micro Devices)

Austin, Texas, USA

Full-time

WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellenc

Simulation Software Engineer

Johns Hopkins University Applied Physics Laboratory

Laurel, Maryland, USA

Full-time

Description Are you a skilled software engineer with a passion for scientific computing and simulation development? Are you a C++ developer who loves to design innovative solutions to real-world engineering problems? If so, we're looking for someone like you to join our team at APL! The Strike Guidance, Navigation, Control, and Seekers group is looking for a Simulation Software Engineer to help us architect, develop, and modernize physics-based simulations across a broad range of programs. O

Sr. Machine Learning Scientist, GenAI/R&D

Blue Origin, LLC

Seattle, Washington, USA

Full-time

Application close date: Applications will be accepted on an ongoing basis until the requisition is closed. At Blue Origin, we envision millions of people living and working in space for the benefit of Earth. We're working to develop reusable, safe, and low-cost space vehicles and systems within a culture of safety, collaboration, and inclusion. Join our team of problem solvers as we add new chapters to the history of spaceflight! This role is part of Enterprise Technology (ET), where we're deve

Senior Site Reliability Engineer - DGX Cloud

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and availability using the combination of software and systems engineering practices. This is a highly specialized discipline which demands knowledge across different systems, networking, coding, database, capacity management, continuous delivery and deployment and open source cloud enabling technologies like Kubernetes and OpenStack. SRE at N