hpc engineer Jobs in boston, ma

Refine Results
1 - 20 of 88 Jobs

HPC Software Engineer

Canonical - Jobs

Boston, Massachusetts, USA

Full-time

Job DescriptionJob DescriptionHPC is an important and technically challenging compute domain, with specialised tooling and a very high expectation of precision, efficiency and automation. This role is for a software engineer to join our HPC team to deliver an outstanding HPC experience - from bare metal to public cloud - as part of the broader Ubuntu platform. We are looking for a range of skills and experience, and will work on everything from the kernel to Debian packaging, but the heart of ou

HPC Engineer, AI Infrastructure

Tesla Motors

Remote or Palo Alto, California, USA

Full-time

Tesla's Supercomputing/AI infrastructure team works directly with the high-performance computing and machine learning infrastructure on which our ML algorithms run; this includes virtual simulations, Autopilot hardware, silicon design, and Dojo. With the rapidly-growing need for more data and optimized compute resources, cluster builds are getting larger and increasingly complex. Continued development/automation of deployment, monitoring, self-healing and alerting processes is imperative to the

HPC Engineer, User Assistance, Entry Level (Hybrid Eligible)

Oak Ridge National Laboratory

Remote or Oak Ridge, Tennessee, USA

Full-time

Requisition Id12019 The Organization: The National Center for Computational Sciences (NCCS) provides groundbreaking computational and data science infrastructure for technical and scientific professionals to accelerate scientific discovery and engineering advances across a broad range of disiplines. As an important part of the broader High-Performance Computing (HPC) infrastructure, the division also hosts the Oak Ridge Leadership Computing Facility (OLCF), a Department of Energy Office of Scien

Senior HPC Infrastructure Engineer

St. Jude Childrens Research Hospital

Remote or Memphis, Tennessee, USA

Full-time

Join a cutting-edge team dedicated to pushing the boundaries of high-performance computing (HPC) and artificial intelligence (AI) infrastructure! As a Senior HPC Infrastructure Engineer, you'll play a pivotal role in designing, implementing, and optimizing our state-of-the-art HPC clusters and servers. Your expertise will ensure that our research computing environment excels in scalability, redundancy, and performance. Key Responsibilities: Lead the architecture, design, and implementation of a

Senior HPC Performance Engineer - AI for Science at Scale

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA has become the platform upon which every new AI-powered application is built. We are seeking a Sr. HPC Performance engineer to join our team of scientists and engineers passionate about building the next generation of scientific machine learning (ML) frameworks. Starting with digital biology, through high performance computing (HPC) and powerful ML methods, together, we will advance NVIDIA's capacity to accelerate AI for Science and industries that depend on it. What you'll be doing: De

Senior HPC Systems Engineer

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology-and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.

Linux HPC Systems Engineer (Hybrid Eligible)

Oak Ridge National Laboratory

Remote or Oak Ridge, Tennessee, USA

Full-time

Requisition Id13847 --Overview: We are hiring a Linux HPC Systems Engineer to design, operate and maintain clusters, servers, workstations that support services where science happens at ORNL! This position resides in the Emerging Technologies & Computing Group in the Information Technology Services Directorate at Oak Ridge National Laboratory (ORNL). The Emerging Technology Computational Group facilitates ORNL goals through HPC systems engineering, integration, and support for the research commu

HPC Systems Engineer

Vbeyond Corporation

New York, USA

Contract

Hi Job Seekers, Hope you are doing great!! Currently, we have a job opening of HPC Systems Engineer- Remote for a contract role with our client. If you are interested then please reply me with your updated resume JD- Hands on experience setting up HPC compute cluster. install Nvidia drivers Install manage configure GPU software stack like Pytorch, tensorflow, cuda Python Setup PBS job scheduler and supporting PBS servers Experience with Redhat and Rocky Linux; bash scripting Nice to have Docke

HPC Software Engineer (Hybrid Eligible)

Oak Ridge National Laboratory

Remote or Oak Ridge, Tennessee, USA

Full-time

Requisition Id 13595 Overview: We're hiring an HPC Software Engineer to support the integration of computing hardware and software tools for accomplishing research tasks across a variety of scientific research areas! This position is in the Emerging Technologies & Computing (ETAC) Group in the Research Computing Support Division (RCSD) of the Information Technology Services Directorate (ITSD) at Oak Ridge National Laboratory (ORNL). Our HPC engineering team facilitates the mission of ORNL throug

HPC SYSTEMS ENGINEER

University of Washington

Remote or Seattle, Washington, USA

Full-time

As a UW employee, you have a unique opportunity to change lives on our campuses, in our state, and around the world. UW employees offer their boundless energy, creative problem-solving skills, and dedication to build stronger minds and a healthier world. By being deeply invested in our work, showing compassion in our interactions, and embodying the spirit of a team player, each member contributes to a thriving community. UW is committed to attracting and retaining a diverse staff; your experienc

Senior Math Libraries Engineer - AI and HPC

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA Math Libraries team is looking for an expert engineer to join our development efforts in the area of kernel generation for AI and HPC, specifically targeting matrix operations, JITing and fusions. Around the world, leading commercial and academic organizations are revolutionizing AI, scientific and engineering simulations, and data analytics, using data centers powered by GPUs. Applications of these technologies are in healthcare, NLP, VR, deep learning, autonomous vehicles and countless

Technical Support Engineer, Linux and HPC Admin - DGX Cloud

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA has been redefining computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation fueled by great technology-and dynamic people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent. NVIDIANS

Liquid Cooling Design Engineer

Dell

Hopkinton, Massachusetts, USA

Full-time

Liquid Cooling Design Engineer Mechanical Engineering leads and delivers the development of innovative and compliant mechanical design solutions, as well as cross-functional interfaces for AI, Cloud, High Performance Computing (HPC), Edge computing devices and enterprise networking, server and storage products. Our team conducts the analysis, feasibility studies and testing of mechanical products, instruments, subassemblies and packaging for new and existing products - and then oversees the int

AWS FSX LUSTRE - SDE II, FSX

Amazon Development Center U.S., Inc.

Boston, Massachusetts, USA

Full-time

We are looking for a Software Development Engineer to join the Amazon FSx for Lustre team. Launched in a Re:Invent keynote in November 2018, Amazon FSx for Lustre is a fast-growing AWS service that makes it easy for customers to launch and run high-performance file systems in the cloud. With Amazon FSx, in just minutes you can provision a Lustre parallel file system that can process massive data sets at up to hundreds of gigabytes per second of throughput, millions of IOPS, and sub-millisecond l

Software Development Manager, High Performance Computing

Amazon Development Center U.S., Inc.

Boston, Massachusetts, USA

Full-time

The AWS High Performance Computing (HPC) group is looking for a Software Development Manager (SDM) based out of Boston to lead a team focused on building HPC products and services. The HPC team is building a core set of technologies that allow our customers to plan, schedule, and execute HPC workloads across the full range of AWS compute services and capabilities. An HPC infrastructure is complex in nature including the provisioning of multiple resources as computing, networking, storage and the

Software Development Manager, High Performance Computing

Amazon Development Center U.S., Inc.

Boston, Massachusetts, USA

Full-time

The AWS High Performance Computing (HPC) group is looking for a Software Development Manager (SDM) based out of Boston to lead a team focused on building HPC products and services. The HPC team is building a core set of technologies that allow our customers to plan, schedule, and execute HPC workloads across the full range of AWS compute services and capabilities. An HPC infrastructure is complex in nature including the provisioning of multiple resources as computing, networking, storage and the

Software Development Manager, High Performance Computing

Amazon Development Center U.S., Inc.

Boston, Massachusetts, USA

Full-time

The AWS High Performance Computing (HPC) group is looking for a Software Development Manager (SDM) based out of Boston to lead a team focused on building HPC products and services. The HPC team is building a core set of technologies that allow our customers to plan, schedule, and execute HPC workloads across the full range of AWS compute services and capabilities. An HPC infrastructure is complex in nature including the provisioning of multiple resources as computing, networking, storage and the

AWS FSx Lustre - SDE II, FSx

Amazon Development Center U.S., Inc.

Boston, Massachusetts, USA

Full-time

We are looking for a Software Development Engineer to join the Amazon FSx for Lustre team. Launched in a Re:Invent keynote in November 2018, Amazon FSx for Lustre is a fast-growing AWS service that makes it easy for customers to launch and run high-performance file systems in the cloud. With Amazon FSx, in just minutes you can provision a Lustre parallel file system that can process massive data sets at up to hundreds of gigabytes per second of throughput, millions of IOPS, and sub-millisecond l

AWS FSx Lustre - SDE II, FSx

Amazon Development Center U.S., Inc.

Boston, Massachusetts, USA

Full-time

We are looking for a Software Development Engineer to join the Amazon FSx for Lustre team. Launched in a Re:Invent keynote in November 2018, Amazon FSx for Lustre is a fast-growing AWS service that makes it easy for customers to launch and run high-performance file systems in the cloud. With Amazon FSx, in just minutes you can provision a Lustre parallel file system that can process massive data sets at up to hundreds of gigabytes per second of throughput, millions of IOPS, and sub-millisecond l

AWS FSx Lustre - SDE II, FSx

Amazon Development Center U.S., Inc.

Boston, Massachusetts, USA

Full-time

We are looking for a Software Development Engineer to join the Amazon FSx for Lustre team. Launched in a Re:Invent keynote in November 2018, Amazon FSx for Lustre is a fast-growing AWS service that makes it easy for customers to launch and run high-performance file systems in the cloud. With Amazon FSx, in just minutes you can provision a Lustre parallel file system that can process massive data sets at up to hundreds of gigabytes per second of throughput, millions of IOPS, and sub-millisecond l