GPU Design Engineer Memory Hierarchy Jobs in San Francisco, CA

Refine Results
1 - 20 of 25 Jobs

Researcher in Systems

OpenAI

San Francisco, California, USA

Full-time

About the Team The Platform Systems team operates at the intersection of cutting-edge AI and distributed systems. We do the engineering and research required to train our flagship models on our largest custom built supercomputers. We build our own model training software, and focus on the lower layers of the stack including collective communication, scheduling, compute efficiency, parallelism strategies, fault tolerance, and observability. The models we train are key ingredients to the AI res

Software Engineer in Systems

OpenAI

San Francisco, California, USA

Full-time

About the Team The Platform Systems team operates at the intersection of cutting-edge AI and distributed systems. We do the engineering and research required to train our flagship models on our largest custom built supercomputers. We build our own model training software, and focus on the lower layers of the stack including collective communication, scheduling, compute efficiency, parallelism strategies, fault tolerance, and observability. The models we train are key ingredients to the AI res

Software Engineer, Networking

OpenAI

San Francisco, California, USA

Full-time

About the Team The Platform Networking team is responsible for the collective communication stack used in our largest training jobs. Using a combination of C++ and CUDA we work on novel collective communication techniques that enable efficient training of our flagship models on our largest custom built supercomputers. The models we train are key ingredients to the AI research progress at OpenAI and the field as a whole, and we continually incorporate learnings from our entire research org into

Data Center Engineer

OpenAI

San Francisco, California, USA

Full-time

About the Team The Compute team works on the design of our AI supercomputers, doing everything from workload modeling to accelerator co-design. We're leaning into our partnerships to make data center co-design an integral part of this process, and are looking for a founding engineer to lead this work. This team will be responsible for working with partners to optimize existing and future data centers for our workloads, and identifying promising new power distribution and cooling technologies.

Software Engineer, Machine Learning Compute

OpenAI

San Francisco, California, USA

Full-time

About the Team The Applied Engineering team works across research, engineering, product, and design to bring OpenAI's technology to consumers and businesses. You'll join the team responsible for running the infrastructure that supports the models backing ChatGPT and the API. The systems we support include inference kubernetes clusters, GPU health, Infiniband performance, node lifecycle, and more. We seek to learn from deployment and distribute the benefits of AI, while ensuring that this power

GPU Kernels Engineer

OpenAI

San Francisco, California, USA

Full-time

About the Team: Our mission at OpenAI is to discover and enact the path to safe, beneficial AGI. To do this, we believe that many technical breakthroughs are needed in generative modeling, reinforcement learning, large scale optimization, active learning, among other topics. The Research Platform team builds robust and scalable software to support our research efforts. It also offers core development services for mission critical goals and applications. In the Kernel team, we write Kernels for

Software Engineer, Model Inference

OpenAI

San Francisco, California, USA

Full-time

About the Team Our team brings OpenAI's most capable technology to the world through our products. Most recently, we released ChatGPT, GPT-4, the Whisper API, and DALL-E. We empower consumers and developers alike to use and access our start-of-the-art AI models, allowing them to do things that they've never been able to before. Across all product lines, we ensure that these powerful tools are used responsibly. This is a key part of OpenAI's path towards safely deploying broadly beneficial Arti

Software Engineer, Engineering Acceleration

OpenAI

San Francisco, California, USA

Full-time

About the Team The Applied AI team safely brings OpenAI's technology to the world. We released ChatGPT, Plugins, DALL E, and the APIs for GPT-4, GPT-3, embeddings, and fine-tuning. We also operate inference infrastructure at scale. There's a lot more on the immediate horizon. We seek to learn from deployment and distribute the benefits of AI, while ensuring that this powerful tool is used responsibly and safely. Safety is more important to us than unfettered growth. We serve end-users directl

Site Reliability Engineer, Research Platform, SRE

OpenAI

San Francisco, California, USA

Full-time

About the team: Reliable services are what enables Open AI to train the best AI models in the world and to bring the promise of safe, effective AI to the world. The SRE team in research is responsible for defining, measuring, and improving the reliability of the research platform. The SRE team works closely with the supercomputing and hardware health teams to improve the functioning of the existing research platform and build the future platform. The research platform is the platform used to co

Staff Software Engineer, Ads Serving Platform

Pinterest, Inc.

San Francisco, California, USA

Full-time

About Pinterest: Millions of people across the world come to Pinterest to find new ideas every day. It's where they get inspiration, dream about new possibilities and plan for what matters most. Our mission is to help those people find their inspiration and create a life they love. In your role, you'll be challenged to take on work that upholds this mission and pushes Pinterest forward. You'll grow as a person and leader in your field, all the while helping Pinners make their lives better in the

HW/SW Co-design Engineer

OpenAI

San Francisco, California, USA

Full-time

About the Team Our mission at OpenAI is to discover and enact the path to safe, beneficial AGI. To do this, we believe that many technical breakthroughs are needed in generative modeling, reinforcement learning, large scale optimization, active learning, among other topics. The Research Platform team builds robust and scalable software to support our research efforts. It also offers core development services for mission critical goals and applications. In the Kernels team, we write Kernels for

Graphics Software Engineer - IV

SGS Consulting

Remote

Contract

Minimum qualification Experience with Vulkan and Ray Tracing and is able to explain important concepts and design decision relevant for a real-time Path Tracer and can demonstrate good understanding of how SW concepts of GPU shaders programming maps to a HW GPU design.Must know how to cast a shadow ray leveraging APIEducation & Experience in Computer Science or related field with focus on graphicsNo degree. 7+ years of work experience.Bachelor's degree with 5+ years of postdegree experience.Mast

Staff Software Systems Engineer

Rivian

Palo Alto, California, USA

Full-time

About Rivian Rivian is on a mission to keep the world adventurous forever. This goes for the emissions-free Electric Adventure Vehicles we build, and the curious, courageous souls we seek to attract. As a company, we constantly challenge what's possible, never simply accepting what has always been done. We reframe old problems, seek new solutions and operate comfortably in areas that are unknown. Our backgrounds are diverse, but our team shares a love of the outdoors and a desire to protect it

Principal GPU Virtualization Software Engineer

Leom Tech

Remote

Full-time

Skills - One of our Client is looking for a Principal GPU Virtualization or Simulation Software Engineer. Principal GPU Virtualization Software Engineer Roles and Responsibilities: Architect and develop technical solutions that help us deliver high-performance, high-throughput, and high-reliability of GPU virtualization for cross platform vehicle initiatives.Develop GPU virtualization software technology for graphics and display in terms of functionality, performance, efficiency and reliability.

Lead Cloud and Security Engineer

iTech US, Inc.

Remote

Full-time

Position: Lead Cloud and Security Engineer Location: Princeton, NJ 100% Remote. Cloud Engineering Job Responsibilities: Design, deploy, and maintain cloud infrastructure on Azure, ensuring optimal performance and cost-effectiveness.Develop and implement solutions using Tensorflow/Bicep for machine learning tasks.Optimize systems for performance, scalability, and reliability.Deploy and manage PyTorch/TensorFlow workloads on CPU and GPU targets.Implement observability across platforms for applica

On-device ML Engineer - US Remote

Hugging Face

Remote or New York, New York, USA

Full-time

Description Here at Hugging Face, we're on a journey to advance good Machine Learning and make it more accessible. Along the way, we contribute to the development of technology for the better. We have built the fastest-growing, open-source, library of pre-trained models in the world. With more than 1 Million+ models and 320K+ stars on GitHub, over 15.000 companies are using HF technology in production, including leading AI organizations such as Google, Elastic, Salesforce, Grammarly and NASA.

HPC Engineer, Machine Learning Infrastructure - US Remote

Hugging Face

Remote

Full-time

Description Here at Hugging Face, we're on a journey to advance good Machine Learning and make it more accessible. Along the way, we contribute to the development of technology for the better. We have built the fastest-growing, open-source, library of pre-trained models in the world. With more than 1 Million+ models and 320K+ stars on GitHub, over 15.000 companies are using HF technology in production, including leading AI organizations such as Google, Elastic, Salesforce, Grammarly and NASA.

Machine Learning Engineer (Security)

Tential

Remote or Vienna, Virginia, USA

Contract

Our client, a leading Banking & Financial Services company, is seeking a talented Machine Learning Engineer to join their Security team. Responsibility: Build and enhance machine learning models through all phases of development including design, training, validation, and implementation etc. Unlock insights by analyzing large scale of complex numerical and textual data and identifying trends. Partner with a cross-functional team of data engineers, data scientists, and data visualization to deli

Nvidia DGX Infrastructure Architect

World Wide Technology

Remote

Full-time

Nvidia DGX Infrastructure Architect Why WWT? At World Wide Technology, we work together to make a new world happen. Our important work benefits our clients and partners as much as it does our people and communities across the globe. WWT is dedicated to achieving its mission of creating a profitable growth company that is also a Great Place to Work for All. We achieve this through our world-class culture, generous benefits and by delivering cutting-edge technology solutions for our clients. Found

Solution Practice Lead - Advanced Computing (Financial Services vertical)

CDW

Remote

Full-time

This position is part of CDW's Hybrid Infrastructure Practice, a single-source provider to our clients for all things Advanced Computing, Networking, Wireless, & Data Center. The Advanced Computing Practice Lead is responsible for practice development, assisting with planning go-to-market strategies, acting as a liaison between CDW and key advanced computing partners, and mentoring coworkers on artificial intelligence (AI), high performance compute (HPC) and machine learning (ML) technologies. T