Technical Lead Manager, ML Infrastructure

  • Mountain View, CA
  • Posted 20 days ago | Updated 10 hours ago

Overview

On Site
USD 222,775.00 - 333,925.00 per year
Full Time

Skills

IT Management
Artificial Intelligence
Computer Hardware
Productivity
C
TLM
Optimization
Roadmaps
Apache Velocity
Continuous Improvement
Research
Machine Learning (ML)
Leadership
Management
Mentorship
TensorFlow
PyTorch
JAX
Deep Learning
Algorithms
Computer Vision
Natural Language Processing
Communication
Collaboration
Teamwork
CUDA
Training

Job Details

Who We Are

Nuro is a self-driving technology company on a mission to make autonomy accessible to all. Founded in 2016, Nuro is building the world's most scalable driver, combining cutting-edge AI with automotive-grade hardware. Nuro licenses its core technology, the Nuro Driver, to support a wide range of applications, from robotaxis and commercial fleets to personally owned vehicles. With technology proven over years of self-driving deployments, Nuro gives automakers and mobility platforms a clear path to AVs at commercial scale, empowering a safer, richer, and more connected future.

About the Role

The Training ML Infrastructure team is growing, and we are looking for a seasoned engineering leader to lead it. Nuro is pursuing an ML-first software stack. Our team empowers machine learning development at Nuro by building platforms that improve model training velocity, providing productivity tools to analyze model performance, and optimizing model training to reduce training time. Our solutions include a distributed training platform, a scheduler, model component libraries, compilers, etc.

About the Work

We are looking for a TLM to drive initiatives in distributed training and training optimization, using key technologies to scale our deep learning models and achieve higher performance. Your responsibilities:
  • Define and execute the technical vision and roadmap for the Training ML Infra team.
  • Lead, mentor, and grow a team of approximately 6-10 engineers, promoting a culture of technical excellence and collaboration.
  • Collaborate with ML teams to ensure smooth adoption and continued velocity.
  • Promote engineering best practices and cultivate a culture of continuous improvement in the team.

About You
  • 6+ years of work or research experience in ML Infra, distributed training, or distributed systems in the ML domain.
  • Proven leadership experience managing and growing high-performing engineering teams, with a demonstrated ability to mentor and inspire technical talent.
  • Working knowledge of at least one deep learning framework, e.g. TensorFlow, PyTorch, or JAX, and the ability to understand deep learning algorithms in areas such as computer vision, NLP, and behavior planning.
  • Strong communication and teamwork skills, and a passion for exploring and promoting cutting-edge technology.

Bonus Points

  • Experience with CUDA, Pallas, or Triton.
  • Experience with distributed training speedup (e.g. FSDP, DeepSpeed).
  • Familiarity with TPUs.

At Nuro, your base pay is one part of your total compensation package. For this position, the reasonably expected base pay range is between $222,775 and $333,925 for the level at which this job has been scoped. Your base pay will depend on several factors, including your experience, qualifications, education, location, and skills. In the event that you are considered for a different level, a higher or lower pay range would apply. This position is also eligible for an annual performance bonus, equity, and a competitive benefits package.

At Nuro, we celebrate differences and are committed to a diverse workplace that fosters inclusion and psychological safety for all employees. Nuro is proud to be an equal opportunity employer and expressly prohibits any form of workplace discrimination based on race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other legally protected characteristics.