Machine Learning Performance Engineer

Santa Rosa, CA, US • Posted 4 days ago • Updated 5 hours ago
Full Time
On-site
USD $160,160.00 - 266,930.00 per year
Fitment

Dice Job Match Score™

👤 Reviewing your profile...

Job Details

Skills

  • Innovation
  • Electronic Engineering
  • Prototyping
  • Manufacturing
  • Energy
  • Aerospace
  • Semiconductors
  • Data Science
  • Artificial Intelligence
  • Recruiting
  • Stacks Blockchain
  • Mathematics
  • Writing
  • Systems Engineering
  • Modeling
  • Shipping
  • Optimization
  • Fusion
  • Product Engineering
  • Regression Analysis
  • Performance Engineering
  • HPC
  • Machine Learning (ML)
  • Python
  • PyTorch
  • C++
  • Training
  • CUDA
  • GPU
  • Reasoning
  • Software Engineering
  • Testing
  • Code Review
  • Docker
  • Privacy
  • Health Care
  • Insurance

Summary

Overview

Keysight is at the forefront of technology innovation, delivering breakthroughs and trusted insights in electronic design, simulation, prototyping, test, manufacturing, and optimization. Our ~15,000 employees create world-class solutions in communications, 5G, automotive, energy, quantum, aerospace, defense, and semiconductor markets for customers in over 100 countries. Learn more about what we do.

Our award-winning culture embraces a bold vision of where technology can take us and a passion for tackling challenging problems with industry-first solutions. We believe that when people feel a sense of belonging, they can be more creative, innovative, and thrive at all points in their careers.

The AI Models and Data Science team at Keysight AI Labs is hiring a ML Performance Engineer to make our training and inference stacks as fast as the math allows. You'll own end-to-end performance: profiling training workloads on multi-GPU clusters, writing custom CUDA kernels and LibTorch C++ extensions for hot paths, and optimizing inference for embedding in production software where every millisecond matters.

This role sits at the intersection of ML, systems engineering, and HPC. You'll work directly with MLEs and data scientists driving the modeling work, and with the engineering teams shipping these models into Keysight products.

Responsibilities

  • Profile and optimize training workloads - multi-GPU scaling efficiency, throughput, memory footprint, mixed precision, gradient checkpointing tradeoffs
  • Profile and optimize inference for low-latency, high-throughput deployment - quantization, graph optimization, kernel fusion, runtime selection
  • Write custom CUDA kernels and LibTorch (PyTorch C++) extensions to accelerate hot paths in both training and inference
  • Build and maintain serving infrastructure using ONNX Runtime, TensorRT, and similar - including C++ integration paths for embedding models inside production software
  • Partner with MLEs and data scientists on perf-aware architecture choices; partner with product engineering on deployment, versioning, and monitoring
  • Establish performance SLAs and regression tests so models stay fast as they evolve

Qualifications

  • 4+ years in ML engineering, performance engineering, or HPC, with substantial production ML experience
  • Strong Python and C++ - including LibTorch / PyTorch C++ extensions in production
  • Hands-on experience optimizing both training and inference workloads (not just one)
  • CUDA experience required - comfortable profiling GPU code with Nsight and reasoning about occupancy, memory hierarchy, and kernel-level tradeoffs
  • Production deployment experience with ONNX Runtime, TensorRT, or equivalent inference runtimes
  • Solid software engineering fundamentals: testing, versioning, code review, monitoring
  • Experience with Docker and container-based deployment

Careers Privacy Statement

Keysight Technologies Inc. is an equal opportunity employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability or any other protected categories under all applicable laws.

The level of role and salary will be based on applicable experience, education and skills; Most offers will be between the minimum and the midpoint of the Salary Range listed below.

California Pay Range: MIN $160,160- MAX $266,930

Note: For other locations, pay ranges will vary by region.

US Employees may be eligible for the following benefits:

- Medical, dental and vision

- Health Savings Account

- Health Care and Dependent Care Flexible Spending Accounts

- Life, Accident, Disability insurance

- Business Travel Accident and Business Travel Health

- 401(k) Plan

- Flexible Time Off, Paid Holidays

- Paid Family Leave

- Discounts, Perks

- Tuition Reimbursement

- Adoption Assistance

- ESPP (Employee Stock Purchase Plan)
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: RTX19ba19
  • Position Id: 243a63723814debedfc0975c71e3bdc3
  • Posted 4 days ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Santa Rosa, California

Today

Full-time

USD 2,000.00 per week

Santa Rosa, California

Today

Full-time

USD 2,000.00 per week

Remote or San Francisco, California

Today

Full-time

USD 200,000.00 - 345,000.00 per year

San Francisco, California

Today

Full-time

USD 118,000.00 - 169,000.00 per year

Search all similar jobs