Apply Now

Senior AI/ML performance engineer

• Posted 2 days ago • Updated 9 minutes ago

Contract Corp To Corp

Fitment

Dice Job Match Score™

⏳ Almost there, hang tight...

Job Details

Skills

AI/ML
Tensorflow
Nvidia
CUDA

Summary

Key Responsibilities

Profile and optimize large-scale AI training and inference workloads (transformers, multimodal, diffusion, recommender systems) across multi-node, multi-GPU clusters.
Build tools, frameworks, to detect and identify bottlenecks in compute, memory, interconnects, and communication libraries and deliver optimizations to maximize scaling efficiency.
Develop, maintain and recommend benchmarks for AI training and inference workloads.
Partner with framework teams (PyTorch, TensorFlow) to upstream performance improvements and enable better scaling APIs.
Collaborate across the engineering organizations to deliver efficiency in our usage of hardware, software, and infrastructure
Proactively monitor fleet wide utilization patterns, analyze existing inefficiency patterns, or discover new patterns, and deliver scalable solutions to solve them

Required Qualifications

5+ years in AI/ML performance engineering, HPC, or large-scale inference systems
BS or similar background in Computer Science or related area (or equivalent experience)
Strong understanding and hands-on modern ML techniques and tools
Strong hands-on CUDA programming and optimization experience
Deep understanding of GPU architecture and memory hierarchy
Experience optimizing PyTorch and/or TensorFlow inference
Hands-on experience with NVIDIA Triton, Apache Ray, and Kubernetes GPU scheduling
Experience with RAPIDS and GPU-accelerated data pipelines
Experience in benchmarking methodologies, performance analysis/profiling (e.g. Nsight), performance monitoring tools
Strong track record of optimizing large-scale AI systems

Nice to Have

Neural network architecture optimization experience
Deep TensorRT optimization expertise
Video analytics or real-time inference systems experience
Experience operating large-scale GPU clusters Experience with WebAssembly (WASM) for performance-critical frontend computation.
Advanced Linux OS, container (e.g. Docker) and GitHub skills

Thanks and Regards,

Shiney K

Sr. US IT Recruiter

Prointegrate Inc.

Phone:

Email ID:

New York | London | India

This message (including any attachments) contains confidential information intended for a specific individual and purpose, and is protected by law. If you are not the intended recipient or have received this transmission in error, please contact the sender by reply by email and destroy all copies of the original message. Any unauthorized review, use, copy, dissemination, or disclosure of this email is strictly prohibited.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 91097474
Position Id: 2025-134/13954
Posted 2 days ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Sr AI Performance Engineer

Remote

•

23d ago

JOB DESCRIPTION Senior Performance Engineer Vision AI Platform Public Sector Location Global (remote) US business hours overlap required Reporting to Assurance Lead/ Assurance Director Team Globally distributed engineering team Industry Artificial Intelligence Edge Computing Public Sector Employment Contract About the RoleOur Vision AI platform gives US public sector clients federal agencies, smart-city operators, defense contractors, and critical infrastructure teams a real-time window into the

Easy Apply

Full-time

Depends on Experience

AI and ML integration / Engineer

Spring, Texas

•

27d ago

Job Summary: We are seeking a highly skilled and motivated Senior ML Engineer with a strong focus on AI and ML integration. The ideal candidate will possess extensive knowledge in AI/ML technologies, particularly in the areas of large language models (LLMs) and their applications. This role requires hands-on experience in developing and fine-tuning AI models, as well as a solid understanding of data analysis and engineering practices. Responsibilities: Design, develop, and implement AI/ML sol

Easy Apply

Contract, Third Party

Depends on Experience

Senior AI Engineer

Hybrid in Dallas, Texas

•

Today

We are seeking a highly skilled and experienced Senior AI Engineer to lead the design, development, and deployment of advanced AI systems. You will work on cutting-edge machine learning models, natural language processing, computer vision, and AI infrastructure to solve real-world problems and drive innovation across our products and services. Key Responsibilities: Design, develop, and deploy scalable AI/ML models for production environments.Lead end-to-end AI project lifecycles from data collec

Easy Apply

Third Party, Contract

Depends on Experience

Sr AI Engineer - AI Platform

Dallas, Texas

•

11d ago

Role: Senior AI Engineer | Data & AI Technology Location: Toronto, ON or Dallas, TX (Onsite 34 days/week) Employment Type:Direct Hire Role OverviewWe are seeking a Senior AI Engineer to design, build, and operationalize enterprise-grade AI solutions in a highly regulated environment. This role provides deep technical leadership across AI engineering, MLOps/LLMOps, and governance-by-design, ensuring AI solutions are secure, scalable, auditable, and production-ready. You will own complex AI system

Easy Apply

Full-time, Contract, Third Party

Depends on Experience

Search all similar jobs