Senior Performance Engineer
Vision AI Platform · Public Sector
Location: Global (remote) — US business hours overlap required
Reporting to: Assurance Lead / Assurance Director
Team: Globally distributed engineering team
Industry: Artificial Intelligence · Edge Computing · Public Sector
Employment: Contract
About the Role
Our Vision AI platform gives US public sector clients — federal agencies, smart-city operators, defense contractors, and critical infrastructure teams — a real-time window into their physical world. Think live sensor dashboards, geospatial overlays, AI inference result streams, and operational command interfaces used by people who cannot afford a slow or confusing UI.
We are seeking a specialized AI Performance Engineer (consultant) to drive GPU acceleration, CUDA optimization, and distributed AI workload performance for the Vision AI platform.
This is a hands-on performance engineering role focused on optimizing deep learning inference, GPU/CPU utilization, distributed orchestration, and capacity planning across city-scale AI deployments.
The consultant will work closely with AI, DevOps, and Infrastructure teams to improve latency, throughput, and overall system efficiency for production AI workloads.
Key Responsibilities
· Profile and optimize large-scale AI training and inference workloads (transformers, multimodal, diffusion, recommender systems) across multi-node, multi-GPU clusters.
· Build tools and frameworks to detect and isolate bottlenecks in compute, memory, interconnects, and communication libraries, and deliver optimizations that maximize scaling efficiency.
· Develop, maintain, and recommend benchmarks for AI training and inference workloads.
· Partner with framework teams (PyTorch, TensorFlow) to upstream performance improvements and enable better scaling APIs.
· Collaborate across engineering organizations to improve the efficiency of our hardware, software, and infrastructure usage.
· Proactively monitor fleet-wide utilization, analyze known inefficiency patterns and discover new ones, and deliver scalable solutions to address them.
Required Qualifications
· 5+ years in AI/ML performance engineering, HPC, or large-scale inference systems
· BS in Computer Science or a related field (or equivalent experience)
· Strong understanding of, and hands-on experience with, modern ML techniques and tools
· Strong hands-on CUDA programming and optimization experience
· Deep understanding of GPU architecture and memory hierarchy
· Experience optimizing PyTorch and/or TensorFlow inference
· Hands-on experience with NVIDIA Triton, Apache Ray, and Kubernetes GPU scheduling
· Experience with RAPIDS and GPU-accelerated data pipelines
· Experience with benchmarking methodologies, performance analysis and profiling (e.g., NVIDIA Nsight), and performance monitoring tools
· Strong track record of optimizing large-scale AI systems
Nice to Have
· Neural network architecture optimization experience
· Deep TensorRT optimization expertise
· Video analytics or real-time inference systems experience
· Experience operating large-scale GPU clusters
· Experience with WebAssembly (WASM) for performance-critical frontend computation
· Advanced Linux OS, container (e.g., Docker), and GitHub skills