PyTorch with Triton Performance Engineer


VDart, Inc.
Job Details
Skills
- PyTorch
- Triton
Summary
PyTorch with Triton Performance Engineer
Bellevue, WA (Onsite)
Contract
Job Summary
Design and implement high-intensity stress workloads using PyTorch and Triton to identify performance bottlenecks and improve platform stability and maturity.
Job Description
- Design and implement high-intensity stress workloads using PyTorch and Triton.
- Exercise core MAIA execution paths, including compute, memory, DMA, and collectives.
- Enable early detection of performance cliffs, stability issues, and system bottlenecks across simulator and real hardware.
- Improve platform maturity, reduce late-stage escapes, and increase confidence for broader internal and external adoption.
- Develop PyTorch workloads stressing model-level execution, such as large GEMMs, attention patterns, MoE-like behavior, mixed precision, and long-running loops.
- Author custom Triton kernels to stress hardware execution units, memory hierarchies, and synchronization paths.
- Build parameterized stress harnesses scalable by problem size, number of devices, and runtime duration.
- Integrate workloads with existing profiling, monitoring, and failure triage tooling.
- Collaborate with platform, firmware, and SDK teams to target known risk areas and emerging issues.
- Document usage patterns and provide reproducible scripts for lab and continuous integration (CI) usage.
Roles and Responsibilities
- Develop and maintain a library of reusable PyTorch stress workloads.
- Create Triton-based micro- and macro-kernels designed specifically for stress and saturation testing.
- Build and support test harnesses and scripts for single-device and multi-device execution.
- Ensure workload designs align with platform risk areas and emerging hardware/software issues.
- Collaborate cross-functionally with platform, firmware, and SDK teams to refine stress tests.
- Provide comprehensive documentation describing workload intent, configuration options, and expected stress characteristics.
- Support profiling, monitoring, and failure triage by integrating stress workloads with existing tools.
- Deliver reproducible and scalable testing solutions for lab and CI environments.
- Dice Id: 10330808
- Position Id: 97399-5195-609317
- Posted 8 hours ago
Company Info
VDart, headquartered in Atlanta, GA, is a global leader in digital talent solutions and IT staffing, delivering top technology professionals to businesses worldwide. With a strong presence across North America, Europe and Asia, we specialize in helping organizations navigate complex technology landscapes with the right expertise.
Through a strategic, client-focused approach, we have placed over 20,000 professionals across key industries and advanced technology solutions. Whether placing top talent in cutting-edge roles or providing strategic digital workforce solutions, our network of 4,000 specialists across 13 countries is committed to excellence, agility and impact.
Backed by 18 years of industry experience, we go beyond staffing to build long-term partnerships that accelerate digital transformation and drive sustained growth. Whether you need a technology partner to fuel innovation or specialized workforce solutions to maintain a competitive edge, VDart delivers the right people, skills and mindset to create a lasting impact in a digital-first world.