At NVIDIA, we build groundbreaking products for the following sectors: Automotive, VR, Gaming, Deep Learning, and High Performance Computing. See your efforts in action as developers use your tools to debug, profile, and analyze the performance of their systems and applications using the low-level libraries that you helped to craft as a member of the GPU Foundations Developer Tools team! Innovate as you develop the performance analysis capabilities of NVIDIA hardware, along with the Nsight tools and the foundation library, to support next-generation accelerated computing at datacenter scale.
As a system software engineer in the Developer Tools group, you will develop software that empowers GPU application developers to build outstanding compute applications deployed on the world's largest distributed environments. We are seeking a talented Software Engineer to contribute to performance triage and to the development and co-design of foundational software libraries for developer tools, in collaboration with our Hardware Architecture team. Join our team and gain exciting opportunities to work hands-on at every layer of NVIDIA's outstanding technology.
What you'll be doing:
- Design, develop, and maintain GPU performance foundation libraries for Nsight tools, with a focus on high-fidelity hardware events and counters.
- Develop and implement GPU assembly tests.
- Build and maintain a test validation framework written primarily in CUDA and GPU Assembly.
- Utilize emulators to debug and verify instruction events.
- Document tool use cases and data processing workflows to facilitate architectural explorations.
What we need to see:
- B.S. in EE/CS (or equivalent experience) with 5+ years of experience, M.S. with 2+ years of experience, or Ph.D.
- Strong programming ability in C, C++, and scripting languages such as Python.
- Good understanding of, or prior experience with, low-level assembly code.
- Solid understanding of hardware pipeline and execution unit instruction pipeline concepts, with a willingness to work at a detailed implementation level.
- Knowledge of hardware-software co-design principles and practices.
- Experience with performance analysis and optimization of software on hardware accelerators.
- Experience with developing on simulators and emulators.
- Excellent problem-solving skills and the ability to work collaboratively in a team environment.
- Strong communication skills, both written and verbal.
Ways to stand out from the crowd:
- Demonstrated knowledge of compute (CUDA/OpenCL), modern graphics (DirectX12, OpenGL, Vulkan, Metal), or DL frameworks (PyTorch/JAX).
- Prior experience authoring developer tools, particularly for GPUs, games, pro visualization, or compute workloads.
- Knowledge of performance analysis, particularly of GPU applications.
- Experience in driver development.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until January 26, 2026.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.