NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 30 years. It's a unique legacy of innovation that's fueled by great technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent. As an NVIDIAN, you'll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
NVIDIA is seeking an AI Architect to join its AI Tools Team! This role will focus on the extensive scale-up of key AI solutions for NVIDIA's internal organization, working closely with various teams such as Graphics Processors, Mobile Processors, Deep Learning, Artificial Intelligence, and Driverless Cars to meet their infrastructure needs. The cloud services support nearly half a million automated jobs daily on five thousand servers, enhancing the productivity of thousands of NVIDIA software developers worldwide. The cloud hosts a diverse mix of machines and devices with various operating systems (Windows/Linux/Android) and hardware platforms, including NVIDIA GPUs and Tegra processors.
What you'll be doing:
- Architect, build, and enable internal AI platforms and solutions to be used by thousands of NVIDIANs worldwide.
- Spot opportunities where AI is the best tool: uncover gaps, and recommend AI-first approaches over conventional solutions, grounded in hands-on evaluation of modern AI-native tools.
- Set the north star with cross-functional teams: align on end-to-end AI system outcomes and translate them into clear, measurable objectives.
- Introduce technologies enabling massively parallel systems to improve turnaround time by an order of magnitude.
- Lead through influence: Drive, motivate, convince, and mentor sub-system owners to achieve improvements with agility, speed, and high engineering standards.
- Optimize for performance and cost: identify bottlenecks across training, evaluation, and testing workflows and improve throughput, latency, and efficiency.
- Collaborate with AI product vendors to gain deep insight into the AI industry, and share it with leaders and developers internally.
What we need to see:
- MS/PhD in AI/CS (or equivalent experience) with 12+ years building systems software, including 2+ years building/exploring AI solutions.
- Hands-on experience with LLMs, RAG, fine-tuning, and agentic/workflow orchestration.
- Strong "AI-first" approach and proficiency with modern AI-native developer ecosystems and tooling.
- Validated experience deploying to hybrid, multi-cloud environments (and ideally edge).
- Track record architecting and shipping large-scale distributed systems in production.
- Proven ability to find system bottlenecks and deliver measurable performance/cost improvements.
- Strong programming skills in Java and Python; validated understanding of distributed systems concepts and REST APIs.
- Expertise with containerization and virtualization (Docker, VMs); Kubernetes experience is a plus.
- Solid understanding of cloud/platform and data infrastructure tools such as OpenStack, Kubernetes, Chef/Puppet, Hadoop/Ceph/SwiftStack, LXC, Git/Perforce, JFrog, Kafka.
- Excellent multi-functional influence skills: able to drive alignment across org boundaries in a global, multi-time-zone environment.
Ways to stand out from the crowd:
- Depth in AI, Machine Learning, and Deep Learning algorithms and techniques.
- Strong collaborative and interpersonal skills, with a proven record of guiding and influencing others in dynamic environments.
- Industry thought leader in AI who has influenced the AI ecosystem to deliver forward-looking solutions.
- Background in designing high-performance, scalable software systems with a strong focus on hardware cost optimization.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD for Level 5, and 272,000 USD - 431,250 USD for Level 6.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until February 1, 2026.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.