Apply Now

Senior Software Engineer - AI Frameworks

Redmond, WA, US • Posted 30+ days ago • Updated 9 hours ago

Full Time

On-site

USD $119,800.00 - 234,700.00 per year

Fitment

Dice Job Match Score™

🛠️ Calibrating flux capacitors...

Job Details

Skills

Large Language Models (LLMs)
Innovation
Accountability
Onboarding
Scheduling
Training
Collaboration
Computer Science
C
C++
Python
PyTorch
Computer Hardware
Artificial Intelligence
Stacks Blockchain
NPU
GPU
Optimization
CUDA
Caching
Quality Assurance
SAP PP
Data Processing
Software Engineering
IC
Integrated Circuit
Internal Communications
SAP BASIS
Microsoft
Immigration
Military

Summary

Overview

The AI Frameworks team at Microsoft accelerates and optimizes large language model deployment on Microsoft's MAIA AI accelerators and GPUs. We build software across the stack, from PyTorch and inference systems such as vLLM and SGLang to performance-critical runtime and kernel components. Our team operates at the intersection of AI algorithmic innovation, purpose-built AI hardware, systems, and software, with a highly collaborative and inclusive culture.

We are seeking a self-motivated Senior Software Engineer - AI Frameworks who thrives on technical innovation, enjoys diving deep into technical details, and adapts quickly in a fast-moving environment. This is a unique opportunity to directly shape the software that powers Microsoft's most advanced AI infrastructure-from custom silicon to the models running on it.

Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees, we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Responsibilities

Architect and implement efficient tensor computation primitives and software abstractions for custom AI accelerators.
Develop and extend PyTorch features for model onboarding, optimization, and execution on custom AI accelerators.
Contribute to and improve AI inference stacks such as vLLM and SGLang, including scheduling, KV cache management, and serving pipelines.
Design, develop, profile, and optimize high-performance kernels for NPUs (MAIA) and GPUs to accelerate LLM inference and training workloads.
Collaborate across disciplines to define requirements and
Deliver practical solutions to new technical challenges.

Qualifications

Required Qualifications:

Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, or Python OR equivalent experience.

Preferred Qualifications:

Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, or Python OR Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, or Python OR equivalent experience.

Experience with PyTorch internals, custom operators, hardware backend, or torch.compile/Dynamo-based optimization flows.

Experience with AI inference stacks such as vLLM, SGLang, or similar large-scale model serving systems.

Experience with NPU or GPU kernel development and optimization (e.g., CUDA, Triton, or accelerator-specific toolchains).

Familiarity with common LLM concepts such as attention mechanisms, KV caching, quantization (PTQ/QAT), and distributed parallelism strategies (TP, PP, DP).

#AIInfra

Software Engineering IC4 - The typical base pay range for this role across the U.S. is USD $119,800.00 - $234,700.00 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $160,200.00 - $261,000.00 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
;br>
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 10494596
Position Id: 41680cf696fe28099eb0229776b6ba85
Posted 30+ days ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Redmond, Washington

•

Today

Overview The Artificial Intelligence (AI) Frameworks team at Microsoft develops AI software that enables running AI models everywhere, from world's fastest AI supercomputers, to servers, desktops, mobile phones, internet of things (IoT) devices and internet browsers. We collaborate with our hardware teams and partners, both internal and external, and operate at the intersection of AI algorithmic innovation, purpose-built AI hardware, systems, and software. We are a team of highly capable and mo

Full-time

USD 119,800.00 - 234,700.00 per year

Principal Software Engineer - Performance Tooling

Redmond, Washington

•

Today

Full-time

USD 142,800.00 - 274,800.00 per year

Senior Software Engineer, CoreAI Workload Engines

Redmond, Washington

•

Today

Overview The CoreAI Workloads team builds the foundational inference engines and APIs that power largescale AI inference across Azure - from cutting-edge startups to Fortune 500 enterprises and Microsoft Copilots and agents. Our mission is to deliver secure, reliable, and highly efficient GPU inference that enable multitenant AI systems at global scale while maximizing utilization, performance, and developer productivity. We own inference serving and performance of OpenAI and other state of the

Full-time

USD 119,800.00 - 234,700.00 per year

AI Software Engineer

Seattle, Washington

•

Today

The Team You will join a dynamic AI Infrastructure team focused on enabling high-performance AI across Zoom's products and services. The team builds the core systems that support model training, deployment, and inference at scale, driving innovation in areas such as real-time communication, computer vision, and natural language understanding. What You Can Expect You'll design, implement, and own the inference systems that serve Zoom's AI models at production scale, across real-time communicat

Full-time

USD 151,800.00 - 332,200.00 per year

Search all similar jobs

Senior Software Engineer - AI Frameworks

Dice Job Match Score™

Job Details

Skills

Summary

Similar Jobs