Senior AI Hardware Quality Engineer

Overview

On Site
USD 119,800.00 - 234,700.00 per year
Full Time

Skills

Microsoft Office
Regulatory Compliance
FOCUS
Energy
Sustainability
IaaS
Artificial Intelligence
Quality Management
Microsoft Azure
Continuous Improvement
Data Analysis
Clarity
Performance Metrics
Change Management
Functional Requirements
Computer Hardware
Issue Resolution
Servers
Screening
PASS
Cloud Computing
Systems Engineering
Manufacturing
Repair
Patents
Data Centers
CPU
Failure Analysis
Network
Management
GPU
Debugging
Leadership
Collaboration
Root Cause Analysis
Corrective And Preventive Action
Communication
Project Management
Electrical Engineering
Integrated Circuit
Internal Communications
IC
SAP BASIS
Legal
Recruiting
Microsoft

Job Details

Microsoft Silicon, Cloud Hardware, and Infrastructure Engineering (SCHIE) is the team behind Microsoft's expanding Cloud Infrastructure and responsible for powering Microsoft's "Intelligent Cloud" mission. SCHIE delivers the core infrastructure and foundational technologies for Microsoft's over 200 online businesses including Bing, MSN, Office 365, Xbox Live, Teams, OneDrive, and the Microsoft Azure platform globally with our server and data center infrastructure, security and compliance, operations, globalization, and manageability solutions. Our focus is on smart growth, high efficiency, and delivering a trusted experience to customers and partners worldwide and we are looking for passionate, high-energy engineers to help achieve that mission.

As Microsoft's cloud business continues to grow the ability to deploy new offerings and hardware infrastructure on time, in high volume with high quality and lowest cost is of paramount importance. To achieve this goal, the Hardware, Infrastructure Management, and Fundamentals Engineering (HIFE) team is instrumental in defining and delivering operational measures of success for hardware manufacturing, improving the planning process, quality, delivery, scale and sustainability related to Microsoft cloud hardware. We are looking for seasoned engineers with a dedicated passion for customer focused solutions, insight and industry knowledge to envision and implement future technical solutions that will manage and optimize the Cloud infrastructure.

We are looking for a Senior AI Hardware Quality Engineer to join the team.

Responsibilities:

  • Develop and implement a robust supplier quality management strategy to ensure the data center hardware is manufactured at the highest level of quality standards.
  • Lead quality issues and improvement task force to contain, mitigate, and resolve the top-quality issues at data centers.
  • Conduct debug and failure analysis for GPU subsystems in the Azure fleet and drive resolution with partners and suppliers.
  • Drive the continuous improvement process based on Root Cause Analysis (RCA) and identified opportunities.
  • Responsible for quality readouts based on your telemetry data analysis, to bring clarity on status, actions across the organization and next steps for issue resolution.
  • Establish Critical-to-Quality performance metrics to measure and improve product quality.
  • Act as the voice of quality in the hardware change management process, ensuring quality requirements are considered and met and improved.

Qualifications:

Required Qualifications:
  • Master's Degree in Electrical Engineering, or related field AND 3+ years technical engineering experience
    • OR Bachelor's Degree in Electrical Engineering, or related field AND 5+ years technical engineering experience
    • OR equivalent experience.
  • 5+ years of work experience in managing product quality in the electronic industry.
  • 5+ years of direct engineering experience in hardware system issue resolution for GPU Servers.
  • Versed in filtering through applicable debug data, like telemetry and logs to identify and investigate HW failure signatures.
Other Requirements:

Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

Preferred Qualifications:
  • Bachelor's Degree in electrical and systems engineering, or related field AND 7+ years experience in a large scale manufacturing and/or data center environment/repair
    • OR Master's Degree in electrical engineering and systems engineering or related field AND 6+ years experience in a large scale manufacturing and/or data center environment/repair
    • OR Doctorate in electrical engineering and systems engineering or related field AND 3+ years experience in a large scale manufacturing and/or data center environment/repair
    • OR 9+ years equivalent experience.
  • Patent or track record of engineering excellency.
  • Experience with Liquid Cooling Systems in Data Centers
  • 12+ years of experience in working with the modern server architectures - includes understanding of GPU, CPU methods for failure analysis, debugging or validation.
  • 8+ years of system level server debugging with an understanding of platform, power, system and network environments
  • 3+ years of direct GPU related engineering experience in issue debug/test log review.
  • Leadership skills and ability to collaborate with diverse teams and drive a call to action.
  • Experience in root cause analysis and corrective action methods to identify contributing factors of production defects.
  • Ability to analyze large data sets, extract key insights, and effectively present and communicate the results.
  • Proficient communication and project management skills.
Electrical Engineering IC4 - The typical base pay range for this role across the U.S. is USD $119,800 - $234,700 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $158,400 - $258,000 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: US corporate pay information | Microsoft Careers

Microsoft will accept applications and processes offers for these roles on an ongoing basis.

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form .

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

#azurehwjobs #HIFE #AHSI
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.