Job#: 3024429
Job Description: Systems Administrator 2 - High-Performance Computing (HPC)
Location: Redmond, WA (Hybrid - Onsite 3x/week)
Contract Role
Overview
We are seeking a Systems Administrator 2 with strong Linux and automation experience to support the design, deployment, and ongoing operations of high-performance compute (HPC) clusters used by Microsoft's Quantum Computing research teams. This role ensures our researchers have a secure, compliant, highly available, and high-performance environment for running advanced simulations and workloads.
You will work hands-on with compute, storage, and networking infrastructure; develop automation for cluster lifecycle management; and collaborate closely with engineering, security, and research partners. This position is ideal for someone who thrives in distributed Linux environments, enjoys solving complex systems problems, and wants to contribute directly to cutting-edge quantum research.
Key Responsibilities
- Build, deploy, and maintain HPC cluster infrastructure, including compute nodes, storage systems, and networking components.
- Develop and operate automation for cluster deployment, configuration, scaling, and lifecycle management.
- Diagnose and resolve platform-level issues affecting reliability, performance, or workload execution.
- Participate in the full DevOps lifecycle, including code development, code review, testing, and production operations.
- Validate HPC platforms for readiness, security compliance, and internal customer use.
- Maintain accurate and comprehensive documentation for architecture, deployment processes, and operational procedures.
- Collaborate with researchers to troubleshoot issues related to running simulations and workloads on HPC platforms.
- Support a major migration of HPC infrastructure from a general Microsoft corporate tenant to a custom Quantum tenant to enhance security and isolation.
Typical Day in the Role
A typical day involves a blend of hands-on systems work and cross-team collaboration. You may be deploying new compute nodes, writing automation to streamline cluster configuration, debugging performance issues affecting research workloads, or validating new platform capabilities for compliance and readiness. You will regularly interact with researchers to ensure their simulations run smoothly and with engineering partners to maintain a secure, stable, and scalable HPC environment.
Required Qualifications
- Bachelor's degree in Computer Science, Computer Engineering, or a related technical field.
- 2+ years of Linux systems administration experience in production, lab, or research computing environments.
- 2+ years of experience with automation tools such as Python, Ansible, or Terraform.
- 2+ years of experience supporting distributed, multi-user systems.
- Strong proficiency with the Linux terminal and command-line tooling.
- Experience troubleshooting performance, reliability, or configuration issues in production or pre-production systems.
- Experience writing scripts or tools for automation, diagnostics, or operational workflows.
- Ability to learn and operate within existing platforms and processes while contributing to long-term improvements.
Disqualifier: Candidates without hands-on Linux administration experience will not be considered.
Preferred / Beneficial Skills
- Experience with high-performance computing (HPC) as a user or developer of parallel/accelerated applications.
- Familiarity with HPC schedulers such as Slurm.
- Exposure to HPC offerings in Azure.
- Experience with containers or microservices.
- Experience supporting large-scale Linux environments with complex networking and storage requirements.
Ideal Candidate Profile
The strongest candidates will have:
- 2+ years of Linux systems administration experience.
- Hands-on experience supporting HPC environments or large distributed compute systems.
- Familiarity with Slurm, parallel workloads, and multi-user research environments.
- Experience with automation (Python, Terraform, Ansible) and strong command-line fluency.
- Exposure to Azure-based HPC solutions or cloud-hosted compute clusters.
Unique Value Proposition
This role offers the opportunity to directly impact Microsoft's quantum computing research by delivering a secure, reliable, and high-performance compute environment that enables groundbreaking scientific work. You will collaborate with teams across security, IT, and research, gaining exposure to cutting-edge technologies and contributing to one of Microsoft's most innovative programs.
Performance Expectations
Contractor performance will be measured by:
- Timeliness and completeness of deliverables.
- Quality, reliability, and validation of work prior to submission.
- Adherence to planned work items tracked in Azure DevOps (ADO).
Work Location & Schedule
- Hybrid Work Arrangement (HWA): Onsite in Redmond at least 3 days per week.
- Reason: Collaboration, access to hardware, and team alignment.
- Remote Flexibility: Work hours are not flexible across time zones.
EEO Employer
Apex Systems is an equal opportunity employer. We do not discriminate or allow discrimination on the basis of race, color, religion, creed, sex (including pregnancy, childbirth, breastfeeding, or related medical conditions), age, sexual orientation, gender identity, national origin, ancestry, citizenship, genetic information, registered domestic partner status, marital status, disability, status as a crime victim, protected veteran status, political affiliation, union membership, or any other characteristic protected by law. Apex will consider qualified applicants with criminal histories in a manner consistent with the requirements of applicable law. If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation in using our website for a search or application, please contact our Employee Services Department.
Apex Systems is a world-class IT services company that serves thousands of clients across the globe. When you join Apex, you become part of a team that values innovation, collaboration, and continuous learning. We offer quality career resources, training, certifications, development opportunities, and a comprehensive benefits package. Our commitment to excellence is reflected in many awards, including ClearlyRated's Best of Staffing in Talent Satisfaction in the United States and Great Place to Work in the United Kingdom and Mexico. Apex uses a virtual recruiter as part of the application process.
Apex Benefits Overview: Apex offers a range of supplemental benefits, including medical, dental, vision, life, disability, and other insurance plans that offer an optional layer of financial protection. We offer an ESPP (employee stock purchase program) and a 401(k) program that typically allows you to contribute within 30 days of starting, with a company match after 12 months of tenure. Apex also offers an HSA (Health Savings Account on the HDHP plan), a SupportLinc Employee Assistance Program (EAP) with up to 8 free counseling sessions, a corporate discount savings program, and other discounts. In terms of professional development, Apex hosts an on-demand training program, provides access to certification prep and a library of technical and leadership courses, books, and seminars once you have 6+ months of tenure, and offers certification discounts and other perks with associations that include CompTIA and IIBA. Apex has a dedicated customer service team for our Consultants that can address questions around benefits and other resources, as well as a certified Career Coach. You can also find a full list of our benefits, programs, support teams, and resources in our 'Welcome Packet', which an Apex team member can provide.