Overview
On Site
USD 110,500.00 - 149,500.00 per year
Full Time
Skills
Security Clearance
T1
Systems Engineering
Red Hat Linux
SAFE
High Performance Computing
FOCUS
Operating Systems
Bioinformatics
Genomics
Microscopy
Reporting
Collaboration
Storage
Scheduling
Communication
Continuous Improvement
Business Analysis
Business Analytics
Servers
Computer Networking
GitHub
GPU
Training
Data Mining
Python
Scripting
Git
Ansible
Terraform
Provisioning
Research
Leadership
HPC
ESP
Linux
Management
Computer Hardware
Data Centers
Amazon Web Services
Continuous Integration
Continuous Integration and Development
Continuous Delivery
Workflow
Telecommuting
Taxes
Apache Flex
Military
Insurance
Professional Services
Innovation
Artificial Intelligence
Machine Learning (ML)
Cloud Computing
Application Development
Job Details
Type of Requisition:
Regular
Clearance Level Must Currently Possess:
None
Clearance Level Must Be Able to Obtain:
None
Public Trust/Other Required:
NACI (T1)
Job Family:
Systems Engineering
Job Qualifications:
Skills:
Computer Servers, High-Performance Computing (HPC) Systems, Linux, Red Hat Ansible, Terraform
Certifications:
None
Experience:
5 + years of related experience
ship Required:
No
Job Description:
Seize your opportunity to make a personal impact as a HPC Systems Engineer supporting NIH's National Institute of Allergy and Infectious Disease. GDIT is your place to make meaningful contributions to challenging projects and grow a rewarding career.
At GDIT, people are our differentiator. As a HPC Systems Engineer you will help ensure today is safe and tomorrow is smarter. Our work depends on HPC Systems Engineer joining our team to bridge the gap between our researchers and the high performance computing resources. You will be one of the faces of our High Performance Compute (HPC) clusters to the NIAID research community who will rely on you to help them get their important research work done. You will focus on supporting HPC hardware, installing scientific applications, optimizing submission scripts and running jobs, and monitoring the health of NIAID's HPC clusters; a 4000+ core HPC cluster that is GPU-focused and a 1,500+ core HPC cluster.
Position is primarily remote; however, you must be able to commute at your own expense to the NIAID's datacenter in Rockville, Maryland approximately once a week to meet contractual obligations.
How a HPC Systems Engineer will Make an Impact:
Required Skills and Experience:
Desired Skills and Experience
The likely salary range for this position is $110,500 - $149,500. This is not, however, a guarantee of compensation or salary. Rather, salary will be set based on experience, geographic location and possibly contractual requirements and could fall outside of this range.
Scheduled Weekly Hours:
40
Travel Required:
None
Telecommuting Options:
Hybrid
Work Location:
USA MD Rockville
Additional Work Locations:
Total Rewards at GDIT:
Our benefits package for all US-based employees includes a variety of medical plan options, some with Health Savings Accounts, dental plan options, a vision plan, and a 401(k) plan offering the ability to contribute both pre and post-tax dollars up to the IRS annual limits and receive a company match. To encourage work/life balance, GDIT offers employees full flex work weeks where possible and a variety of paid time off plans, including vacation, sick and personal time, holidays, paid parental, military, bereavement and jury duty leave. To ensure our employees are able to protect their income, other offerings such as short and long-term disability benefits, life, accidental death and dismemberment, personal accident, critical illness and business travel and accident insurance are provided or available. We regularly review our Total Rewards package to ensure our offerings are competitive and reflect what our employees have told us they value most.
We are GDIT. A global technology and professional services company that delivers consulting, technology and mission services to every major agency across the U.S. government, defense and intelligence community. Our 30,000 experts extract the power of technology to create immediate value and deliver solutions at the edge of innovation. We operate across 50 countries worldwide, offering leading capabilities in digital modernization, AI/ML, Cloud, Cyber and application development. Together with our clients, we strive to create a safer, smarter world by harnessing the power of deep expertise and advanced technology.
Join our Talent Community to stay up to date on our career opportunities and events at
gdit.com/tc.
Equal Opportunity Employer / Individuals with Disabilities / Protected Veterans
Regular
Clearance Level Must Currently Possess:
None
Clearance Level Must Be Able to Obtain:
None
Public Trust/Other Required:
NACI (T1)
Job Family:
Systems Engineering
Job Qualifications:
Skills:
Computer Servers, High-Performance Computing (HPC) Systems, Linux, Red Hat Ansible, Terraform
Certifications:
None
Experience:
5 + years of related experience
ship Required:
No
Job Description:
Seize your opportunity to make a personal impact as a HPC Systems Engineer supporting NIH's National Institute of Allergy and Infectious Disease. GDIT is your place to make meaningful contributions to challenging projects and grow a rewarding career.
At GDIT, people are our differentiator. As a HPC Systems Engineer you will help ensure today is safe and tomorrow is smarter. Our work depends on HPC Systems Engineer joining our team to bridge the gap between our researchers and the high performance computing resources. You will be one of the faces of our High Performance Compute (HPC) clusters to the NIAID research community who will rely on you to help them get their important research work done. You will focus on supporting HPC hardware, installing scientific applications, optimizing submission scripts and running jobs, and monitoring the health of NIAID's HPC clusters; a 4000+ core HPC cluster that is GPU-focused and a 1,500+ core HPC cluster.
Position is primarily remote; however, you must be able to commute at your own expense to the NIAID's datacenter in Rockville, Maryland approximately once a week to meet contractual obligations.
How a HPC Systems Engineer will Make an Impact:
- Work with a 4000+ core HPC cluster that is GPU-focused and a 1,500+ HPC cluster supporting the hardware and operating system environments
- Supporting bioinformatics applications for a large and diverse research community with needs in genomics, cryo-electron microscopy, and AI/ML
- Monitor the portfolio of software applications and be proactive in planning upgrades and license renewals
- Monitor and report on cluster performance and generate data to show usage and trends
- Triage support requests from the research community and work with others in the Scientific Infrastructure team to resolve issues and complete service requests
- Collaborate with researchers to guide them in effective use of the HPC resources, such as job scheduler submission, data formats, and building data workflows
- Engage with researchers to understand their HPC needs to include data life cycle management, integration of scientific instruments to HPC, and storage capacity and compute requirements
- Provide input to the Scientific Infrastructure team leader for setting priorities for cluster operations, scheduling policies, resources needed, etc.
- Attend and actively participate in daily standup meetings to provide updates on progress, discuss obstacles, and co-ordinate tasks with other team members
- Work collaboratively in a team environment to achieve project goals
- Engage in open communication, share knowledge, and support fellow teammates
- Provide feedback and contribute to the continuous improvement of team processes
Required Skills and Experience:
- BS/BA (or equivalent) and 5+ years of related experience
- 5+ years of experience managing physical servers, datacenters, networking, and related technologies
- 5+ years of experience managing Linux systems
- Experience with Spack package manager, including making packages from PyPi, R, Github
- Experience installing and packaging GPU applications and optimizing job submission scripts that are used for ML model training, data mining operations, or high-res graphics rendering
- Experience with Python scripting
- Experience using Git distributed workflows
- Experience with Ansible manage system configuration
- Experience with Terraform for provisioning systems
- Ability to translate technical concepts in HPC and research computing to scientists and other non- technical personnel
- Ability to determine meaningful metrics and usage data for leadership
- HPC scheduler experience (esp. SLURM)
- Must be able to obtain a NIH Public Trust
Desired Skills and Experience
- 10 years managing Linux systems
- 10 years managing hardware in datacenters
- 3 years of experience using Amazon Web Services (AWS)
- Experience developing Continuous Integration / Continuous Delivery workflows
The likely salary range for this position is $110,500 - $149,500. This is not, however, a guarantee of compensation or salary. Rather, salary will be set based on experience, geographic location and possibly contractual requirements and could fall outside of this range.
Scheduled Weekly Hours:
40
Travel Required:
None
Telecommuting Options:
Hybrid
Work Location:
USA MD Rockville
Additional Work Locations:
Total Rewards at GDIT:
Our benefits package for all US-based employees includes a variety of medical plan options, some with Health Savings Accounts, dental plan options, a vision plan, and a 401(k) plan offering the ability to contribute both pre and post-tax dollars up to the IRS annual limits and receive a company match. To encourage work/life balance, GDIT offers employees full flex work weeks where possible and a variety of paid time off plans, including vacation, sick and personal time, holidays, paid parental, military, bereavement and jury duty leave. To ensure our employees are able to protect their income, other offerings such as short and long-term disability benefits, life, accidental death and dismemberment, personal accident, critical illness and business travel and accident insurance are provided or available. We regularly review our Total Rewards package to ensure our offerings are competitive and reflect what our employees have told us they value most.
We are GDIT. A global technology and professional services company that delivers consulting, technology and mission services to every major agency across the U.S. government, defense and intelligence community. Our 30,000 experts extract the power of technology to create immediate value and deliver solutions at the edge of innovation. We operate across 50 countries worldwide, offering leading capabilities in digital modernization, AI/ML, Cloud, Cyber and application development. Together with our clients, we strive to create a safer, smarter world by harnessing the power of deep expertise and advanced technology.
Join our Talent Community to stay up to date on our career opportunities and events at
gdit.com/tc.
Equal Opportunity Employer / Individuals with Disabilities / Protected Veterans
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.