Overview
Remote
USD 123,250.00 - 166,750.00 per year
Full Time
Skills
T1
Systems Engineering
SUSE Linux
High Performance Computing
Modeling
Technical Training
Security Clearance
Technical Support
Storage
SAP BASIS
Computer Hardware
Linux
System Administration
Provisioning
xCAT
Configuration Management
Ansible
Puppet
Linux Administration
SLES
Red Hat Linux
CentOS
Batch Management
Scheduling
LSF
NFS
Network
InfiniBand
Oracle Policy Automation
Ethernet
Scripting
Bash
Perl
Python
C
Writing
Wiki
Management
Service Level
File Systems
HPC
MPI
Nagios
Grafana
Telecommuting
Taxes
Apache Flex
Military
Insurance
Professional Services
Innovation
Artificial Intelligence
Machine Learning (ML)
Cloud Computing
Application Development
Job Details
Type of Requisition:
Regular
Clearance Level Must Currently Possess:
Other
Clearance Level Must Be Able to Obtain:
None
Public Trust/Other Required:
NACI (T1)
Job Family:
Systems Engineering
Job Qualifications:
Skills:
High-Performance Computing (HPC) Systems, Linux System Administration, Python (Programming Language), Scripting, SUSE Linux Enterprise Server (SLES)
Certifications:
None
Experience:
10 + years of related experience
ship Required:
Yes
Job Description:
At GDIT, people are our differentiator. Our work depends on an HPC Systems Admin joining our team to support the National Oceanic and Atmospheric Administration (NOAA), Weather and Climate Operational Supercomputer System (WCOSS). This position is primarily remote with working hours aligned to the Eastern time zone.
WCOSS2 provides NOAA the operational High Performance Computing (HPC) resources essential to process sophisticated numerical models used to predict and understand atmospheric and oceanic phenomena for weather prediction operations. Operating 24/7, the 10-year WCOSS program will deliver significant computational capability that will evolve over time to keep pace with NOAA's growing environmental modeling needs.
We are looking for individuals to join GDIT's team to deploy, operate and support leading-edge technology for WCOSS. Specific technology training will be provided.
***Active Clearance is a plus***
We think. We act. We deliver. There is no challenge we can't turn into opportunity.
In this role, a typical day will include:
Applying current HPC systems administrative skills; desire to learn and deploy new technologies.
Developing and deploying monitoring capabilities.
Developing and implementing tools for cluster administration.
Providing technical support with team of HPC System & Storage Administrators to resolve operational issues.
Providing off-hour on-call support on a rotating basis.
Contributing to planning for software and hardware upgrades along with future installations.
REQUIRED QUALIFICATIONS
Bachelor's degree or equivalent and 10+ years of experience with Linux-based HPC systems operations.
Experience working in a 24X7 operational environment.
DESIRED QUALIFICATIONS
Demonstrated experience to deploying and managing large-scale HPC systems using OS provisioning tools (e.g., xCat, HPCM, BCM).
Demonstrated experience using configuration management tools (e.g., Ansible, Puppet).
Linux system administration experience (e.g., SLES, RedHat or CentOS).
Batch management/scheduling systems (SLURM, PBSPro, LSF) experience, PBSpro preferred.
Parallel filesystem configuration and monitoring experience (e.g., Lustre, NFS), Lustre preferred.
High Speed Network interconnect configuration and monitoring experience (Infiniband, OPA, Ethernet, Slingshot).
Programming or scripting in at least two languages (e.g., Bash, Perl, Python, C).
Strong writing skills for technical documents, system procedures, user wiki's and FAQs.
Ability to work both independently and as part of a team.
Knowledge/experience managing computer systems under Service Level Agreements (SLAs).
Demonstrated expertise in at least one of these areas: Batch Schedulers, High Speed Networks, Parallel File systems.
Experience running and optimizing HPC performance benchmarks or MPI codes would be a plus.
Experience with utilization and configuration of monitoring solutions such as Nagios and Grafana would be a plus.
The likely salary range for this position is $123,250 - $166,750. This is not, however, a guarantee of compensation or salary. Rather, salary will be set based on experience, geographic location and possibly contractual requirements and could fall outside of this range.
Scheduled Weekly Hours:
40
Travel Required:
None
Telecommuting Options:
Remote
Work Location:
Any Location / Remote
Additional Work Locations:
Total Rewards at GDIT:
Our benefits package for all US-based employees includes a variety of medical plan options, some with Health Savings Accounts, dental plan options, a vision plan, and a 401(k) plan offering the ability to contribute both pre and post-tax dollars up to the IRS annual limits and receive a company match. To encourage work/life balance, GDIT offers employees full flex work weeks where possible and a variety of paid time off plans, including vacation, sick and personal time, holidays, paid parental, military, bereavement and jury duty leave. GDIT typically provides new employees with 15 days of paid leave per calendar year to be used for vacations, personal business, and illness and an additional 10 paid holidays per year. Paid leave and paid holidays are prorated based on the employee's date of hire. The GDIT Paid Family Leave program provides a total of up to 160 hours of paid leave in a rolling 12 month period for eligible employees. To ensure our employees are able to protect their income, other offerings such as short and long-term disability benefits, life, accidental death and dismemberment, personal accident, critical illness and business travel and accident insurance are provided or available. We regularly review our Total Rewards package to ensure our offerings are competitive and reflect what our employees have told us they value most.
We are GDIT. A global technology and professional services company that delivers consulting, technology and mission services to every major agency across the U.S. government, defense and intelligence community. Our 30,000 experts extract the power of technology to create immediate value and deliver solutions at the edge of innovation. We operate across 50 countries worldwide, offering leading capabilities in digital modernization, AI/ML, Cloud, Cyber and application development. Together with our clients, we strive to create a safer, smarter world by harnessing the power of deep expertise and advanced technology.
Join our Talent Community to stay up to date on our career opportunities and events at
gdit.com/tc.
Equal Opportunity Employer / Individuals with Disabilities / Protected Veterans
Regular
Clearance Level Must Currently Possess:
Other
Clearance Level Must Be Able to Obtain:
None
Public Trust/Other Required:
NACI (T1)
Job Family:
Systems Engineering
Job Qualifications:
Skills:
High-Performance Computing (HPC) Systems, Linux System Administration, Python (Programming Language), Scripting, SUSE Linux Enterprise Server (SLES)
Certifications:
None
Experience:
10 + years of related experience
ship Required:
Yes
Job Description:
At GDIT, people are our differentiator. Our work depends on an HPC Systems Admin joining our team to support the National Oceanic and Atmospheric Administration (NOAA), Weather and Climate Operational Supercomputer System (WCOSS). This position is primarily remote with working hours aligned to the Eastern time zone.
WCOSS2 provides NOAA the operational High Performance Computing (HPC) resources essential to process sophisticated numerical models used to predict and understand atmospheric and oceanic phenomena for weather prediction operations. Operating 24/7, the 10-year WCOSS program will deliver significant computational capability that will evolve over time to keep pace with NOAA's growing environmental modeling needs.
We are looking for individuals to join GDIT's team to deploy, operate and support leading-edge technology for WCOSS. Specific technology training will be provided.
***Active Clearance is a plus***
We think. We act. We deliver. There is no challenge we can't turn into opportunity.
In this role, a typical day will include:
Applying current HPC systems administrative skills; desire to learn and deploy new technologies.
Developing and deploying monitoring capabilities.
Developing and implementing tools for cluster administration.
Providing technical support with team of HPC System & Storage Administrators to resolve operational issues.
Providing off-hour on-call support on a rotating basis.
Contributing to planning for software and hardware upgrades along with future installations.
REQUIRED QUALIFICATIONS
Bachelor's degree or equivalent and 10+ years of experience with Linux-based HPC systems operations.
Experience working in a 24X7 operational environment.
DESIRED QUALIFICATIONS
Demonstrated experience to deploying and managing large-scale HPC systems using OS provisioning tools (e.g., xCat, HPCM, BCM).
Demonstrated experience using configuration management tools (e.g., Ansible, Puppet).
Linux system administration experience (e.g., SLES, RedHat or CentOS).
Batch management/scheduling systems (SLURM, PBSPro, LSF) experience, PBSpro preferred.
Parallel filesystem configuration and monitoring experience (e.g., Lustre, NFS), Lustre preferred.
High Speed Network interconnect configuration and monitoring experience (Infiniband, OPA, Ethernet, Slingshot).
Programming or scripting in at least two languages (e.g., Bash, Perl, Python, C).
Strong writing skills for technical documents, system procedures, user wiki's and FAQs.
Ability to work both independently and as part of a team.
Knowledge/experience managing computer systems under Service Level Agreements (SLAs).
Demonstrated expertise in at least one of these areas: Batch Schedulers, High Speed Networks, Parallel File systems.
Experience running and optimizing HPC performance benchmarks or MPI codes would be a plus.
Experience with utilization and configuration of monitoring solutions such as Nagios and Grafana would be a plus.
The likely salary range for this position is $123,250 - $166,750. This is not, however, a guarantee of compensation or salary. Rather, salary will be set based on experience, geographic location and possibly contractual requirements and could fall outside of this range.
Scheduled Weekly Hours:
40
Travel Required:
None
Telecommuting Options:
Remote
Work Location:
Any Location / Remote
Additional Work Locations:
Total Rewards at GDIT:
Our benefits package for all US-based employees includes a variety of medical plan options, some with Health Savings Accounts, dental plan options, a vision plan, and a 401(k) plan offering the ability to contribute both pre and post-tax dollars up to the IRS annual limits and receive a company match. To encourage work/life balance, GDIT offers employees full flex work weeks where possible and a variety of paid time off plans, including vacation, sick and personal time, holidays, paid parental, military, bereavement and jury duty leave. GDIT typically provides new employees with 15 days of paid leave per calendar year to be used for vacations, personal business, and illness and an additional 10 paid holidays per year. Paid leave and paid holidays are prorated based on the employee's date of hire. The GDIT Paid Family Leave program provides a total of up to 160 hours of paid leave in a rolling 12 month period for eligible employees. To ensure our employees are able to protect their income, other offerings such as short and long-term disability benefits, life, accidental death and dismemberment, personal accident, critical illness and business travel and accident insurance are provided or available. We regularly review our Total Rewards package to ensure our offerings are competitive and reflect what our employees have told us they value most.
We are GDIT. A global technology and professional services company that delivers consulting, technology and mission services to every major agency across the U.S. government, defense and intelligence community. Our 30,000 experts extract the power of technology to create immediate value and deliver solutions at the edge of innovation. We operate across 50 countries worldwide, offering leading capabilities in digital modernization, AI/ML, Cloud, Cyber and application development. Together with our clients, we strive to create a safer, smarter world by harnessing the power of deep expertise and advanced technology.
Join our Talent Community to stay up to date on our career opportunities and events at
gdit.com/tc.
Equal Opportunity Employer / Individuals with Disabilities / Protected Veterans
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.