Overview
On Site
Hybrid
USD 65.00 per hour
Full Time
Skills
Computer Science
Server Administration
Linux Administration
HPC
Performance Tuning
Job Scheduling
Orchestration
Grafana
Ansible
Terraform
Bash
Python
Problem Solving
Conflict Resolution
Communication
PyTorch
TensorFlow
Cloud Computing
Amazon Web Services
Microsoft Azure
Google Cloud
Google Cloud Platform
Optimization
High Performance Computing
Machine Learning (ML)
Management
CUDA
Computer Hardware
High Availability
Collaboration
DevOps
Artificial Intelligence
Workflow
Docker
Kubernetes
Regulatory Compliance
Lifecycle Management
GPU
Servers
Documentation
Scripting
Training
Job Details
Date Posted: 11/10/2025
Hiring Organization: Rose International
Position Number: 491252
Industry: Manufacturing
Job Title: Senior Server Administrator
Job Location: Chicago, IL, USA, 60661
Work Model: Hybrid
Work Model Details: Onsite 1-5 days/week
Shift: M-F, 8-5
Employment Type: Temporary
FT/PT: Full-Time
Estimated Duration (In months): 13
Min Hourly Rate($): 65.00
Max Hourly Rate($): 76.00
Must Have Skills/Attributes: Ansible, BASH, GPUs (Graphics Processing Units), Grafana, Kubernetes, Linux, Performance Tuning
Experience Desired: Linux System Administration (5 yrs); Experience with NVIDIA GPU-based systems (3 yrs)
Required Minimum Education: Bachelor's Degree
**C2C is not available**
Job Description
***Position can be located in Chicago, IL/Dallas, TX/Peoria, IL/Phoenix, AZ/Broomfield, CT/Cary, NC
Education Requirements:
- Bachelor's in Computer Science or related field.
Required Skills for the Senior Server Administrator:
- Minimum of 8 years of work experience, including 5+ years in server administration, with at least 3 years focused on NVIDIA GPU-based systems.
- Deep understanding of Linux system administration, especially in HPC or AI environments.
- Hands-on experience with NVIDIA GPU drivers, CUDA toolkit, and performance tuning.
- Familiarity with Slurm, Kubernetes, or other job scheduling and orchestration tools.
- Experience with monitoring tools (e.g., Prometheus, Grafana) and infrastructure automation (e.g., Ansible, Terraform).
- Strong scripting skills (Bash, Python, etc.).
- Excellent problem-solving and communication skills.
Desired Skills:
- NVIDIA Certified Professional or similar credentials.
- Experience with multi-GPU and multi-node training setups.
- Familiarity with AI/ML frameworks (e.g., PyTorch, TensorFlow) and their GPU dependencies.
- Exposure to cloud-based GPU infrastructure (AWS, Azure, Google Cloud Platform).
Senior Server Administrator Overview:
We are seeking a highly skilled Senior Server Administrator to join our AI Engineering team. This role is critical to the deployment, maintenance, and optimization of high-performance computing infrastructure, specifically leveraging NVIDIA's advanced GPU technologies. You will work closely with AI researchers, data scientists, and software engineers to ensure our systems are robust, scalable, and tuned for cutting-edge machine learning workloads.
Responsibilities:
- Administer and maintain GPU-accelerated servers and clusters, including NVIDIA A100, H100, and other high-end GPU sets.
- Manage and optimize NVIDIA software stack components such as CUDA, cuDNN, TensorRT, NCCL, and NGC containers.
- Monitor system performance, troubleshoot hardware/software issues, and ensure high availability of AI infrastructure.
- Collaborate with DevOps and AI teams to support containerized workflows (Docker, Kubernetes) and distributed training environments.
- Implement security best practices and ensure compliance with internal and external standards.
- Lead upgrades, patching, and lifecycle management of GPU servers and related infrastructure.
- Provide documentation, automation scripts, and training for internal teams.
Benefits:
For information and details on employment benefits offered with this position, please visit here. Should you have any questions/concerns, please contact our HR Department via our secure website.
California Pay Equity:
For information and details on pay equity laws in California, please visit the State of California Department of Industrial Relations' website here.
Rose International is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, age, sex, sexual orientation, gender (expression or identity), national origin, arrest and conviction records, disability, veteran status or any other characteristic protected by law. Positions located in San Francisco and Los Angeles, California will be administered in accordance with their respective Fair Chance Ordinances.
If you need assistance in completing this application, or during any phase of the application, interview, hiring, or employment process, whether due to a disability or otherwise, please contact our HR Department.
Rose International has an official agreement (ID #132522), effective June 30, 2008, with the U.S. Department of Homeland Security, U.S. Citizenship and Immigration Services, Employment Verification Program (E-Verify). (Posting required by OCGA 13-10-91.)
**Only those lawfully authorized to work in the designated country associated with the position will be considered.**
**Please note that all position start dates and durations are estimates and may be reduced or lengthened based upon a client's business needs and requirements.**