Overview
Hybrid
Depends on Experience
Contract - W2
Skills
Linux
rhel
Fedora
Shell
InfluxDB
Grafana
Job Details
Role: Infrastructure Engineer
Location: Charlotte, NC (Hybrid)
Duration: Long-Term Contract
Employment Type: W2 Only
Location: Charlotte, NC (Hybrid)
Duration: Long-Term Contract
Employment Type: W2 Only
Job Summary:
We are seeking a skilled and motivated Infrastructure Engineer to support and maintain a high-performance big data infrastructure environment for a global financial institution. You will be responsible for ensuring the performance, reliability, and stability of a large-scale Cloudera Hadoop storage cluster and a 30K-core Apache Spark compute GRID, supporting over 150 advanced model developers.
This role requires strong experience in Linux system administration (RHEL/Fedora), monitoring and observability tooling (InfluxDB, Grafana, Telegraf), and automation scripting (Python, Shell), along with solid troubleshooting capabilities across infrastructure and application layers.
Key Responsibilities:
- Manage and support a large-scale big data infrastructure (Cloudera Hadoop & Apache Spark GRID).
- Develop and maintain monitoring/alerting solutions with tools like InfluxDB, Grafana, and Telegraf.
- Automate diagnostics and observability using Python and Shell scripting.
- Administer Linux-based systems (primarily RHEL/Fedora).
- Support CI/CD tools and workflows (Jenkins, GitHub, Ansible).
- Troubleshoot and resolve issues across infrastructure, applications, and network layers.
- Collaborate with cross-functional teams, including PhD-level model developers, to optimize infrastructure performance.
- Participate in infrastructure upgrades, releases, and patching.
- Maintain and validate access/security controls and secrets management tools like HashiCorp Vault.
Top Skills (Priority-wise):
Primary:
- Strong Unix/Linux Administration (RHEL, Fedora)
- Unix Shell and Python Scripting
Secondary (any one group preferred):
Group 1:
- InfluxDB / Time-series databases
- Grafana (Dashboards and Monitoring)
Group 2:
- GRID Computing/Cluster Management experience
- Performance Tuning of large compute environments
Required Qualifications:
- 3 5 years of hands-on experience in Linux Administration (RHEL/Fedora).
- Strong experience with observability tools: InfluxDB, Telegraf, and Grafana.
- Experience building monitoring and diagnostic tools/integrations.
- Proficient in Shell and Python scripting.
- Familiarity with CI/CD tools like Jenkins, GitHub, and configuration tools like Ansible.
- Exposure to secrets management tools (e.g., HashiCorp Vault).
- Strong problem-solving skills with the ability to troubleshoot across multiple technical layers.
Preferred Qualifications:
- Experience in Site Reliability Engineering (SRE) or DevOps roles.
- Exposure to Cloudera Hadoop environments.
- Understanding of containerization technologies (Docker, Kubernetes).
- Familiarity with access management tools like Active Directory and Kerberos.
- A self-starter with a collaborative and service-oriented mindset.
Looking forward to hearing from you.
Thanks & Regards
MD TOUHEED ALAM
SR. TECHNICAL RECRUITER
PURPLE HIRES INC.
Email -
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.