OpenStack Infrastructure Engineer

Overview

On Site
$150,000 - $200,000
Full Time
No Travel Required

Skills

Ansible
Cloud Computing
Data Centers
Continuous Delivery
Linux
Terraform
Kubernetes
Docker
OpenStack

Job Details

Job Title: OpenStack Infrastructure Engineer
Location: Onsite in McKinney, TX (with flexibility)
Compensation: $150,000 $190,000/year (based on experience)
Type: Full-Time, Direct Hire


Overview:

We are seeking an experienced OpenStack Infrastructure Engineer to lead the design, deployment, and optimization of our enterprise-scale infrastructure supporting hybrid cloud and colocation environments. This is a critical, hands-on role focused on delivering resilient, secure, and high-performing OpenStack platforms that power our virtualized services across multiple data centers.

You ll work onsite in McKinney, TX, with flexibility, and play a key role in bridging physical infrastructure, automation, and scalable cloud-native operations. If you're passionate about open infrastructure, bare-metal performance, and driving 99.999% uptime systems this role is for you.


Key Responsibilities:

< data-start="958" data-end="1001">Infrastructure Design & Deployment</>
  • Architect and deploy OpenStack clusters across distributed, geo-redundant data centers.

  • Ensure availability and fault tolerance for hybrid workloads with a target of 99.999% uptime.

  • Evaluate, procure, and manage compute, storage, and networking hardware aligned with OpenStack/Ceph requirements.

< data-start="1305" data-end="1356">Data Center Operations & Capacity Planning</>
  • Collaborate with colocation providers to ensure appropriate power, cooling, and rack space.

  • Utilize DCIM tools to plan and manage rack-level resources and capacity.

  • Conduct quarterly audits to monitor resource utilization and anticipate upgrade cycles.

< data-start="1617" data-end="1647">Networking & Security</>
  • Configure SD-WAN, AWS Direct Connect, and hybrid connectivity strategies.

  • Enforce OpenStack and Linux-based security controls (Keystone, Neutron, encrypted Ceph).

  • Conduct regular security audits and lead remediation efforts.

< data-start="1880" data-end="1915">Automation & Observability</>
  • Automate deployments using Terraform, Ansible, MaaS, and other IaC tooling.

  • Build predictive monitoring pipelines using OpenTelemetry, Grafana, Prometheus, and Loki.

  • Create self-healing infrastructure patterns to minimize MTTR.


Key Performance Indicators (KPIs):

  • Task Timeliness: 80% on-time completion.

  • MTTR (Mean Time to Recovery): < 1 hour.

  • Root Cause Analysis: Preliminary RCA within 24 hours; final RCA within 3 days of incident closure.


Required Competencies:

  • Expert in OpenStack (Nova, Neutron, Keystone), Ceph, and virtualization technologies (KVM/QEMU).

  • Strong proficiency in IaC: Terraform, OpenTofu, Pulumi, Ansible.

  • Deep knowledge of Linux internals and performance analysis (eBPF).

  • Experience with DCIM platforms and hardware lifecycle management.

  • Proficient in observability tools: APM, Grafana, Prometheus, Loki, OpenTelemetry.

  • Strong background in networking, protocols, and secure design practices.

  • Proven success managing container environments (Kubernetes, Docker, Helm).

  • Familiarity with Git, CI/CD (GitHub Actions, Argo), and ticketing systems (Jira, Azure DevOps).


Ideal Attributes:

  • High ownership mentality and ability to independently lead initiatives.

  • Curiosity-driven with a passion for continuous learning.

  • Strong communicator with cross-functional team experience.

  • Ability to prioritize and manage competing demands with clarity and confidence.

  • Empathetic leader who thrives in collaborative engineering environments.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About GTN Technical Staffing