Cloud Engineer - Observability & SRE

Plano, TX, US • Posted 1 hour ago • Updated 1 hour ago
Contract W2
Contract Corp To Corp
On-site
Depends on Experience
Company Branding Image
Fitment

Dice Job Match Score™

⭐ Evaluating experience...

Job Details

Skills

Summary

Role Summary

A senior Cloud Engineer with expertise in building and managing scalable observability and infrastructure platforms for enterprise-level cloud microservices environments. This hybrid role demands hands-on experience with container orchestration, cloud infrastructure automation, and high-volume monitoring systems. The engineer will own end-to-end components, support production operations, and leverage AI tools for system troubleshooting and code generation.

Responsibilities

  • Design, develop, and operate observability platforms enabling logging, metrics collection, and tracing for cloud-based microservices applications.
  • Manage and optimize large-scale Kubernetes clusters across multiple regions, including Helm chart management, pod scheduling, and resource tuning.
  • Own and maintain CI/CD pipelines using tools such as Argo CD, Helm, and GitOps methodologies to ensure reliable deployment workflows.
  • Implement Infrastructure as Code (IaC) solutions utilizing Terraform on AWS to provision and manage cloud infrastructure at scale.
  • Operate and maintain monitoring ecosystems including OpenSearch/Elasticsearch, Prometheus, Grafana, Splunk, and Kafka, ensuring high availability and performance.
  • Develop automation solutions to detect, respond, and remediate production issues proactively.
  • Ensure security and compliance by managing vulnerability patching and automating security best practices in container environments.
  • Collaborate with cross-functional teams to improve system reliability, scalability, and performance, contributing to distributed system design.
  • Participate in on-call rotations, incident response, and post-incident analysis to uphold SLA commitments.
  • Utilize AI-assisted coding and troubleshooting tools to accelerate system development, automation, and incident resolution.

Qualifications

  • Bachelor''s degree in Computer Science, Information Technology, or related field.
  • Minimum of 8 years of experience in DevOps, SRE, or platform engineering roles supporting production cloud environments.
  • Proven incident response experience, including alert triage, root cause analysis, and SLA management in 24/7 operations.
  • Expertise in Infrastructure as Code principles with proficiency in Terraform, Ansible, or similar automation tools for cloud provisioning.
  • Strong scripting skills in Python, Golang, or Bash for automation, tooling, and CI/CD pipeline integration.
  • Extensive experience operating and troubleshooting large-scale Kubernetes workloads, including Helm chart management and multi-cluster orchestration.
  • Hands-on knowledge of observability stacks such as OpenSearch, Prometheus, Grafana, Loki, and Splunk, including query optimization and capacity planning.
  • Familiarity with Kafka and AWS MSK, including cluster operation, topic configuration, and schema management.
  • Experience deploying, managing, and migrating Splunk Enterprise environments with Kubernetes-based log shipping architectures.
  • Working knowledge of OpenTelemetry, distributed tracing, and application performance monitoring in cloud environments.
  • Understanding of security frameworks, container hardening practices, and vulnerability remediation at scale, including standards such as FedRAMP, STIG, IL5, ISO 27001, and SOC 2.
  • Experience using AI tools like LLMs, GitHub Copilot, or custom AI agents to enhance operational workflows and incident management.
  • Effective communication skills and the ability to work independently in a hybrid work setting.

Publishing Pay Range: $65.00 - $67.00 hourly

This position offers a hybrid schedule, with time split between the office and remote work.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10112156
  • Position Id: 112574
  • Posted 1 hour ago

Company Info

About GDH

GDH is a technology workforce solutions provider committed to always giving and delivering more. Better talent. More client and consultant support. Greater service. World-class outcomes. Providing technology staffing, project solutions, and recruitment process outsourcing (RPO), we will be able to deepen our understanding of your business challenges, stay up to date with industry trends, and enhance our ability to create custom solutions to help achieve your business outcomes. We have established ourselves as a trusted partner to countless businesses operating in the communications sector. Our primary goal is to source and recruit the most talented professionals, assemble teams of skilled specialists, create innovative recruitment and professional services strategies that drive growth and foster innovation.

GDH Benefits

GDH offers a range of employee benefits that are designed to promote well-being and help maintain a healthy work-life balance. These comprehensive benefits cover various aspects of an employee's life and aim to enhance their overall experience with the company. Our health benefits include three medical insurance options with access to KISx Card, Zero Card, and HealthJoy concierge services. Other plan offerings include dental, vision, life, disability, supplemental insurance, and pet insurance plans. Enjoy additional perks like holiday pay, 401(k) plan, direct deposit, an employee referral program, work-life balance benefits, a Wellbeats membership, a discounted gym membership program, and more!  For more detailed information on benefits, please go to GDH’s website under the tab for candidates.

GDH provides equal employment opportunities (EEO) to all employees and applicants for
employment without regard to race, color, religion, sex, national origin, age, disability, genetic information, veteran's status or any other category protected by law. In addition to federal law requirements, GDH Consulting, Inc. complies with applicable state and local laws governing nondiscrimination in employment in every location in which the company has facilities and/or employees. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, benefits and training. Applicants with disabilities that require an accommodation or assistance in applying and/or for interviewing, please contact our HR Department.

Please visit GDH's website for notice of collection for California applicants.

 

About_Company_One
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Remote

Today

Easy Apply

Contract, Third Party

Depends on Experience

Columbus, Ohio

6d ago

Easy Apply

Third Party, Contract

Depends on Experience

San Jose, California

Today

Easy Apply

Contract, Third Party

Depends on Experience

San Jose, California

Today

Easy Apply

Contract, Third Party

Depends on Experience

Search all similar jobs