Site Reliability Engineer

Overview

Remote
Depends on Experience
Contract - W2
Contract - 26 week(s)

Skills

PYTHON

Job Details

Senior Site Reliability Engineer (SRE) – Data Platform

Job at a Glance
Location: Orlando, FL; remote (Eastern Time Zone)
Contract Length: W2 Contract, 6 months with possible extension or conversion
Pay: $67/hr - $72/hr + medical, dental, vision, 401k match

About the Role
We’re seeking a Senior Site Reliability Engineer (SRE) to join our Data Platform team at a leading global media organization. This is a mission-critical role focused on designing, scaling, and sustaining the core platform that powers data products and insights across both digital and physical experiences.
As an SRE, you’ll bring a software engineering mindset to operations—automating processes, reducing manual toil, and enhancing the availability and observability of high-throughput, real-time data systems. You’ll work at the intersection of data engineering, DevOps, and infrastructure, collaborating closely with cross-functional teams to build a robust and scalable data platform.

Key Responsibilities
  • Design, build, and maintain scalable, resilient infrastructure for the core data platform.
  • Implement observability frameworks and telemetry pipelines using tools like OpenTelemetry, Prometheus, Grafana, Datadog, and Honeycomb.
  • Automate operational processes through infrastructure-as-code and CI/CD tooling (Terraform, AWS CDK, etc.).
  • Monitor and optimize platform health through SLAs, SLOs, and SLIs, ensuring high availability and performance.
  • Lead incident response, root cause analysis, and drive a culture of continuous improvement with blameless postmortems.
  • Own and improve DORA metrics (e.g., MTTR, deployment frequency).
  • Collaborate with data engineers, developers, and architects to operate petabyte-scale, real-time data pipelines.
  • Write clean, testable, and reliable code; lead technical design and code reviews.

Basic Qualifications
  • 6+ years of professional experience in software engineering, reliability, infrastructure, or platform development.
  • Strong programming skills in Python and at least one statically typed language (Java, Go, TypeScript, etc.).
  • Deep hands-on experience with AWS services: Lambda, ECS/EKS, S3, IAM, API Gateway, SNS/SQS, Kinesis.
  • Proven experience operating and scaling distributed systems in production.
  • Advanced knowledge of observability design: tracing, metrics, and logging across complex systems.
  • Proficiency with CI/CD pipelines, DevOps practices, and infrastructure-as-code.
  • Solid understanding of SQL/NoSQL databases and architectural trade-offs in large-scale environments.
  • Familiarity with agile workflows, code review practices, and modern SDLC processes.

Preferred Qualifications
  • Experience with real-time data processing, streaming platforms, or analytics infrastructure.
  • Hands-on experience with Datadog, particularly for monitoring serverless applications.
  • Strong skills in performance profiling, distributed tracing, and root cause analysis.
  • Demonstrated success improving system reliability and delivery metrics (MTTR, change failure rate, etc.).
  • Understanding of cloud compliance, governance, and security best practices.
  • Background in media, entertainment, or high-traffic consumer platforms.

Education
  • Required: Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience).
  • Preferred: Advanced degree or specialized certifications in cloud, data engineering, or reliability engineering.

Who You Are
  • A systems thinker who enjoys solving complex challenges at scale.
  • Passionate about automation, observability, and performance in data-intensive systems.
  • A technical leader who balances architectural vision with hands-on execution.
  • Thrives in a fast-paced, collaborative, and high-ownership culture.
  • Motivated by impact and excited to support storytelling, personalization, and user experience for millions worldwide.
