Lead DevOps/SRE Engineer

Remote • Posted 4 hours ago • Updated 4 hours ago
Full Time

Job Details

Skills

  • DevOps
  • Digital Media
  • PaaS
  • Microservices
  • Grafana
  • IaaS
  • Continuous Integration
  • Continuous Delivery
  • Cost Control
  • Scratch
  • SLA
  • Dashboard
  • Amazon Web Services
  • Migration
  • System On A Chip
  • Auditing
  • Terraform
  • Git
  • Integration Testing
  • Reporting
  • Budget
  • eXist
  • Documentation
  • Regulatory Compliance
  • Leadership
  • Switches
  • Communication
  • Value Engineering

Summary

WHO ARE WE?

Launch Potato is a profitable digital media company that reaches 30M+ monthly visitors through brands such as FinanceBuzz, All About Cookies, and OnlyInYourState.

As The Discovery and Conversion Company, our mission is to connect consumers with the world's leading brands through data-driven content and technology.

Headquartered in South Florida with a remote-first team spanning over 15 countries, we've built a high-growth, high-performance culture where speed, ownership, and measurable impact drive success.

WHY JOIN US?

At Launch Potato, you'll accelerate your career by owning outcomes, moving fast, and driving impact with a global team of high-performers.

MUST HAVE:
  • 5+ years of production AWS infrastructure experience with deep Terraform expertise.
  • Hands-on experience building an SRE function from scratch, with complete ownership of it.
  • Experience with a multi-site company where PaaS or microservices are required.
  • CI/CD pipeline ownership in one or more previous roles.
  • PagerDuty experience, including standing up an on-call rotation.

EXPERIENCE: 5+ years hands-on with AWS, Terraform, CI/CD pipeline ownership, and SRE tooling (OpenTelemetry, Grafana, PagerDuty or equivalent) in a production environment.

YOUR ROLE

Own and evolve Launch Potato's cloud infrastructure, CI/CD platform, and compliance posture. Build the SRE function from the ground up so product teams can ship faster without compromising reliability, security, or cost control.

OUTCOMES
  • Stand up the SRE practice from scratch: on-call rotation, PagerDuty configuration, SLA/SLO definitions for core infrastructure services, runbook library, and observability dashboards that tie site performance to business metrics.
  • Complete the AWS multi-account migration: move production workloads to an isolated account with zero unplanned downtime.
  • Deliver a SOC 2 Type I audit-ready infrastructure evidence package: own the technical controls implementation end-to-end.
  • Version and publish the Terraform module library: release 30+ modules to a private registry to eliminate ad hoc Git consumption by product teams.
  • Implement automated deployment rollback for ECS and Lambda: gate production on integration test passage.
  • Stand up monthly cost reporting to leadership: budget anomaly detection, savings plan recommendations, spend by service/team/environment.

COMPETENCIES
  • Ownership orientation: You don't wait to be assigned a problem. If something is broken, undocumented, or a risk, you flag it and fix it. If the runbooks don't exist yet, you write them.
  • Documentation discipline: You write things down. Runbooks, decision rationale, architecture patterns, incident post-mortems. The next person should be able to understand your work without asking you.
  • Cost consciousness: You think about the business impact of infrastructure decisions. You can explain a spending anomaly to a CFO in plain language. You know what things cost before you build them.
  • Calm under pressure: Production incidents happen. You triage clearly, communicate proactively with technical and non-technical stakeholders, and run a tight post-mortem without blame. You've been woken up at 3am. You can handle it.
  • Cross-functional communication: You can work with product engineers, legal/compliance, and executive leadership in the same week without switching communication modes awkwardly. You speak both engineer and business.
  • Proactive reliability: A good SRE reacts to outages. A great SRE catches degradation before it becomes an outage. You build alerting against the patterns, not just the failures.

TOTAL COMPENSATION

Base salary is set according to market rates for the nearest major metro and varies based on Launch Potato's Levels Framework. Your compensation package includes a base salary, profit-sharing bonus, and competitive benefits. Launch Potato is a performance-driven company, which means that once you are hired, future increases will be based on company and personal performance, not annual cost-of-living adjustments.

Want to accelerate your career? Apply now!

Since day one, we've been committed to having a diverse, inclusive team and culture. We are proud to be an Equal Employment Opportunity company. We value diversity, equity, and inclusion.

We do not discriminate based on race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 91144706
  • Position Id: c912177c4a08859643ce8c3b3fb8f6e7