Senior SRE

    • Epic Games
  • Cary, NC
  • Posted 13 days ago | Updated 2 hours ago

Overview

On Site
Full Time

Skills

Game development
Incident management
Collaboration
Problem solving
IaaS
Reliability engineering
Life insurance
3D computer graphics
Unreal Engine
Architectural design
Epic
Creativity
Data
Management
FOCUS
Specification
Documentation
Testing
Debugging
UI
IMPACT
SAP BASIS
Grafana
Tableau
Amazon Web Services
Python
Ruby
Coaching
Value engineering
Virtual reality
Media
Policies
Recruiting

Job Details

WHAT MAKES US EPIC?

At the core of Epic's success are talented, passionate people. Epic prides itself on creating a collaborative, welcoming, and creative environment. Whether it's building award-winning games or crafting engine technology that enables others to make visually stunning interactive experiences, we're always innovating.

Being Epic means being a part of a team that continually strives to do right by our community and users. We're constantly innovating to raise the bar of engine and game development.

LIVEOPS
What We Do

The Epic LiveOps team provides the best possible experience for our players. We dive deep into the data to understand player needs, minimize disruption, and manage Epic's incident response process.
What You'll Do

You will be the voice of the customer in a wide variety of contexts across Epic's business. You will dive deep into incidents to make sure we are providing our players the best possible experience, we hold a relentlessly high bar for Epic's service quality, we focus the attention of our business and tech teams on the right priorities, and operate Epic's incident management process. When other mechanisms fail, LiveOps is the backstop that ensures that Epic operates in the best interest of our players' experience
In this role, you will
  • Respond to alerts and manage issues in the production environment
  • Our Site Reliability Engineers manage the development and operation of our Incident Management Tooling, ensuring robust tooling support for the Incident process
  • Produce specifications and determine the operational feasibility of our tooling
  • Develop quality standards, documentation and testing for our tools codebase Maintain, improve, troubleshoot, debug and update our codebases
  • Develop automated tooling features to drive incident management improvements and reduce the operational cost of the Incident process
  • Work across the stack: Backend, frontend, infrastructure, operation to test, deploy, and iterate based on stakeholder feedback
What we're looking for
  • You thrive on ambiguity. You can understand a diverse set of product features and both identify how an issue impacts a single customer, and can quantify the business impact. You're capable of identifying larger trends surrounding disparate issues and enable product teams to solve the real underlying issues
  • You have a strong technical basis and know how to learn new things. Strong analysis and problem solving skills are essential to do this role successfully (we live in Grafana, Tableau, and similar tools)
  • You are a problem solver with experience with AWS and other cloud infrastructure tools will make you comfortable in this role and the ability to script and automate actions in languages like Python, Ruby, or Go is a bonus
  • You have experience working cross-functionally or across a large number of teams in multiple organizations
  • You have extensive experience working with and building reliable services on AWS or other major cloud infrastructure providers
  • You have a passion for the reliability engineering space
EPIC JOB + EPIC BENEFITS = EPIC LIFE

Our intent is to cover all things that are medically necessary and improve the quality of life. We pay 100% of the premiums for both you and your dependents. Our coverage includes Medical, Dental, a Vision HRA, Long Term Disability, Life Insurance & a 401k with competitive match. We also offer a robust mental well-being program through Modern Health, which provides free therapy and coaching for employees & dependents. Throughout the year we celebrate our employees with events and company-wide paid breaks. We offer unlimited PTO and sick time and recognize individuals for 7 years of employment with a paid sabbatical.

ABOUT US

Epic Games spans across 19 countries with 55 studios and 4,500+ employees globally. For over 25 years, we've been making award-winning games and engine technology that empowers others to make visually stunning games and 3D content that bring environments to life like never before. Epic's award-winning Unreal Engine technology not only provides game developers the ability to build high-fidelity, interactive experiences for PC, console, mobile, and VR, it is also a tool being embraced by content creators across a variety of industries such as media and entertainment, automotive, and architectural design. As we continue to build our Engine technology and develop remarkable games, we strive to build teams of world-class talent.
Like what you hear? Come be a part of something Epic!

Epic Games deeply values diverse teams and an inclusive work culture, and we are proud to be an Equal Opportunity employer. Learn more about our Equal Employment Opportunity (EEO) Policy here .

Note to Recruitment Agencies: Epic does not accept any unsolicited resumes or approaches from any unauthorized third party (including recruitment or placement agencies) (i.e., a third party with whom we do not have a negotiated and validly executed agreement). We will not pay any fees to any unauthorized third party. Further details on these matters can be found here .