Site Reliability Engineer, Fleet Automation

Overview

Remote
USD 147,300.00 - 199,300.00 per year
Full Time

Skills

Testing
Storage
MAGIC
Blogging
Meta-data Management
Management
Computer Networking
Provisioning
Operating Systems
Dropbox
Computer Science
Physics
Mathematics
Software Design
Fluency
Computer Hardware
Linux
Debugging
DHCP
Dragon NaturallySpeaking
DNS
NTP
PXE
Configuration Management
Ansible
Progress Chef
Puppet
Cloud Computing
Amazon Web Services
Google Cloud
Google Cloud Platform
Microsoft Azure

Job Details

Role Description

Site Reliability Engineers on the Fleet Automation team are critical to Dropbox's success. The SRE team has a major impact across Dropbox engineering, from testing our disaster readiness to building our in-house multi-exabyte storage system, Magic Pocket. Check out the Dropbox Tech Blog to learn more!

The Site Reliability team consists of hybrid systems and software engineers who own the management of large-scale infrastructure while improving its reliability and automation. SREs are integrated within the Platform team, and we're looking for engineers who want to develop, maintain, and scale infrastructure software. You will join a small, impactful team within Dropbox that significantly influences the world.

Our Engineering Career Framework is publicly viewable and describes what's expected of our engineers at each career level. Check out our blog post on this topic and more here.

Responsibilities
  • Build scalable infrastructure to manage metadata for hundreds of billions of files, hundreds of petabytes of user data, and millions of concurrent connections
  • Design the systems and processes that Dropbox engineers use to manage and deploy their software into production
  • Automate the server provisioning process to reduce the labor of our networking engineering and datacenter operations teams. Once we plug a new server in, it walks itself through all aspects of provisioning to join the fleet without any human involvement
  • Own foundational services that serve as core components of the fleet, such as DHCP, DNS, NTP, and PXE
  • Build and test the latest operating system and kernel releases, and keep the fleet up to date with them
  • Own services that monitor the health of our fleet and remediate unhealthy hosts

Many teams at Dropbox run services with on-call rotations, which entails being available for calls during both core and non-core business hours. If a team has an on-call rotation, all engineers on the team are expected to participate in the rotation as part of their employment. Applicants are encouraged to ask for more details about the rotation of the team to which they are applying.

Requirements
  • BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent technical experience
  • 2+ years of industry experience
  • Demonstrated proficiency in software design and systems fluency, including enough understanding of operating systems, networks, or hardware to debug system issues and identify bottlenecks
  • Experience using monitoring tools to maintain the reliability of production services
  • Experience working with Linux in a production environment
  • Ability to diagnose technical problems, debug code, and automate routine tasks
  • Demonstrated skill developing distributed systems
  • Familiarity with fundamental services, such as DHCP, DNS, NTP, or PXE
  • Familiarity with config management tools, such as Ansible, Chef, or Puppet
Preferred Qualifications
  • Experience with cloud compute services, such as AWS, Google Cloud, or Microsoft Azure
  • Enthusiasm for new initiatives: contributing new ideas and approaches, experimenting with them, and sharing what you learn
Compensation

US Zone 1

This role is not available in Zone 1

US Zone 2

$147,300-$199,300 USD

US Zone 3

$130,900-$177,100 USD