Senior Site Reliability Engineer - Cassandra DB

    • Grubhub
  • Posted 60+ days ago | Updated 4 hours ago


Full Time


IBM Rational DOORS
Apache Kafka
Software engineering
Incident management
Backup administration
Cloud computing
Apache Cassandra
Amazon Web Services
Amazon EC2
Amazon S3
Amazon SQS

Job Details

About The Opportunity
We're all about connecting hungry diners with our network of over 300,000 restaurants nationwide. Innovative technology, user-friendly platforms and streamlined delivery capabilities set us apart and make us an industry leader in the world of online food ordering. When you join our team, you become part of a community that works together to innovate, solve problems, grow, work hard and have a ton of fun in the process!

Why Work For Us

Grubhub is a place where authentically fun culture meets innovation and teamwork. We believe in empowering people and opening doors for new opportunities. If you're looking for a place that values strong relationships and embraces diverse ideas, all while having fun together, Grubhub is the place for you!

We are looking for a Senior Site Reliability Engineer to join our Database Engineering organization. At Grubhub, the Database Engineering organization owns the top-level reliability, observability, and availability of the Datastore platforms, including but not limited to Cassandra, Elasticsearch, and Kafka. This team contributes to projects, services, designs, and processes with the aim of stewarding good architecture and providing tools and services that enable software engineering teams to measure and meet reliability agreements.

The Impact You Will Make
  • Manage large critical Cassandra and Elasticsearch clusters supporting millions of transactions per day
  • Build systems to automate all build and maintenance tasks using Ansible and Python
  • Develop self-service tools to allow engineers to manage and provision resources with Grubhub best practices
  • Monitor cluster availability, read/write latencies, and other important performance metrics to proactively identify SLO misses and help mitigate issues
  • Evaluate new technologies and software versions. Test and develop roadmaps
  • Tune Cassandra and Elasticsearch databases to optimize throughput and read/write latencies
  • Participate in a 24x7 on-call rotation with the rest of the team for rapid incident response
  • Implement DR strategies, including backup and recovery techniques with minimal downtime
  • Work with other engineers to manage our data persistence integration and performance with the Grubhub platform
  • Monitor and scale Elasticsearch/Cassandra clusters to handle growth in traffic

What You Bring To The Table
  • Experience developing backend applications in Python or Java
  • Experience managing, working with, or developing large Elasticsearch clusters in highly available 24x7 production environments
  • Experience automating the maintenance of infrastructure using Python and Ansible or similar tools
  • Experience managing automated cloud infrastructures on AWS or other major cloud providers
  • Experience managing large Cassandra clusters in production is a strong plus
  • Experience working with Docker is a plus
  • Ability to quickly learn new concepts and technologies and adapt to changing needs

About Our Tech
  • Most of our internal tooling is written in Python
  • Most of our microservices are written in Java
  • Observability tools we use: Datadog, Splunk, Lightstep
  • Our primary persistence store is Cassandra
  • We operate in 3 Amazon regions (hot+hot+hot)
  • We primarily rely on AWS and its services: EC2, S3, SNS/SQS, ElastiCache, Lambda, etc.

And Of Course, Perks!

  • Flexible PTO. Grubhub employees enjoy a generous amount of time to recharge.
  • Health and Wellness. Excellent medical, dental and vision benefits, 401k matching, employee network groups and paid parental leave are just a few of our programs to support your overall well-being.
  • Compensation. You'll receive a highly competitive compensation package with eligibility for generous incentives, bonuses, commission, and RSUs.
  • Free Meals. Our employees get a weekly Grubhub credit to enjoy and support local restaurants.
  • Social Impact. We believe in giving back through programs like the Grubhub Community Relief Fund, and provide our employees opportunities to support causes that are important to them.

Grubhub is an equal opportunity employer. We welcome diversity and encourage a workplace that is just as diverse as the customers we serve. We evaluate qualified applicants without regard to race, color, religion, age, sex, sexual orientation, gender identity, national origin, disability, veteran status, and other legally protected characteristics. If you're applying for a job in the U.S. and need a reasonable accommodation for any part of the employment process, please send an email to and let us know the nature of your request and contact information. Please note that only those inquiries concerning a request for reasonable accommodation will be responded to from this email address.

If you are a resident of the State of California and would like a copy of our CA privacy notice, please email