Overview
On Site
Full Time
Skills
Root Cause Analysis
FOCUS
Agile
AIM
Distributed Computing
Systems Architecture
Scalability
Computer Science
Java
C++
C
Pick
Problem Solving
Conflict Resolution
Apache Lucene
Elasticsearch
Search Technologies
Law
Legal
Collaboration
Job Details
About the Role
At Uber, we provide a centralized, reliable, and interactive observability data platform that includes metrics, logging, and tracing. This platform empowers engineers with the tools needed for monitoring, troubleshooting, and performing root cause analysis at scale.
The Metrics team, a core component of this platform, is responsible for delivering a cutting-edge, end-to-end distributed metrics solution designed to operate at Uber's scale. Our system ingests over 5 billion metrics per second and handles over 25K queries per second, with cardinality up to 500K. As Uber continues to grow, our system evolves to meet increasing demands. Beyond solving the inherent challenges of large-scale distributed systems-such as scalability, availability, performance, and efficiency-we also focus on providing intelligent insights that identify issues before they impact our customers, ensuring a seamless experience for millions of users worldwide.
We are a dynamic, agile team with members from diverse backgrounds. We dream big, aim high, and execute with precision. Join us to tackle challenging large-scale distributed system problems and raise the bar of observability for Uber.
What the Candidate Will Do
1. Design system architecture, own key components to deliver a centralized metric system for Uber.
2. Join on call rotation, driving continuous improvements on system availability, scalability, performance and efficiency.
3. Collaborate with other infrastructure teams, production engineer team and product manager to drive adoption and best practices, and to design and implement high impact, cross-product features.
Basic Qualifications
1. BS or higher degree in Computer Science, or a related technical discipline, or equivalent experience.
2. Proficient in one or more backend languages, like Java, Go, C/C++, C, with the ability to pick up new ones quickly.
3. Strong problem solving skills, with relevant experience in designing and implementing large scale distributed backend services
Preferred Qualifications
1. Proven record of building and operating highly reliable distributed systems at scale.
2. Experience with OpenTelemetry, Prometheus, Influx and/or building and operating monitoring infrastructure at large scale.
3. Under the hood experience with Apache Lucene, ElasticSearch, OpenSearch and other Search technologies is a big plus.
4. Batch and stream data process pipeline experience is a plus.
Uber's mission is to reimagine the way the world moves for the better. Here, bold ideas create real-world impact, challenges drive growth, and speed fuels progress. What moves us, moves the world - let's move it forward, together.
Uber is proud to be an Equal Opportunity employer. All qualified applicants will receive consideration for employment without regard to sex, gender identity, sexual orientation, race, color, religion, national origin, disability, protected Veteran status, age, or any other characteristic protected by law. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you have a disability or special need that requires accommodation, please let us know by completing [this form](;br>
Offices continue to be central to collaboration and Uber's cultural identity. Unless formally approved to work fully remotely, Uber expects employees to spend at least half of their work time in their assigned office. For certain roles, such as those based at green-light hubs, employees are expected to be in-office for 100% of their time. Please speak with your recruiter to better understand in-office expectations for this role.
At Uber, we provide a centralized, reliable, and interactive observability data platform that includes metrics, logging, and tracing. This platform empowers engineers with the tools needed for monitoring, troubleshooting, and performing root cause analysis at scale.
The Metrics team, a core component of this platform, is responsible for delivering a cutting-edge, end-to-end distributed metrics solution designed to operate at Uber's scale. Our system ingests over 5 billion metrics per second and handles over 25K queries per second, with cardinality up to 500K. As Uber continues to grow, our system evolves to meet increasing demands. Beyond solving the inherent challenges of large-scale distributed systems-such as scalability, availability, performance, and efficiency-we also focus on providing intelligent insights that identify issues before they impact our customers, ensuring a seamless experience for millions of users worldwide.
We are a dynamic, agile team with members from diverse backgrounds. We dream big, aim high, and execute with precision. Join us to tackle challenging large-scale distributed system problems and raise the bar of observability for Uber.
What the Candidate Will Do
1. Design system architecture, own key components to deliver a centralized metric system for Uber.
2. Join on call rotation, driving continuous improvements on system availability, scalability, performance and efficiency.
3. Collaborate with other infrastructure teams, production engineer team and product manager to drive adoption and best practices, and to design and implement high impact, cross-product features.
Basic Qualifications
1. BS or higher degree in Computer Science, or a related technical discipline, or equivalent experience.
2. Proficient in one or more backend languages, like Java, Go, C/C++, C, with the ability to pick up new ones quickly.
3. Strong problem solving skills, with relevant experience in designing and implementing large scale distributed backend services
Preferred Qualifications
1. Proven record of building and operating highly reliable distributed systems at scale.
2. Experience with OpenTelemetry, Prometheus, Influx and/or building and operating monitoring infrastructure at large scale.
3. Under the hood experience with Apache Lucene, ElasticSearch, OpenSearch and other Search technologies is a big plus.
4. Batch and stream data process pipeline experience is a plus.
Uber's mission is to reimagine the way the world moves for the better. Here, bold ideas create real-world impact, challenges drive growth, and speed fuels progress. What moves us, moves the world - let's move it forward, together.
Uber is proud to be an Equal Opportunity employer. All qualified applicants will receive consideration for employment without regard to sex, gender identity, sexual orientation, race, color, religion, national origin, disability, protected Veteran status, age, or any other characteristic protected by law. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you have a disability or special need that requires accommodation, please let us know by completing [this form](;br>
Offices continue to be central to collaboration and Uber's cultural identity. Unless formally approved to work fully remotely, Uber expects employees to spend at least half of their work time in their assigned office. For certain roles, such as those based at green-light hubs, employees are expected to be in-office for 100% of their time. Please speak with your recruiter to better understand in-office expectations for this role.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.