Senior Cloud Performance Engineer, Infrastructure

  • San Francisco, CA
  • Posted 15 hours ago | Updated 3 hours ago

Overview

On Site
USD 180,000.00 - 200,000.00 per year
Full Time

Skills

Performance Analysis
Open Source
Benchmarking
Orchestration
Data Collection
Reporting
Cloud Computing
SAN
Artificial Intelligence
CHAOS
Firmware
Linux Kernel
Build Automation
Roadmaps
Systems Architecture
Mentorship
ICS
Leadership
Software Engineering
Performance Engineering
Python
Java
C
C++
Automated Testing
Linux
CPU
Scheduling
Docker
Kubernetes
Communication
Writing
PyTorch
JAX
Microservices
Debugging
Performance Tuning
Database
Streaming
Machine Learning (ML)
GPU
ARM
Continuous Integration
Continuous Delivery
Regression Analysis
System Imaging
Provisioning
Computer Hardware
Computer Networking
MySQL
UI
React.js
Tableau
Dashboard
Law
Legal
Collaboration

Job Details

About the Role

We are seeking a highly skilled and motivated Senior Cloud Performance Engineer to join Uber's Fleet Engineering organization. This role will lead the design, build, and operation of our next-gen cloud platform qualification and performance engineering. You'll drive end-to-end performance evaluations, build automation and tooling that accelerates decisions, and partner across infra, SRE, internal service and product teams, and cloud/xPU vendors to deliver measurable improvements at Uber scale.

You'll work across multi-cloud, containerized, virtualized, and bare-metal systems to execute qualifications and in-depth performance analyses, tuning and improving observability across the stack, from hardware and Linux kernel runtimes to distributed application services. Beyond running benchmarks, you'll dive deep to debug performance anomalies, extend existing benchmarks, and develop new workloads that better capture Uber's production patterns. This role offers close collaboration with engineers across the stack, cloud partners, silicon vendors, and open-source communities to shape the next generation of Uber's infrastructure. It's ideal for someone with a hardware-software co-design background who thrives on uncovering performance insights and building systems that make those insights actionable.

What the Candidate Will Do

1. Design and lead the architecture and development of Uber's benchmarking and qualification platform, automating test orchestration, data collection, analysis, and reporting across multi-cloud and on-prem environments.
2. Develop and extend workloads and benchmarks (compute, storage, network, ML/AI) and integrate stress, chaos, and regression tests to validate hardware and platform choices.
3. Analyze and optimize end-to-end performance across hardware, firmware, Linux kernel, runtimes, and distributed services using advanced profiling tools (perf, eBPF, flamegraphs, tracing frameworks).
4. Build automation and observability tooling (Go/Python/Java, Kubernetes/Docker) for CI/CD-based performance regression detection, telemetry, alerting, and anomaly detection.
5. Collaborate with hyperscalers and silicon partners (Arm, GPU, and accelerator vendors) to evaluate emerging instances, kernels, and infrastructure technologies, and translate findings into roadmap recommendations.
6. Influence system architecture and tooling decisions that improve how Uber builds, monitors, and scales its infrastructure.
7. Drive execution and quality, writing design docs, setting milestones, mentoring ICs, and communicating insights and results to stakeholders and leadership.

Basic Qualifications

1. 5+ years of software engineering or systems/performance engineering experience (BS in CS/EE or related), with demonstrated end-to-end ownership of complex projects.
2. Proficient in two or more: Go, Python, Java, C/C++; strong CS fundamentals and testing/automation discipline.
3. Hands-on with Linux internals (CPU scheduling, memory, I/O, networking) and perf tooling (perf, eBPF, flamegraphs, tracing frameworks).
4. Experience with Docker/Kubernetes, microservices, and distributed systems; comfort building production services and pipelines.
5. Proven track record of clear communication, writing design docs/postmortems, and leading cross-functional efforts.

Preferred Qualifications

1. Experience tuning databases, stream processing, batch or ML platforms (e.g. PyTorch, JAX).
2. Familiarity with microservices debugging and distributed tracing (OpenTelemetry, Jaeger).
3. Performance tuning for databases/streaming/batch/ML platforms; GPU/xPU or Arm performance exposure.
4. Experience building observability (OpenTelemetry/Jaeger), CI/CD perf gates, and regression detectors.
5. Large-scale fleet know-how: OS imaging/provisioning, config rollout, hardware health monitoring, and DC networking fundamentals.
6. Full-stack bonus: backend (e.g., MySQL) and light UI work (React/Tableau) for results dashboards.

For San Francisco, CA-based roles: The base salary range for this role is USD$180,000 per year - USD$200,000 per year.

For Sunnyvale, CA-based roles: The base salary range for this role is USD$180,000 per year - USD$200,000 per year.

For all US locations, you will be eligible to participate in Uber's bonus program, and may be offered an equity award & other types of comp. You will also be eligible for various benefits.

Uber's mission is to reimagine the way the world moves for the better. Here, bold ideas create real-world impact, challenges drive growth, and speed fuels progress. What moves us, moves the world - let's move it forward, together.

Uber is proud to be an Equal Opportunity employer. All qualified applicants will receive consideration for employment without regard to sex, gender identity, sexual orientation, race, color, religion, national origin, disability, protected Veteran status, age, or any other characteristic protected by law. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you have a disability or special need that requires accommodation, please let us know.

Offices continue to be central to collaboration and Uber's cultural identity. Unless formally approved to work fully remotely, Uber expects employees to spend at least half of their work time in their assigned office. For certain roles, such as those based at green-light hubs, employees are expected to be in-office for 100% of their time. Please speak with your recruiter to better understand in-office expectations for this role.