POSITION OVERVIEW:
seeking a world-class, high-caliber ClickHouse Technical Director to serve as the principal strategic advisor and ultimate technical escalation point for one of the world''s largest ClickHouse footprints—spanning over 500,000 pods across the Walmart ecosystem.
This individual-contributor role will focus explicitly on the mission-critical Network Observability and system telemetry environments where data is ingested continuously on a minute-by-minute basis. You will partner directly with 10 designated Walmart technical contacts to guide proactive architecture, resolve complex performance bottlenecks, and design robust lifecycle upgrade frameworks that support Walmart’s expanding global footprint.
CORE RESPONSIBILITIES:
1. Strategic Technical Advisory & Architecture
- Conduct regular strategic technical synchronization meetings (bi-weekly/monthly) with Walmart engineering leaders to address roadmap milestones and architectural friction points.
- Author and validate schema designs, hardware sizing profiles, and engine selections (e.g., ReplicatedMergeTree) to match expanding global scaling goals.
2. Advanced Performance Observability & Tuning
- Deep-dive into core ClickHouse system tables (system.query_log, system.part_log, system.metrics) to map, diagnose, and alleviate underlying performance blocks.
- Refactor long-running queries and configure custom resource utilization and compression levels across CPU/Memory/IO infrastructure.
3. Enterprise Lifecycle & Upgrade Management
- Perform architectural evaluations on new ClickHouse releases to measure compatibility and performance gains.
- Design step-by-step upgrade execution roadmaps complete with staging validations and zero-downtime rollback strategies.
4. Tier-4 Production Incident Escalation
- Act as the absolute backstop for unlimited production incidents when internal teams exhaust independent resolution workflows.
REQUIRED SKILLS & QUALIFICATIONS:
- 12+ years of enterprise Data Engineering experience, with 5+ years focusing entirely on large-scale ClickHouse systems.
- Elite competency in deep ClickHouse internals (MergeTree variants, parts lifecycle, ZooKeeper/Keeper mechanics).
- Background architecting high-volume telemetry (metrics, logs, traces) processing systems.
- Absolute location compliance: Must reside and be legally authorized to work in the USA.
Job Responsibilities
1. Strategic Technical Advisory & Architecture
- Conduct regular strategic technical synchronization meetings (bi-weekly/monthly) with Walmart engineering leaders to address roadmap milestones and architectural friction points.
- Author and validate schema designs, hardware sizing profiles, and engine selections (e.g., ReplicatedMergeTree) to match expanding global scaling goals.
2. Advanced Performance Observability & Tuning
- Deep-dive into core ClickHouse system tables (system.query_log, system.part_log, system.metrics) to map, diagnose, and alleviate underlying performance blocks.
- Refactor long-running queries and configure custom resource utilization and compression levels across CPU/Memory/IO infrastructure.
3. Enterprise Lifecycle & Upgrade Management
- Perform architectural evaluations on new ClickHouse releases to measure compatibility and performance gains.
- Design step-by-step upgrade execution roadmaps complete with staging validations and zero-downtime rollback strategies.
4. Tier-4 Production Incident Escalation
- Act as the absolute backstop for unlimited production incidents when internal teams exhaust independent resolution workflows.