Hybrid in New York, New York • Today
Designing Databricks-based lakehouse architectures on AWS (Delta Lake + S3 + Unity Catalog). Clear separation of compute vs. serving layers in distributed architectures. Low-latency API strategy where Spark is insufficient (e.g., leveraging optimized services or caching). Caching strategies to accelerate reads and reduce compute cost. Data partitioning, file size tuning, and optimization strategies for large-scale pipelines. Experience handling multi-terabyte structured time-series workloads. Abil
Full-time
Depends on Experience