Job Title: Data Architect (Lakehouse)
San Jose, California
12+ Months
About the Role: We are seeking an experienced Data Architect to serve as the technical authority on our data platform team.
The Lead Data Engineer/Architect will be responsible for shaping the architecture and engineering standards of our enterprise-scale Lakehouse environment on Databricks, guiding a team of talented engineers, and driving measurable improvements in platform performance and cost efficiency.
Candidate Profile
- 8-10+ years of professional data engineering experience
- Strong foundation in Lakehouse architecture principles and best practices
- Deep expertise in Apache Spark and Delta Lake optimization techniques
- Demonstrated track record of shipping and operating production-grade data platforms
Key Responsibilities
- Serve as the primary technical authority for data platform decisions across the engineering organization
- Define and enforce data design patterns, partitioning strategies, and file management standards to ensure platform reliability and scalability
- Lead cost optimization initiatives and continuous performance tuning across Databricks workloads
- Establish and uphold the code quality bar through rigorous code reviews, standards documentation, and engineering best practices
- Mentor and develop team members, fostering a culture of technical excellence and continuous learning
Technical Skills & Tools
Platform: Databricks, Apache Spark, Delta Lake, Apache Hive
Languages: Python (PySpark), SQL, Scala
Architecture: Medallion (Bronze/Silver/Gold), Data Lakehouse, Lambda/Kappa patterns
DevOps: GitHub, CI/CD pipelines, Infrastructure-as-Code
Cloud: Azure, AWS, or Google Cloud Platform data services
Education: At least a bachelor's degree (or equivalent experience) in Computer Science, Software/Electronics Engineering, Information Systems, or a closely related field is required.