Lead Databricks Engineer - Certified

Overview

Full Time
Part Time
Accepts corp to corp applications
Contract - W2
Contract - Independent

Skills

Extract
Transform
Load
ELT
Apache Spark
Python
SQL
PySpark
Streaming
Continuous Integration
Continuous Delivery
Version Control
Git
DevOps
Data Modeling
Data Warehouse
Performance Tuning
Unstructured Data
Workflow
Scalability
Collaboration
Databricks
SANS
Cloud Computing
Amazon Web Services
Microsoft Azure
Google Cloud
Google Cloud Platform

Job Details

Job Description:

  • 5+ Years of Design and implement ETL/ELT pipelines using Databricks and Apache Spark.
  • Strong proficiency in Python, SQL, and PySpark.
  • Knowledge of Delta Lake, data lakehouse concepts, and streaming data.
  • Familiarity with CI/CD pipelines, version control (Git), and DevOps practices.
  • Understanding of data modeling, data warehousing, and performance tuning.
  • Develop and maintain data lakehouse architectures for structured and unstructured data.
  • Optimize data workflows for performance, scalability, and cost efficiency.
  • Collaborate with data scientists, analysts, and business stakeholders to deliver high-quality data solutions.
  • Monitor and troubleshoot data pipelines, ensuring reliability and accuracy.
  • Integrate Databricks with cloud services (AWS, Azure, or Google Cloud Platform) and other enterprise systems.
  • Hands-on experience with cloud platforms (AWS, Azure, or Google Cloud Platform).

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.