Databricks Engineer (Only G.C / U.S.C)
6+Months
Cincinnati, OH 45202 (Onsite 5 days a week)
**Position Overview:**
We are seeking a talented Data Engineering Specialist with expertise in Microsoft Azure Databricks to join our data solutions team. In this role, you will develop robust, scalable data integration pipelines and modern Lakehouse architectures leveraging Databricks on Azure. You will collaborate closely with business partners and data professionals to deliver high-quality datasets and analytic solutions, ensuring data reliability, scalability, and security in accordance with best practices like the Medallion architecture and Data Mesh methodologies.
**Core Responsibilities:**
- Develop, optimize, and maintain large-scale data pipelines and ETL/ELT workflows utilizing Databricks, Spark, and Python.
- Partner with cross-functional teams to gather requirements and translate them into curated, production-grade data assets.
- Implement design patterns for efficient data management; ensure clean separation of raw, curated, and refined layers (Bronze, Silver, Gold).
- Monitor and optimize existing data pipelines for performance, cost, and reliability.
- Implement and maintain cloud-based data solutions on Azure, with experience in Google Cloud Platform or AWS being advantageous.
- Apply data modeling principles, including both structured (SQL) and unstructured (NoSQL) databases.
- Support data governance by enforcing data quality checks, lineage, and best-in-class security protocols.
- Utilize orchestration tools (such as Airflow, Prefect, or Azure Data Factory) for workflow management.
- Create and maintain documentation for processes, data systems, and architecture.
**Essential Qualifications:**
- Demonstrated proficiency building and managing data pipelines using Databricks, Spark, SQL, and Python.
- Significant experience with Azure cloud services; exposure to other cloud platforms (Google Cloud Platform, AWS) is a plus.
- Solid understanding of data architecture, data modeling, and database administration (relational and NoSQL technologies).
- Knowledge of modern data management frameworks (Medallion, Data Mesh, etc.).
- Familiarity with data security, privacy, and compliance standards.
- Skilled in workflow orchestration (e.g., Airflow, Azure Data Factory, Luigi).
- Strong analytical, collaborative, and communication abilities.
**Preferred Experience:**
- Working knowledge of leading cloud data warehousing platforms (e.g., Snowflake, Redshift, BigQuery).
- Experience building dashboards or BI solutions with tools such as Power BI or Tableau.
- Background in system performance tuning and troubleshooting distributed data environments.
- Exposure to ML Ops or data science environments on Databricks or similar platforms.
- Familiarity with real-time data processing (Kafka, Flink, etc.).
- Industry experience in regulated sectors such as financial services or healthcare is advantageous.
- Programming experience with Go or other modern languages is a bonus.