Databricks Data Engineer

Bedminster, NJ, US • Posted 60+ days ago • Updated 14 hours ago
Full Time
On-site
Fitment

Dice Job Match Score™

⏳ Almost there, hang tight...

Job Details

Skills

  • Data Lake
  • Data Processing
  • Workflow
  • Scalability
  • Data Quality
  • Documentation
  • Data-flow Diagrams
  • Management
  • Computer Science
  • Data Science
  • Software Engineering
  • Information Systems
  • Databricks
  • Data Structure
  • Data Storage
  • Change Data Capture
  • Python
  • Star Schema
  • Dimensional Modeling
  • ELT
  • Extract
  • Transform
  • Load
  • Writing
  • SQL
  • PL/SQL
  • Relational Databases
  • Oracle
  • NoSQL
  • Database
  • MongoDB
  • Cosmos-Db
  • Cloud Computing
  • Continuous Integration
  • Continuous Delivery
  • Git
  • Microsoft Azure
  • DevOps
  • Storage
  • Apache Parquet
  • Apache Avro

Summary

Databricks Data Engineer

Responsibilities:
  • Assist with leading the team's transition to the Databricks platform and utilize the newer features of Delta Live Tables, Workflows etc.
  • Design and develop data pipelines that extract data from Oracle, load it into the data lake, transform it into the desired format, and load it into Databricks data lakehouse.
  • Optimize data pipelines and data processing workflows for performance, scalability, and efficiency.
  • Implement data quality checks and validations within data pipelines to ensure the accuracy, consistency, and completeness of data.
  • Help create and maintain documentation for data mappings, data definitions, architecture and data flow diagrams.
  • Build proof-of-concepts to determine viability of possible new processes and technologies.
  • Deploy and manage code in non-prod and prod environments.
  • Investigate and troubleshoot data related issues and fix or provide solutions to fix defects.
  • Identify and resolve performance bottlenecks, which could include suggesting ways to optimize and performance tune databases and queries to enhance query performance.

Requirements:
  • Bachelor's Degree in Computer Science, Data Science, Software Engineering, Information Systems, or related quantitative field.
  • 4+ years of experience working as a Data Engineer, ETL Engineer, Data/ETL Architect or similar roles.
  • Current/active Databricks Data Engineer/Analyst certification a BIG plus.
  • 3+ years working with Databricks with knowledge and expertise of data structures, data storage and change data capture gained from prior production implementations of data pipelines, optimizations, and best practices.
  • Solid continuous experience in Python.
  • 3+ years of experience in Kimball dimensional modeling (star-schema comprising of facts, type1 and type2 dimensions, aggregates, etc.) with solid understanding of ELT/ETL.
  • 3+ years of solid experience writing SQL and PL/SQL code.
  • 2+ years of experience with Airflow.
  • 3+ years of experience working with relational databases (Oracle preferred).
  • 2+ years of experience working with NoSQL databases: MongoDB, Cosmos DB, DocumentDB or similar.
  • 2+ years of cloud experience (Azure preferred).
  • Experience with CI/CD utilizing git/Azure DevOps.
  • Experience with storage formats including Parquet/Arrow/Avro.

#RecruitPS
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 90764204
  • Position Id: 648
  • Posted 30+ days ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Raritan, New Jersey

Today

Full-time

USD 132,000.00 - 150,000.00 per year

South Plainfield, New Jersey

7d ago

Easy Apply

Full-time, Third Party

Depends on Experience

New Jersey

13d ago

Easy Apply

Contract

$50 - $60

Woodbridge Township, New Jersey

Today

Easy Apply

Full-time

USD 58.00 - 61.00 per hour

Search all similar jobs