Data Engineer

New York, NY, US • Posted 9 hours ago • Updated 9 hours ago
Contract Independent
Contract W2
Contract Corp To Corp
On-site
$60 - $70/hr
Fitment

Dice Job Match Score™

🎯 Assessing qualifications...

Job Details

Skills

  • Databricks
  • aws
  • devops
  • python
  • sql

Summary

Job Title: Databricks Data Engineer with DevOps Skills

Location : Los Angeles CA or NYC (Hybrid)

Hire type : CTH

Job Summary

We are looking for an experienced Databricks Data Engineer with strong DevOps expertise to join our data engineering team. The ideal candidate will design, build, and optimize large-scale pipelines on the Databricks Lakehouse Platform on AWS, while driving automated CI/CD and deployment practices. This role requires strong skills in PySpark, SQL, AWS cloud services, and modern DevOps tooling. You will collaborate closely with cross-functional teams to deliver scalable, secure, and high-performance data solutions.


Must Demonstrate (Critical Skills & Architectural Competencies)

  • Designing and implementing Databricks-based Lakehouse architectures on AWS
  • Clear separation of compute vs. serving layers
  • Ability to design low-latency data/API access strategies (beyond Spark-only patterns)
  • Strong understanding of caching strategies for performance and cost optimization
  • Data partitioning, storage optimization, and file layout strategy
  • Ability to handle multi-terabyte structured or time-series datasets
  • Skill in requirement probing, identifying what matters architecturally
  • A player-coach mindset: hands-on engineering + technical leadership

Key Responsibilities

1. Data Pipeline Development

  • Design, build, and maintain scalable ETL/ELT pipelines using Databricks on AWS.
  • Develop high-performance data processing workflows using PySpark/Spark and SQL.
  • Integrate data from Amazon S3, relational databases, and semi/nonstructured sources.
  • Implement Delta Lake best practices including schema evolution, ACID, OPTIMIZE, ZORDER, partitioning, and file-size tuning.
  • Ensure architectures support high-volume, multi-terabyte workloads.

2. DevOps & CI/CD

  • Implement CI/CD pipelines for Databricks using Git, GitLab, GitHub Actions, or AWS-native tools.
  • Build and manage automated deployments using Databricks Asset Bundles.
  • Manage version control for notebooks, workflows, libraries, and environment configuration.
  • Automate cluster policies, job creation, environment provisioning, and configuration management.
  • Support infrastructure-as-code via Terraform (preferred) or CloudFormation.

3. Collaboration & Business Support

  • Work with data analysts and BI teams to prepare curated datasets for reporting and analytics.
  • Collaborate closely with product owners, engineering teams, and business partners to translate requirements into scalable implementations.
  • Document data flows, technical architecture, and DevOps/deployment workflows.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10114281
  • Position Id: 8935092
  • Posted 9 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

New York, New York

15d ago

Easy Apply

Contract

$80 - $100

New York, New York

8d ago

Easy Apply

Contract

Depends on Experience

Hybrid in New York, New York

Today

Easy Apply

Contract

90 - 93

Hybrid in New York, New York

8d ago

Easy Apply

Contract

65 - 70

Search all similar jobs