Full Stack Databricks Developer
Onsite Locations: Dallas, TX and Middletown, NJ
Duration: Long term
Skills & Qualifications
1. Technical Core (Databricks & Spark)
Expert PySpark/Scala: Deep understanding of Spark internals, broadcast joins, and RDD/DataFrame partitioning.
Delta Lake Mastery: Proficiency in Delta features like Z-Ordering, Liquid Clustering, Change Data Feed (CDF), and Time Travel.
Streaming Patterns: Hands-on experience with watermarking, checkpointing, and handling late-arriving data in Structured Streaming.
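For reference, a minimal PySpark sketch of the watermarking pattern above; the Kafka broker, topic, and target table names are hypothetical placeholders:

```python
# Minimal sketch (hypothetical broker, topic, and table names): windowed counts
# over a Kafka stream with a 10-minute watermark to bound late-arriving data.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.getOrCreate()

events = (
    spark.readStream
    .format("kafka")
    .option("kafka.bootstrap.servers", "broker:9092")   # placeholder broker
    .option("subscribe", "events")                       # placeholder topic
    .load()
)

counts = (
    events
    .withWatermark("timestamp", "10 minutes")            # tolerate events up to 10 min late
    .groupBy(F.window("timestamp", "5 minutes"))
    .count()
)

(counts.writeStream
    .option("checkpointLocation", "/tmp/checkpoints/event_counts")  # enables exactly-once recovery
    .outputMode("append")
    .toTable("bronze.event_counts"))                     # placeholder Delta table
```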
2. Data Modeling & Languages
SQL: Expert-level SQL for complex transformations and window functions.
JSON/Semi-Structured Data: Mastery of parsing and generating complex nested JSON objects within Spark (e.g., struct, array, to_json, from_json); see the sketch at the end of this subsection.
Medallion Design: Proven experience moving data across Bronze, Silver, and Gold layers with clear "Data Contracts."
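For illustration, a short PySpark sketch of nested-JSON handling with from_json, explode, struct, and to_json; the order/line-item schema is invented for this example:

```python
# Hypothetical order/line-item schema, invented to illustrate nested-JSON handling.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F
from pyspark.sql.types import StructType, StructField, StringType, ArrayType, DoubleType

spark = SparkSession.builder.getOrCreate()

schema = StructType([
    StructField("order_id", StringType()),
    StructField("items", ArrayType(StructType([
        StructField("sku", StringType()),
        StructField("price", DoubleType()),
    ]))),
])

raw = spark.createDataFrame(
    [('{"order_id": "A1", "items": [{"sku": "X", "price": 9.5}]}',)],
    ["json_str"],
)

# Parse the JSON string into typed columns, then flatten the item array.
flat = (
    raw.withColumn("payload", F.from_json("json_str", schema))
       .select("payload.order_id", F.explode("payload.items").alias("item"))
       .select("order_id", "item.sku", "item.price")
)

# Re-nest and serialize back to JSON for a consumption-ready Gold column.
gold = flat.groupBy("order_id").agg(
    F.to_json(F.collect_list(F.struct("sku", "price"))).alias("items_json")
)
```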
3. Full Stack & DevOps
CI/CD: Experience automating data pipeline deployments (Git-based workflows).
Observability: Ability to set up monitoring and alerts using Databricks SQL Alerts or Grafana to track pipeline lag.
4. Soft Skills
Architectural Thinking: Ability to decide between "Continuous" and "AvailableNow" streaming triggers based on cost versus latency requirements (see the sketch at the end of this subsection).
Client Focus: Understanding how an API client (e.g., a React app or a microservice) will consume the Gold layer JSON.
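To make that cost-versus-latency trade-off concrete, a hedged PySpark sketch; table names and checkpoint paths are placeholders. availableNow drains the current backlog and then stops (cheaper, batch-like), while an always-on processingTime trigger, which is how a "continuous" pipeline typically runs on Databricks, keeps latency low at higher cluster cost:

```python
# Same stream, two trigger strategies; pick one. Table names and checkpoint
# paths are placeholders.
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()
stream = spark.readStream.table("silver.orders")   # hypothetical Silver source

# Cost-optimized: drain the current backlog, then stop (suits scheduled jobs).
(stream.writeStream
    .option("checkpointLocation", "/tmp/checkpoints/gold_orders_triggered")
    .trigger(availableNow=True)
    .toTable("gold.orders"))

# Latency-optimized: always-on micro-batches every 30 seconds on a running cluster.
(stream.writeStream
    .option("checkpointLocation", "/tmp/checkpoints/gold_orders_always_on")
    .trigger(processingTime="30 seconds")
    .toTable("gold.orders"))
```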
Job Title: Data Engineer (Streaming & Full Stack Databricks)
Role Summary
We are seeking a high-performing Data Engineer to design and implement a real-time data platform using the Medallion Architecture.
You will be responsible for end-to-end development of data pipelines: ingesting real-time source data into the Bronze layer, transforming it into a relational Silver layer, and delivering high-concurrency, consumption-ready JSON Gold tables.
You will act as a "Full Stack" data professional, handling everything from infrastructure automation (DataOps) to complex nested data modeling.
Key Responsibilities
Real-Time Ingestion: Build scalable ingestion pipelines using Auto Loader and Spark Structured Streaming to capture data from Kafka, Event Hubs, or CDC sources into raw Bronze Delta tables (see the sketch at the end of this list).
Relational Transformation: Develop ELT logic to cleanse, deduplicate, and normalize data into a relational format. Ensure ACID compliance and "exactly-once" processing semantics.
JSON API Optimization: Design and build the layer specifically for client consumption. This involves flattening/nesting data into optimized JSON structures within Delta tables to support low-latency API queries.
Advanced Orchestration: Implement and manage complex workflows using Delta Live Tables (DLT) or standard Structured Streaming jobs, orchestrated with Databricks Workflows, to ensure data freshness and lineage.
Governance & Security: Use Unity Catalog to enforce fine-grained access control (row/column level) and maintain a searchable data catalog for consuming clients.
DataOps & Automation: Own the deployment lifecycle using Databricks Asset Bundles (DABs) and CI/CD pipelines (GitHub Actions/Azure DevOps) to ensure reproducible environments.
Performance Tuning: Optimize streaming triggers, watermarking, and stateful processing to minimize latency and manage cloud costs effectively.
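For reference, a minimal Auto Loader sketch of the Bronze ingestion pattern described under Real-Time Ingestion; all paths, schema locations, and table names are placeholders:

```python
# Hypothetical Auto Loader ingestion into a raw Bronze Delta table; paths,
# schema location, and table name are placeholders.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.getOrCreate()

bronze_stream = (
    spark.readStream
    .format("cloudFiles")                                  # Databricks Auto Loader
    .option("cloudFiles.format", "json")
    .option("cloudFiles.schemaLocation", "/mnt/raw/_schemas/orders")
    .load("/mnt/raw/orders/")
    .withColumn("_ingested_at", F.current_timestamp())     # ingestion audit column
)

(bronze_stream.writeStream
    .option("checkpointLocation", "/mnt/raw/_checkpoints/orders")
    .outputMode("append")
    .toTable("bronze.orders_raw"))
```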