Data Engineer

Hybrid in Baltimore, MD, US • Posted 30+ days ago • Updated 18 hours ago

Contract Independent

No Travel Required

Hybrid

Depends on Experience

Fitment

Dice Job Match Score™

🤯 Applying directly to the forehead...

Job Details

Skills

Java
SQL
Python
Apache Spark
PySpark
Amazon Redshift
Amazon S3
Amazon Web Services
AWS Glue
Delta Lake
Data Integration
Continuous Integration
Continuous Delivery
EDI X12
EDI HL7
FHIR
HIPAA
HL7
ASC X12
Terraform
CloudFormation
CMS
Scala

Summary

Project Overview:

Responsible for designing, building, and maintaining data pipelines and infrastructure to support data-driven decisions and analytics. The individual is responsible for the following tasks:

· Design, develop and maintain data pipelines, and extract, transform, load (ETL) processes to collect, process and store structured and unstructured data

· Build data architecture and storage solutions, including data lakehouses, data lakes, data warehouse, and data marts to support analytics and reporting

· Develop data reliability, efficiency, and qualify checks and processes

· Prepare data for data modeling

· Monitor and optimize data architecture and data processing systems

· Collaboration with multiple teams to understand requirements and objectives

· Administer testing and troubleshooting related to performance, reliability, and scalability

· Create and update documentation

Key Responsibilities

Hands-On Data Pipeline Development

· Design, code, and deploy ETL/ELT pipelines across bronze, silver, and gold layers of the Data Lakehouse.

· Build ingestion pipelines for structured (SQL), semi-structured (JSON, XML), and unstructured data using PySpark/Python programming language using AWS Glue or EMR.

· Implement incremental loads, deduplication, error handling, and data validation.

· Actively troubleshoot, debug, and optimize pipelines for scalability and cost efficiency.

EDW & Data Lake Implementation

· Develop dimensional data models (Star Schema, Snowflake Schema) for analytics and reporting.

· Build and maintain tables in Iceberg, Delta Lake, or equivalent OTF formats.

· Optimize partitioning, indexing, and metadata for fast query performance.

Healthcare Data Integration

· Build ingestion and transformation pipelines for EDI X12 transactions (837, 835, 278, etc.).

· Implement mapping and transformation of EDI data with FHIR and HL7 frameworks.

· Work hands-on with AWS Health Lake (or equivalent) to store and query healthcare data.

Data Quality, Security & Compliance

· Develop automated validation scripts to enforce data quality and integrity.

· Implement IAM roles, encryption, and auditing to meet HIPAA and CMS compliance standards.

· Maintain lineage and governance documentation for all pipelines.

Collaboration & Delivery

· Work closely with the Lead Data Engineer, analysts, and data scientists to deliver pipelines that support enterprise-wide analytics.

· Actively contribute to CI/CD pipelines, Infrastructure-as-Code (IaC), and automation.

· Continuously improve pipelines and adopt new technologies where appropriate.

Required Skills & Qualifications

The candidate should have experience as a data engineer or in a similar role, with a strong understanding of data architecture and ETL processes. The candidate should be proficient in programming languages for data processing and knowledgeable about distributed computing and parallel processing.

· 3+ years of hands-on experience in building, deploying, and maintaining data pipelines on AWS or equivalent cloud platforms.

· Strong coding skills in Python and SQL (Scala or Java a plus).

· Proven experience with Apache Spark (PySpark) for large-scale processing.

· Hands-on experience with AWS Glue, S3, Redshift, Athena, EMR, Lake Formation.

· Strong debugging and performance optimization skills in distributed systems.

· Hands-on experience with Iceberg, Delta Lake, or other OTF table formats.

· Experience with Airflow or other pipeline orchestration frameworks.

· Practical experience in CI/CD and Infrastructure-as-Code (Terraform, CloudFormation).

· Practical experience with EDI X12, HL7, or FHIR data formats.

· Strong understanding of Medallion Architecture for data lake houses.

· Hands-on experience building dimensional models and data warehouses.

· Working knowledge of HIPAA and CMS interoperability requirements.

Education:

This position requires a bachelor’s or master’s degree from an accredited college or university with a major in computer science, statistics, mathematics, economics, or a related field. Three (3) years of equivalent experience in a related field may be substituted for the Bachelor’s degree.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 10432825
Position Id: 8878798
Posted 30+ days ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Data Engineer

Bethesda, Maryland

•

Today

M9 Solutions is dedicated to providing IT services and solutions to the Federal Government by mobilizing the right people, skills, clearance levels, and technologies to help organizations who desire improved performance and modern, sustainable change. M9 has provided quality IT services and support to more than 30 Federal Agencies and multiple commercial customers nationwide. Our capabilities include digital transformation, software development, cloud migration, applications & infrastructure, cy

Full-time

USD 60,000.00 per year

Data Engineer

Washington, District of Columbia

•

Today

Job Number: R0235328 Data Engineer The Opportunity: Ever-expanding technology like IoT, machine learning, and artificial intelligence means that there's more structured and unstructured data available today than ever before. As a data engineer, you know that organizing data can yield pivotal insights when it's gathered from disparate sources. We need an experienced data engineer like you to help our clients find answers in their data to impact important missions-from fraud detection to cancer re

Full-time

USD 99,000.00 - 225,000.00 per year

Data Engineer

Alexandria, Virginia

•

Today

Job Number: R0235693 Data Engineer The Opportunity Do you want to work at the forefront of advanced technology and solve complex data challenges? You know that data yields pivotal insights when it's gathered from disparate sources and organized. As a data engineer, you have the chance to develop and deploy the pipelines and platforms that make this data meaningful. What's more, you'll have the chance to grow Booz Allen's DataOps capab ilities while working with a multi-disciplinary team of anal

Full-time

USD 62,000.00 - 141,000.00 per year

Data Engineer

Reston, Virginia

•

Today

Job Number: R0236455 Data Engineer The Opportunity: Ever-expanding technology like IoT, machine learning, and artifi cia l intelligence means that there's more structured and unstructured data available today than ever before. As a data engineer, you know that organizing data can yield pivotal insights when it's gathered from disparate sources. We need an experienced data engineer like you to help our clients find answers in their data to impact important missions from fraud detection to cancer

Full-time

USD 77,600.00 - 176,000.00 per year

Search all similar jobs