Senior Data Engineer with Pyspark

Rocky Hill, CT, US • Posted 60+ days ago • Updated 11 hours ago

Full Time

On-site

Depends on Experience

Hexplora

Fitment

Dice Job Match Score™

⭐ Evaluating experience...

Job Details

Skills

Summary

Job Title: Senior Data Engineer (PySpark, ETL SSIS) -W2 only
Location: Rocky Hill,CT

Job Description:

We are looking for an experienced and motivated Data Engineer with expertise in PySpark to join our dynamic team. As a key member of our data engineering team, you will play a crucial role in designing, building, and maintaining scalable data pipelines that enable efficient data processing and analytics within the healthcare domain.

This role will combine both development and administrative activities, making it essential that the candidate has experience not only in building robust data pipelines but also in overseeing their operational aspects to ensure performance, reliability, and optimization.

Key Responsibilities:

Design, develop, and maintain scalable ETL pipelines using PySpark to process large datasets.
Collaborate with cross-functional teams (data scientists, analysts, business stakeholders) to understand data requirements and deliver high-quality solutions.
Work on administrative tasks, including monitoring, troubleshooting, and optimizing data pipelines and infrastructure.
Manage data integration across healthcare systems, ensuring compliance with relevant standards.
Leverage SSIS for ETL development and ensure smooth data movement across different environments.
Integrate and transform data from multiple sources, ensuring data quality and consistency.
Handle and resolve data processing issues, ensuring minimal disruption to operations.
Document best practices, processes, and workflows to maintain pipeline efficiency and scalability.
Work with both relational and non-relational databases, ensuring smooth data flow and optimized performance.

Required Skills & Qualifications:

Strong experience in Data Engineering, with expertise in designing, building, and maintaining ETL pipelines.
Strong proficiency in PySpark for large-scale data processing and transformation.
Experience with ETL tools, particularly SSIS (SQL Server Integration Services).
Solid understanding of data modeling, relational databases, and data warehousing principles.
Experience working with cloud-based data storage and processing technologies (AWS, GCP, or Azure).
Familiarity with healthcare data standards, such as HL7 and FHIR, is highly desirable.
Proven ability in data pipeline monitoring, troubleshooting, and performance tuning.
Strong communication skills and the ability to work collaboratively with cross-functional teams.

Infowave Systems is an equal opportunity employer that is committed to diversity and inclusion in the workplace.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 91015321
Position Id: 25-00569
Posted 30+ days ago

Company Info

About Hexplora

Our Purpose:

In today s rapidly evolving healthcare space, all healthcare organizations large and small are looking to leverage data to obtain actionable information that can be used to improve the health and wellness of patients in the most cost effective manner. Hexplora offers a comprehensive, end-to-end data warehousing and business intelligence solution for Health Plans, Accountable Care Organizations (ACOs), Independent Physicians Associations (IPAs), self-insured employers, and institutional providers to address all of their reporting and analytics requirements, allowing them to easily decipher mass information through a single source and apply it to cost savings and care management analysis.

Our Team:

Hexplora is led by a highly dynamic and motivated team of technology experts with deep healthcare expertise and experience in implementing industry leading solutions that enable ACOs, IPAs, and Payers to leverage Informatics as a strategic asset that can deliver market differentiating and cost saving capabilities.

Overview:

Through establishing Infowave Systems, an IT Services and Solutions company, our leadership has been instrumental in assisting Healthcare companies with designing, developing and deploying Enterprise Data Warehouses (EDW), Business Intelligence (BI), Analytics, Web Portals and Reporting Modules in various technologies for over fifteen years. With the experience and expertise gained over the years, Infowave Systems designed and developed a Business Analytics Solution referred to as Healthcare Analytical and Reporting Platform (HARP). In 2013, HARP solution was spun off as HEXPLORA to offer the solution under a SaaS model (Software as a Service).

Go to company profile

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.