Apply Now

Data engineer

Jersey City, NJ, US • Posted 1 day ago • Updated 1 day ago

Full Time

No Travel Required

On-site

$50 - $60/hr

Fitment

Dice Job Match Score™

👤 Reviewing your profile...

Job Details

Skills

pyspark
databrick

Summary

Data Engineer (PySpark & Databricks)

Location :Jersycity NJ

Only on W2 payroll

10+ years Experience.

Job Summary

We are looking for a skilled Data Engineer with strong expertise in PySpark and Databricks to design, build, and optimize scalable data pipelines and cloud-based data platforms. The ideal candidate will work closely with data analysts, data scientists, and business stakeholders to deliver reliable and high-performance data solutions.

Key Responsibilities

Design, develop, and maintain scalable ETL/ELT pipelines using PySpark.
Build and manage data workflows on the Databricks platform.
Process large-scale structured and unstructured datasets efficiently.
Optimize Spark jobs for performance, scalability, and cost.
Integrate data from multiple sources including APIs, databases, and cloud storage.
Develop and maintain data lakes and data warehouse solutions.
Collaborate with cross-functional teams to understand business requirements and translate them into technical solutions.
Implement data quality, validation, monitoring, and governance processes.
Troubleshoot production issues and provide performance tuning recommendations.
Create technical documentation and maintain coding standards and best practices.

Required Skills & Qualifications

Bachelor’s degree in Computer Science, Information Technology, Engineering, or related field.
3+ years of experience in Data Engineering or Big Data development.
Strong hands-on experience with:
- PySpark
- Databricks
- Apache Spark
- SQL
Experience with cloud platforms such as:
- AWS
- Azure
- Google Cloud Platform
Knowledge of data pipeline orchestration tools like:
- Airflow
- Azure Data Factory
- Jenkins
Experience with Delta Lake, Data Lake architecture, and distributed computing.
Strong understanding of ETL concepts and data modeling.
Familiarity with version control tools such as Git.
Good problem-solving and communication skills.

Preferred Qualifications

Databricks certification is a plus.
Experience with streaming technologies such as Kafka or Spark Streaming.
Knowledge of CI/CD pipelines and DevOps practices.
Exposure to machine learning pipelines is an advantage.

Technical Stack

Programming: Python, SQL
Big Data: PySpark, Apache Spark
Platform: Databricks
Cloud: AWS/Azure/Google Cloud Platform
Databases: Snowflake, Redshift, PostgreSQL, MySQL
Tools: Airflow, Git, Jenkins

Soft Skills

Strong analytical and troubleshooting abilities
Ability to work independently and collaboratively
Good communication and stakeholder management
Attention to detail and quality-focused mindset

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 91139017
Position Id: 8972191
Posted 1 day ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Senior Data Engineer Python & PySpark

Jersey City, New Jersey

•

Today

Senior Data Engineer - Python & PySpark Job Summary We are seeking an experienced Senior Data Engineer with strong expertise in Python, PySpark, SQL, and Big Data technologies. The ideal candidate will be responsible for designing, developing, and optimizing scalable data pipelines and ETL/ELT workflows for processing large volumes of structured and unstructured data. The role requires hands-on experience with distributed data processing, cloud platforms, orchestration tools, and performance op

Easy Apply

Full-time, Part-time, Contract, Third Party

Data Engineer

New York, New York

•

Today

Description Your role at GEI. The Data Engineer is responsible for designing, building, and maintaining the data pipelines and integrations that power GEI's AI solutions and digital initiatives. This role focuses on ensuring enterprise data is accessible, reliable, and governed so that AI capabilities can be deployed and scaled with confidence. The Data Engineer plays a hands-on role by preparing and integrating the data foundations that AI solutions depend on. This includes building ingestion p

Full-time

USD 90,000.00 - 150,000.00 per year

Data Engineer

New York, New York

•

Today

THE POSITIONOur roster has an opening with your name on it We are looking for a Data Engineer to join our growing data platform team and take end-to-end ownership of designing, building, and scaling the foundational data infrastructure that powers analytics, machine learning, and business decision-making across the company. In this role, you will independently drive the design and delivery of reliable, secure, and cost-efficient data platforms, enabling data engineers, analysts, and data scient

Full-time

USD 125,000.00 - 163,800.00 per year

Data Engineer - Python/PySpark

Hybrid in Jersey City, New Jersey

•

2d ago

Hi, Our client is lookingfor Data Engineer - Python/PySpark in Jersey City, NJ / Hybrid below is the detailed requirements. Job Title: Data Engineer - Python/PySpark Location: Jersey City, NJ / Hybrid Duration: Full Time Job Description:Bachelors degree in related field. 36 years of experience as a Data Engineer with strong Python and PySpark expertise Experience building scalable ETL/ELT pipelines in big data environments Strong understanding of KYC/AML and Client Onboarding domains H

Easy Apply

Full-time, Third Party

Depends on Experience

Search all similar jobs