PySpark Data Engineer

Hybrid in Rutherford, NJ, US • Posted 12 hours ago • Updated 55 minutes ago
Full Time
On-site
USD 65.00 per hour
Fitment

Dice Job Match Score™

✨ Finding the perfect fit...

Job Details

Skills

  • Recruiting
  • Financial Services
  • Computer Science
  • Information Technology
  • Streaming
  • API
  • Dimensional Modeling
  • SQL
  • Apache Hadoop
  • HDFS
  • Apache Hive
  • Cloud Computing
  • Amazon S3
  • Google Cloud
  • Google Cloud Platform
  • Storage
  • Electronic Health Record (EHR)
  • Databricks
  • Version Control
  • Git
  • Problem Solving
  • Conflict Resolution
  • Analytical Skill
  • Communication
  • Workflow
  • Orchestration
  • Apache Airflow
  • Microsoft Azure
  • Amazon Web Services
  • Step-Functions
  • Apache Kafka
  • Amazon Kinesis
  • NoSQL
  • Database
  • MongoDB
  • Apache Cassandra
  • Amazon DynamoDB
  • Continuous Integration
  • Continuous Delivery
  • Extract
  • Transform
  • Load
  • Data Warehouse
  • Apache Spark
  • Python
  • Data Quality
  • Management
  • Data Governance
  • Regulatory Compliance
  • Data Processing
  • Data Engineering
  • PySpark
  • Big Data

Summary

Date Posted: 06/30/2026

Hiring Organization: Rose International

Position Number: 503541

Industry: Financial Services

Job Title: PySpark Data Engineer

Job Location: Rutherford, NJ, USA, 07070

Work Model: Hybrid

Work Model Details: 3 days onsite and 2 days remote

Employment Type: Temp to Hire

FT/PT: Full-Time

Estimated Duration (In months): 6

Min Hourly Rate($): 65.00

Max Hourly Rate($): 74.00

Must Have Skills/Attributes: Data Engineer, PySpark

Experience Desired: Experience as a Data Engineer, with significant experience specifically in PySpark (7-10 yrs); Strong proficiency in Python programming (7-10 yrs); Experience with Apache Spark, including Spark SQL, Spark Streaming, and DataFrame API (7-10 yrs); Experience with big data technologies such as Hadoop, HDFS, Hive (7-10 yrs); Experience data warehousing concepts, dimensional modeling, and ETL principles (7-10 yrs)

Required Minimum Education: Bachelor's Degree

Preferred Education: Master's Degree

**C2C is not available**

Job Description
Required Education
Bachelor's degree in computer science, Engineering, Information Technology, or a related quantitative field.

Preferred Education
Master's degree in a related field.

Required Skills
7-10 years of experience as a Data Engineer, with significant experience specifically in PySpark.
7-10 years of experience in Python programming.
7-10 years of experience with Apache Spark, including Spark SQL, Spark Streaming, and DataFrame API.
7-10 years of experience and understanding of data warehousing concepts, dimensional modeling, and ETL principles.
7-10 years of experience in SQL for data querying and manipulation.
Experience with big data technologies such as Hadoop, HDFS, Hive, or similar.
Familiarity with cloud platforms (e.g., AWS, Azure, Google Cloud Platform) and their data services (e.g., S3, ADLS, Google Cloud Storage, EMR, Databricks, Glue).
Experience with version control systems (e.g., Git).
Excellent problem-solving, analytical, and communication skills.

Preferred Skills
Experience with workflow orchestration tools (e.g., Apache Airflow, Azure Data Factory, AWS Step Functions).
Knowledge of stream processing technologies (e.g., Kafka, Kinesis).
Experience with NoSQL databases (e.g., MongoDB, Cassandra, DynamoDB).
Familiarity with data governance tools and practices.
Experience in a CI/CD environment.

Responsibilities
Design, build, and optimize data pipelines using PySpark to extract, transform, and load (ETL) data from various sources into data lakes and data warehouses.
Develop and maintain scalable data processing jobs and frameworks using Apache Spark with Python (PySpark).
Work closely with data scientists, analysts, and business stakeholders to understand data requirements and deliver high-quality data solutions.
Implement data quality checks, monitoring, and alerting for data pipelines to ensure data accuracy and reliability.
Optimize existing PySpark jobs for performance, efficiency, and cost-effectiveness.
Manage and process large datasets, ensuring data governance, security, and compliance.
Troubleshoot and resolve issues in data pipelines and data processing jobs.
Participate in code reviews, contribute to architectural discussions, and promote best practices in data engineering.
Stay informed about new PySpark features, big data technologies, and industry best practices.
Document data pipelines, data models, and processes.

#CT1

  • **Only those lawfully authorized to work in the designated country associated with the position will be considered.**

  • **Please note that all Position start dates and duration are estimates and may be reduced or lengthened based upon a client's business needs and requirements.**


Benefits:
For information and details on employment benefits offered with this position, please visit here. Should you have any questions/concerns, please contact our HR Department via our secure website.


California Pay Equity:
For information and details on pay equity laws in California, please visit the State of California Department of Industrial Relations' website here.


Rose International is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, age, sex, sexual orientation, gender (expression or identity), national origin, arrest and conviction records, disability, veteran status or any other characteristic protected by law. Positions located in San Francisco and Los Angeles, California will be administered in accordance with their respective Fair Chance Ordinances.

If you need assistance in completing this application, or during any phase of the application, interview, hiring, or employment process, whether due to a disability or otherwise, please contact our HR Department.

Rose International has an official agreement (ID #132522), effective June 30, 2008, with the U.S. Department of Homeland Security, U.S. Citizenship and Immigration Services, Employment Verification Program (E-Verify). (Posting required by OCGA 13/10-91.).
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: roseint
  • Position Id: 503541
  • Posted 12 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Rutherford, New Jersey

Today

Easy Apply

Contract

Remote or Jersey City, New Jersey

Today

Full-time

USD 70.00 - 77.00 per hour

New York, New York

Today

Easy Apply

Full-time

USD 75.00 - 85.00 per hour

New York, New York

Today

Full-time

USD 150,000.00 - 200,000.00 per year

Search all similar jobs