Title: Data Engineer (PySpark, Hadoop, Scala)
Hybrid 2-3 days onsite in Sunnyvale, CA
Candidates must work on our W-2.
Interview: one coding round.
Job Description
We are looking for a highly motivated and eager-to-learn Data Engineer with hands-on experience in PySpark, Hadoop, Scala, and ETL processes. The ideal candidate will work on large-scale data processing, specifically handling signals and datasets, transforming raw user data into structured tables, and supporting pre-machine learning workflows.
Skill Set: PySpark, Hadoop, Scala, ETL
Day to Day: Working with signals and tables; preparing and processing raw user data and building structured tables. This work is considered pre-machine learning.
Preferred: a master's degree with 1-2 years of experience, and a strong eagerness to learn.
Bonus: machine learning background.
Key Responsibilities:
- Process and transform large volumes of raw data using PySpark and Hadoop
- Develop and maintain ETL pipelines for data ingestion and processing
- Work with signals and datasets, building and optimizing data tables
- Clean, validate, and prepare data for pre-machine learning use cases
- Collaborate with analytics and data science teams to support model development
- Ensure data quality, consistency, and performance optimization
Required Skills:
- Strong hands-on experience with PySpark, Hadoop, and Scala
- Good understanding of ETL processes and data pipelines
- Experience working with large-scale structured and unstructured datasets
- Basic knowledge of data modeling and data transformation techniques
- Strong problem-solving and analytical skills