Spark Job Migration Specialist

San Francisco, CA, US • Posted 12 hours ago • Updated 12 hours ago
Contract W2
Contract Corp To Corp
On-site
Depends on Experience
Fitment

Dice Job Match Score™

🎯 Assessing qualifications...

Job Details

Skills

  • Amazon S3
  • Amazon Web Services
  • Apache HBase
  • Apache Hadoop
  • Apache Hive
  • Apache Kafka
  • Apache NiFi

Summary

We are looking for Spark Job Migration Specialist for our client in San Francisco, CA
Job Title: Spark Job Migration Specialist
Job Location: San Francisco, CA
Job Type: Contract
Job Overview:
Pay Range: $60hr - $65hr

Responsibilities:

  • Migrate JVM workloads and Spark-Submit tasks to Databricks JAR tasks or Notebook tasks.
  • Convert HiveQL scripts and Oozie workflows into optimized Spark SQL or PySpark applications.
  • Refactor data pipelines from Azure Synapse to cloud platforms, updating dependencies and notebook references.
  • Implement performance optimization techniques including Adaptive Query Execution (AQE) in Spark 3.
  • Perform regression testing to validate data consistency between legacy and new systems.
  • Develop validation scripts to ensure output accuracy and reliability.
  • Customize Spark jobs using spark context configurations for better monitoring and troubleshooting.
  • Reconfigure job properties, cluster settings, and Spark configurations for optimized execution.
  • Ensure schema evolution, data correctness, and validation using golden datasets.
Required Skills And Qualifications:
  • 5+ years of experience with Apache Spark (PySpark or Scala).
  • Strong experience with Hadoop ecosystem including HDFS, Hive, HBase, and MapReduce.
  • Experience in migrating data pipelines to cloud or enterprise data platforms.
  • Proficiency in SQL and performance tuning techniques.
  • Hands-on experience with distributed data processing frameworks.
  • Experience with scripting languages such as Python, Shell, or Scala.
  • Familiarity with data ingestion tools like Sqoop, Kafka, and NiFi.
  • Experience working with cloud storage solutions such as ADLS, S3, or Blob Storage.
Preferred Skills And Qualifications:
  • Experience with Databricks platform and notebook-based development.
  • Familiarity with Azure and AWS cloud environments.
  • Strong understanding of data pipeline architecture and optimization strategies.
  • Experience handling large-scale data migrations and transformations.
  • Knowledge of cluster tuning and Spark configuration best practices.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10516350
  • Position Id: CA_SJMP_0323
  • Posted 12 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

San Francisco, California

Today

Easy Apply

Contract

Depends on Experience

San Francisco, California

Today

Easy Apply

Contract

Depends on Experience

Remote

Today

Easy Apply

Contract

Depends on Experience

Elk Grove, California

Today

Contract

Search all similar jobs