Hi [Candidate Name],
I hope you re doing well.
We have an exciting opportunity for a Cloudera Data Engineer to support the Medicaid Data Warehouse Migration Project in an AWS environment. Please review the job details below and let me know if you d be interested in exploring this role.
Job Title: Cloudera Data Engineer
Location: Remote
Project: Medicaid Data Warehouse Migration
Mode of Work: Remote
Job Summary:
We are seeking a skilled Cloudera Data Engineer to support the migration and ongoing operations of a Cloudera/Hive/Scala-based data pipeline in AWS. The role involves replicating, configuring, and validating data pipelines and infrastructure as part of an AWS-to-AWS migration, ensuring data integrity, performance, and operational stability.
Key Responsibilities:
Replicate and configure existing Cloudera cluster (HDFS, YARN, Hive, Spark) in the new AWS account.
Coordinate with infrastructure and DevOps teams for provisioning (EC2, IAM, networking).
Migrate and validate metadata stores, job configurations, and dependencies.
Validate job execution, data outputs, and performance parity post-migration.
Maintain job schedules, dependencies, and runtime configurations.
Monitor Cloudera Manager dashboards and ensure cluster health.
Troubleshoot and optimize Spark/Scala jobs for efficiency.
Manage access controls, documentation, and operational procedures.
Required Skills and Experience:
7+ years of experience in Data Engineering or Big Data development.
4+ years working with Cloudera platform (HDFS, YARN, Hive, Spark, Oozie).
Strong proficiency in Scala, Java, and HiveQL (Python/Bash preferred).
Experience deploying and operating Cloudera workloads on AWS (EC2, S3, IAM, CloudWatch).
Hands-on experience with Cloudera Hadoop distribution and Apache Spark.
Experience implementing business-rules processing using Drools.
Ability to collaborate with infrastructure, DevOps, and governance teams.
Preferred Qualifications:
Cloudera certification (CDP Data Engineer or Administrator).
Experience with Cloudera version upgrades or AWS-to-AWS migrations.
Prior experience in public-sector or large enterprise data environments.
If this role aligns with your experience, please share your updated resume, and I ll connect with you to discuss next steps.