1. Strong Spark - solid understanding of Spark fundamentals and architecture, with hands-on working knowledge.
2. Strong Scala - experience developing data pipelines using Spark and Scala.
Familiarity with Scala functions for data transformation.
3. Ability to understand existing data pipelines and recommend modifications and best practices.
4. Good knowledge of AWS infrastructure services: Amazon Simple Storage Service (Amazon S3), Amazon EMR, Amazon Elastic Compute Cloud (Amazon EC2), AWS Lambda, etc.