Experience Required - 6+ Years
Must Have Technical/Functional Skills
Python; PySpark; Google Cloud Platform; ETL - Big Data / Data Warehousing; Data Analytics; SQL; Git (GitHub, GitLab, BitBucket, SVN)
Roles & Responsibilities
5+ years of hands-on software development experience with Big Data & Analytics solutionsGoogle Cloud Platform Cloud - Big Query, Airflow, DataProc, PubSub,Hadoop Hive, Spark, Python, shell scripting,
Strong experience in designing, developing, and optimizing data pipelines for large-scale data processing, transformation, and analysis using Big Data and Google Cloud Platform technologies.
Proficiency in SQL and database systems, with experience in designing and optimizing data models for performance and scalability.Experience of building event processing pipelines with Kafka or Google Cloud Platform PubSub.Design and development experience with Airflow, PubSub Kafka, Git, Jenkins is desirable.Knowledge of distributed (multi-tiered) systems, algorithms & relational databases.
Strong Object-Oriented Programming skills and design patterns.Experience with CICD pipelines, Automated test frameworks, and source code management tools (XLR, Jenkins, Git).
Good knowledge and experience with configuration management tools like GitHubAbility to analyze complex data engineering problems, propose effective solutions, and implement them effectively.Ability and willingness to learn, adopt and build solutions using the Enterprise frameworks established for Big DataLooks proactively beyond the obvious for continuous improvement opportunities.
Communicate effectively with product and cross functional team.
Willingness to learn new technologies and leverage them to their optimal potential.Understanding of various SDLC methodologies, familiarity with Agile & scrum ceremonies.
Certifications in cloud platform (Google Cloud Platform Professional Data Engineer) is a plus