Overview
Skills
Job Details
Job Title: Google Cloud Platform Data Engineer Location: REMOTE Duration: Long Term
Required minimum 3 years of proven hands-on experience in the following:
Design and implement robust data pipelines using Google Cloud Platform (Google Cloud Platform) services such as BigQuery, Cloud Storage, and Pub/Sub.
Develop and manage workflows using Cloud Composer (Apache Airflow) for efficient scheduling and orchestration.
Write clean, efficient, and scalable code in Python, leveraging advanced programming techniques.
Craft complex SQL queries in BigQuery, including window functions, CTEs, and performance tuning strategies.
Build and maintain real-time data processing systems using Apache Kafka.
Model and manage NoSQL databases, particularly MongoDB, with a focus on scalable schema design.
Utilize Shell scripting and perform Linux system administration tasks to support data infrastructure.
Conduct data profiling and implement validation techniques to ensure data quality and integrity.
Develop and maintain API integration scripts for seamless service automation and data exchange.
Troubleshoot and resolve data-related issues with strong analytical and problem-solving skills.
Create and maintain data flow diagrams to clearly communicate architecture and pipeline logic to stakeholders.