Job Title: Senior /Lead Google Cloud Platform Data Engineer
Location: Remote
Duration: / Term: Fulltime
Job Description:
Experience Desired: 10+ Years
Job Description:
We are looking for a Senior Data Engineer / Lead with deep expertise in Google Cloud Platform, distributed data processing, and pipeline architecture. This role will own the design and implementation of enterprise-scale data platforms, mentor junior engineers, and drive data engineering best practices.
Key Responsibilities
- Architect and build scalable, high-performance data pipelines on Google Cloud Platform
- Lead design of data platforms using Dataproc, BigQuery, Cloud Storage
- Develop and optimize large-scale ETL pipelines using PySpark
- Design and manage complex Airflow workflows (DAGs)
- Drive data architecture decisions and best practices
- Collaborate with business, product, and analytics teams
- Mentor junior engineers and conduct code reviews
- Ensure data governance, quality, and security compliance
- Optimize cost and performance of cloud infrastructure
Required Skills
10+ years of experience in Data Engineering / Big Data
- Strong hands-on expertise in:
- Python (advanced)
- PySpark / Spark (deep optimization knowledge)
- Apache Airflow (complex DAGs, scaling)
- Deep experience with Google Cloud Platform stack:
- Dataproc
- BigQuery
- Cloud Storage
- Pub/Sub (expected at this level)
- Strong SQL, data modeling & data warehousing concepts
- Experience designing large-scale distributed systems
- Proven experience leading projects / teams
Good to Have
- Experience with real-time processing (Kafka / PubSub / Streaming)
- CI/CD for data pipelines
- Infrastructure as Code (Terraform)
- Google Cloud Platform certifications (Professional Data Engineer preferred)
Behavioral Expectations
- Strong ownership mindset
- Ability to influence stakeholders
- Mentorship & leadership capability
- Strategic thinking
Skills
Dataproc (Spark cluster management), BigQuery (data warehousing, optimization), Cloud Storage (GCS)Pub/Sub (for streaming use cases)
Apache Airflow
- Complex DAG design
- Dependency handling
- Scheduling & scaling
Key Skills:
Google Cloud Platform, Dataproc, BigQuery, Cloudstorage, Airflow, Pub/Sub,