Overview
Skills
Job Details
No Remote for Data Engineer- Only Onsite
We are seeking a Data Engineer with expertise in Google Cloud Platform (Google Cloud Platform) and Vertex AI to design, build, and optimize data pipelines supporting machine learning and analytics. The role requires hands-on experience in data engineering, ML-ready data preparation, and integration with Vertex AI pipelines for scalable AI/ML model development.
Key Responsibilities
- Design and implement scalable ETL/ELT pipelines using Dataflow, Dataproc, BigQuery, and Pub/Sub. 
- Collaborate with Data Scientists and MLOps teams to prepare and serve ML-ready datasets for training and inference on Vertex AI. 
- Integrate structured, semi-structured, and unstructured data from multiple sources into Google Cloud Platform data lake/warehouse. 
- Build feature pipelines and manage Vertex AI Feature Store. 
- Implement data quality checks, governance, and lineage in pipelines. 
- Optimize storage and compute costs across Google Cloud Platform services. 
- Support real-time and batch data processing for ML pipelines and analytics. 
- Ensure security, compliance, and monitoring of data pipelines. 
Required Skills & Experience
- Strong expertise with Google Cloud Platform data services: BigQuery, Dataflow (Apache Beam), Pub/Sub, Dataproc, Cloud Storage, Composer (Airflow). 
- Experience working with Vertex AI pipelines and Feature Store. 
- Strong SQL and Python programming skills. 
- Hands-on experience with data modeling, partitioning, performance optimization. 
- Proficiency with CI/CD for data pipelines (Cloud Build, Jenkins, GitHub Actions). 
- Familiarity with Terraform/IaC for Google Cloud Platform environment setup. 
- Knowledge of containerization (Docker, Kubernetes) for pipeline orchestration. 
Preferred Qualifications
- Google Cloud Platform Certifications: Professional Data Engineer or Professional Machine Learning Engineer. 
- Experience with Kubeflow, MLflow, or TFX for pipeline integration. 
- Exposure to data observability tools (Dataplex, Great Expectations, dbt). 
- Strong understanding of AI/ML lifecycle workflows and model deployment integration. 
Why Join Us?
- Work on data & AI-driven solutions at scale. 
- Collaborate with a global team of Data Engineers, MLOps Engineers, and Data Scientists. 
- Opportunity to grow into a Lead Data Engineer / MLOps Architect role.