Overview
Skills
Job Details
No Remote for Data Engineer- Only Onsite
We are seeking a Data Engineer with expertise in Google Cloud Platform (Google Cloud Platform) and Vertex AI to design, build, and optimize data pipelines supporting machine learning and analytics. The role requires hands-on experience in data engineering, ML-ready data preparation, and integration with Vertex AI pipelines for scalable AI/ML model development.
Key Responsibilities
Design and implement scalable ETL/ELT pipelines using Dataflow, Dataproc, BigQuery, and Pub/Sub.
Collaborate with Data Scientists and MLOps teams to prepare and serve ML-ready datasets for training and inference on Vertex AI.
Integrate structured, semi-structured, and unstructured data from multiple sources into Google Cloud Platform data lake/warehouse.
Build feature pipelines and manage Vertex AI Feature Store.
Implement data quality checks, governance, and lineage in pipelines.
Optimize storage and compute costs across Google Cloud Platform services.
Support real-time and batch data processing for ML pipelines and analytics.
Ensure security, compliance, and monitoring of data pipelines.
Required Skills & Experience
Strong expertise with Google Cloud Platform data services: BigQuery, Dataflow (Apache Beam), Pub/Sub, Dataproc, Cloud Storage, Composer (Airflow).
Experience working with Vertex AI pipelines and Feature Store.
Strong SQL and Python programming skills.
Hands-on experience with data modeling, partitioning, performance optimization.
Proficiency with CI/CD for data pipelines (Cloud Build, Jenkins, GitHub Actions).
Familiarity with Terraform/IaC for Google Cloud Platform environment setup.
Knowledge of containerization (Docker, Kubernetes) for pipeline orchestration.
Preferred Qualifications
Google Cloud Platform Certifications: Professional Data Engineer or Professional Machine Learning Engineer.
Experience with Kubeflow, MLflow, or TFX for pipeline integration.
Exposure to data observability tools (Dataplex, Great Expectations, dbt).
Strong understanding of AI/ML lifecycle workflows and model deployment integration.
Why Join Us?
Work on data & AI-driven solutions at scale.
Collaborate with a global team of Data Engineers, MLOps Engineers, and Data Scientists.
Opportunity to grow into a Lead Data Engineer / MLOps Architect role.