Data Engineer Google Cloud Platform & Vertex AI

  • Alpharetta, GA
  • Posted 54 days ago | Updated 1 day ago

Overview

On Site
Hybrid
$60 - $80
Accepts corp to corp applications
Contract - Independent
Contract - W2
Contract - 36 Month(s)
Able to Provide Sponsorship

Skills

Analytics
Apache Beam
Artificial Intelligence
Cloud Computing
Cloud Storage
Collaboration
Continuous Delivery
Continuous Integration
Data Engineering
Data Flow
Data Lake
Data Modeling
Data Processing
Data Quality
Docker
ELT
Extract
Transform
Load
GitHub
Good Clinical Practice
Google Cloud
Google Cloud Platform
Jenkins
Kubernetes
Machine Learning (ML)
Machine Learning Operations (ML Ops)
Management
Orchestration
Performance Tuning
Python
Real-time
Regulatory Compliance
SQL
Storage
Terraform
Training
Unstructured Data
Vertex
Warehouse
Workflow

Job Details

No Remote for Data Engineer- Only Onsite


We are seeking a Data Engineer with expertise in Google Cloud Platform (Google Cloud Platform) and Vertex AI to design, build, and optimize data pipelines supporting machine learning and analytics. The role requires hands-on experience in data engineering, ML-ready data preparation, and integration with Vertex AI pipelines for scalable AI/ML model development.


Key Responsibilities

  • Design and implement scalable ETL/ELT pipelines using Dataflow, Dataproc, BigQuery, and Pub/Sub.

  • Collaborate with Data Scientists and MLOps teams to prepare and serve ML-ready datasets for training and inference on Vertex AI.

  • Integrate structured, semi-structured, and unstructured data from multiple sources into Google Cloud Platform data lake/warehouse.

  • Build feature pipelines and manage Vertex AI Feature Store.

  • Implement data quality checks, governance, and lineage in pipelines.

  • Optimize storage and compute costs across Google Cloud Platform services.

  • Support real-time and batch data processing for ML pipelines and analytics.

  • Ensure security, compliance, and monitoring of data pipelines.


Required Skills & Experience

  • Strong expertise with Google Cloud Platform data services: BigQuery, Dataflow (Apache Beam), Pub/Sub, Dataproc, Cloud Storage, Composer (Airflow).

  • Experience working with Vertex AI pipelines and Feature Store.

  • Strong SQL and Python programming skills.

  • Hands-on experience with data modeling, partitioning, performance optimization.

  • Proficiency with CI/CD for data pipelines (Cloud Build, Jenkins, GitHub Actions).

  • Familiarity with Terraform/IaC for Google Cloud Platform environment setup.

  • Knowledge of containerization (Docker, Kubernetes) for pipeline orchestration.


Preferred Qualifications

  • Google Cloud Platform Certifications: Professional Data Engineer or Professional Machine Learning Engineer.

  • Experience with Kubeflow, MLflow, or TFX for pipeline integration.

  • Exposure to data observability tools (Dataplex, Great Expectations, dbt).

  • Strong understanding of AI/ML lifecycle workflows and model deployment integration.


Why Join Us?

  • Work on data & AI-driven solutions at scale.

  • Collaborate with a global team of Data Engineers, MLOps Engineers, and Data Scientists.

  • Opportunity to grow into a Lead Data Engineer / MLOps Architect role.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About Hexacorp