Overview
Skills
Job Details
Title: AI Platform Engineer
Location: New York City, NY | Hybrid
Type: Full-time / Contract
Start Date: ASAP
Job Summary:
We are seeking a highly experienced AI Platform Engineer with 10+ years of experience in designing and managing large-scale AI/ML infrastructure. In this role, you will lead the development and automation of production-grade machine learning platforms using Google Cloud Platform and modern MLOps tools.
Responsibilities:
Architect, build, and manage robust AI/ML infrastructure using Vertex AI, Kubeflow, TFX, and Cloud AI Platform.
Design and maintain CI/CD pipelines for ML models using Cloud Build, Cloud Functions, and Artifact Registry.
Automate model training, deployment, and monitoring using Vertex AI Pipelines and Cloud Composer.
Work with ML engineers and data scientists to ensure scalability, reliability, and operational efficiency of ML workloads.
Manage and provision infrastructure using Terraform, Deployment Manager, or other IaC tools.
Ensure end-to-end compliance with security, privacy, and governance standards.
Monitor and optimize performance of AI workloads on GKE, Cloud Run, and Cloud Functions.
Required Skills:
10+ years of experience in AI infrastructure, DevOps, or MLOps roles, with a strong focus on cloud-based ML systems.
Deep knowledge of Google Cloud Platform (Google Cloud Platform) and tools like Vertex AI, Kubeflow, and TFX.
Proven experience building CI/CD workflows and automation for ML deployment pipelines.
Advanced skills in Python, Terraform, and cloud-native DevOps tools.
Strong understanding of ML lifecycle, including versioning, governance, and security compliance.
Experience managing production systems at scale using GKE, Cloud Run, and Cloud Functions.
Note - “Energy & Utility domain with experience in Google solutions”
The person has worked in the Energy & Utilities industry — for example, in areas like: Power generation or distribution (electric companies) Gas or water utilities Renewable energy (solar, wind, etc.) Smart grid, metering, or infrastructure optimization AND The person has hands-on experience using Google Cloud (Google Cloud Platform) tools or solutions, such as: Vertex AI, BigQuery ML, AutoML, Cloud AI Platform, etc. Building AI/ML models, analytics, or data pipelines using Google Cloud services In simple terms: They understand Energy & Utility business problems and know how to solve them using Google Cloud technologies.