Gen AI Engineer with Google Cloud Platform

Overview

On Site
Depends on Experience
Contract - W2
Contract - 12 Month(s)

Skills

Vertex
LLM
Gen AI

Job Details

Role: Gen AI Engineer with Google Cloud Platform

Location: Dallas, TX (Onsite)

Must have: Gen AI, LLM, RAG, MLOps, Vertex AI, Google Cloud Platform experience

  • Design and build end-to-end AI/ML systems and applications, from experimentation and data preprocessing to production deployment.
  • Implement and optimize Generative AI models (text, image, multimodal) and integrate capabilities like Retrieval-Augmented Generation (RAG) and prompt engineering strategies to enhance LLMs with external knowledge sources.
  • Leverage a wide range of Google Cloud Platform services, including Vertex AI, BigQuery, Cloud Run, GKE (Google Kubernetes Engine), Dataflow, and Pub/Sub, to build, train, and deploy custom AI models and solutions.
  • Manage the entire model lifecycle, including training, evaluation, fine-tuning, versioning, deployment, and monitoring performance in production environments.
  • Optimize models and systems for performance, scalability, efficiency, and cost, using techniques such as model quantization and GPU memory optimization.
  • Build and maintain scalable, reliable ML pipelines using MLOps practices, with Docker for containerization, Kubernetes for orchestration, and CI/CD pipelines for automated deployment.
  • Document technical designs, processes, and best practices, and mentor junior team members where needed.