Overview
Skills
Job Details
GEN AI Architect with Google Cloud Platform
Location: Louisville, KY
Duration: 12+ Months (Contract)
We are seeking an experienced and innovative Generative AI Architect to lead the design, development, and deployment of cutting-edge AI/ML solutions leveraging the Google Cloud Platform (Google Cloud Platform). The ideal candidate will have deep expertise in Large Language Models (LLMs), RAG systems, and production-grade MLOps using Vertex AI.
Key Responsibilities
AI/ML Architecture: Architect and implement end-to-end AI/ML and Generative AI solutions on Google Cloud Platform using core services like Vertex AI, BigQuery, Cloud Storage, Cloud Composer, and Dataflow.
LLM Development: Develop, fine-tune, and integrate Large Language Models (LLMs) into critical enterprise applications, focusing on robust and scalable performance.
GenAI Pipeline Engineering: Design and build advanced Generative AI pipelines, including complex Retrieval-Augmented Generation (RAG) systems, utilizing frameworks such as LangChain and Semantic Kernel, and integrating with Vector Databases.
Collaboration & Strategy: Partner with cross-functional business and technical teams to identify, prioritize, and translate high-impact GenAI use cases into technical roadmaps and deliverable solutions.
Responsible AI: Ensure all deployed AI systems adhere to strict Responsible AI principles, data privacy regulations, and enterprise security standards.
Optimization & Scaling: Optimize model performance, scalability, and cost efficiency in production environments on Google Cloud Platform.
Mentorship: Provide technical guidance and mentorship to junior engineers, contributing to the establishment of best practices in AI engineering and MLOps.
Required Skills and Experience
< data-path-to-node="7">Google Cloud Platform & Cloud Expertise</>Strong proficiency in the core Google Cloud Platform (Google Cloud Platform) AI/ML and data services: Vertex AI (training, deployment, monitoring), BigQuery, Cloud Spanner, Cloud Functions, and Pub/Sub.
Hands-on experience with pipeline orchestration tools like Cloud Composer or Cloud Dataflow.
Expertise in working with and deploying Large Language Models (LLMs), including Google's Gemini family or similar models.
Proven hands-on experience with Prompt Engineering techniques (e.g., Zero-shot, Few-shot, Chain-of-Thought).
Deep knowledge of frameworks like LangChain and experience utilizing external APIs like Azure/OpenAI APIs.
Demonstrable experience integrating and managing data in Vector Databases for RAG implementations.
Expertise in programming languages like Python and/or Java.
Strong background in leading AI/ML frameworks (TensorFlow, PyTorch).
Familiarity with containerization and orchestration using Docker and Kubernetes.
Understanding of CI/CD pipelines for model deployment and lifecycle management.
Solid comprehension of Responsible AI, data privacy, and model governance best practices.
Key Words to Search in Resume
To ensure your application is recognized, please highlight experience with the following:
Large Language Models (LLMs), Gemini, Vertex AI, Big Query, Cloud Storage, Cloud Composer, Dataflow, AI, ML, RAG, LangChain, Vector Databases.