Job Details
We are seeking a highly skilled LLM Engineer with deep experience in Google Cloud Platform (GCP) and Vertex AI to build, fine-tune, and deploy Generative AI applications. You will work on scalable machine learning pipelines, custom LLM integrations, and AI-based solutions across a variety of domains.
Key Responsibilities:
- Design, develop, and deploy LLM-based applications using Vertex AI, Generative AI Studio, and Google Cloud Platform services
- Fine-tune and optimize foundation models (PaLM, Gemini) for business-specific tasks
- Build end-to-end ML pipelines: data ingestion, preprocessing, training, evaluation, and serving
- Implement prompt engineering, embedding-based retrieval, and RAG pipelines
- Integrate AI models into production systems with CI/CD and MLOps best practices
- Collaborate with data scientists, ML engineers, and DevOps teams to deploy scalable solutions
- Ensure model governance, monitoring, and responsible AI compliance
Qualifications:
- 5+ years in ML/AI engineering with recent experience in LLMs and Generative AI
- Strong hands-on experience with Vertex AI, BigQuery, GCS, and Cloud Functions
- Experience fine-tuning LLMs (e.g., BERT, GPT, PaLM) using tools like Keras, PyTorch, or TFX
- Familiarity with RAG (Retrieval-Augmented Generation) and LangChain/LlamaIndex
- Proficient in Python, REST APIs, and Google Cloud Platform SDKs
- Working knowledge of MLOps on Google Cloud Platform
- Bachelor's or Master's in Computer Science, AI/ML, or related field
- Google Cloud Platform Professional Machine Learning Engineer certification
- Experience with multi-modal models, Vision-Language tasks, or chatbot development
- Exposure to GKE, Cloud Run, and Kubeflow Pipelines