Overview
Skills
Job Details
We are seeking a highly skilled LLM Engineer with expertise in Google Cloud Platform (Google Cloud Platform), Vertex AI, and algorithm development to design, implement, and optimize cutting-edge AI solutions. You will work on building, fine-tuning, and deploying Large Language Models (LLMs) for real-world business applications.
Key Responsibilities-
Design, develop, and optimize algorithms for LLM training, inference, and deployment on Google Cloud Platform.
-
Implement Vertex AI pipelines for data preprocessing, model training, evaluation, and deployment.
-
Fine-tune LLMs (e.g., PaLM, Gemini, open-source models) to meet domain-specific requirements.
-
Integrate LLM solutions into enterprise applications ensuring scalability, security, and performance.
-
Collaborate with data scientists, cloud architects, and MLOps engineers to ensure seamless delivery.
-
Monitor and evaluate model performance using metrics, A/B testing, and continuous improvement methods.
-
Apply prompt engineering and retrieval-augmented generation (RAG) techniques for improved accuracy.
-
Optimize cost and performance for AI workloads on Google Cloud Platform.
-
3+ years of experience in AI/ML engineering with a focus on LLMs.
-
Strong proficiency in Python and ML frameworks such as TensorFlow, PyTorch, or JAX.
-
Hands-on experience with Google Cloud Platform services, especially Vertex AI.
-
Proven track record in developing and optimizing algorithms for NLP and machine learning tasks.
-
Experience with vector databases, embeddings, and semantic search.
-
Familiarity with MLOps practices and CI/CD for ML models.
-
Solid understanding of data preprocessing, feature engineering, and model evaluation.
-
Experience with PaLM API, Gemini models, or other large-scale transformer-based architectures.
-
Knowledge of Kubernetes, Docker, and cloud-native development on Google Cloud Platform.
-
Experience with RAG, LangChain, or LlamaIndex frameworks.
-
Strong problem-solving skills and ability to work in agile teams.
-
Competitive compensation package.
-
Opportunity to work with cutting-edge AI technologies.
-
Flexible working hours and remote work options.