GEN AI Engineer - 66973
We have an immediate long-term opportunity with one of our prime clients for a position of GEN AI Engineer to work in Irving, TX, Dallas, TX or Charlotte, NC on Hybrid basis.
Pay Rate: $55- $60/hr
Job Details:
Must Have Skills:
GEN AI, Agentic AI, ML Ops, Python, ML, Data Science, RAG, LLM
Nice to have skills:
Google Cloud Platform, Prompt Engineering
Certifications Needed: Yes
Detailed Job Description:
We are seeking a highly skilled Generative AI Engineer with a strong Python background to design, develop, and deploy cutting-edge AI solutions. The ideal candidate will have hands-on experience with Large Language Models (LLMs), prompt engineering, and Gen Aframeworks, along with expertise in building scalable AI applications.
Key Responsibilities:
Design and implement Generative AI models for text, image, or multimodal applications.
Develop prompt engineering strategies and embedding-based retrieval systems.
Integrate Gen AI capabilities into web applications and enterprise workflows.
Build agentic AI applications with context engineering and MCP tools.
Required Skills & Qualifications:
10+ years of hands-on experience in AI, Data science, ML, GEN AI.
Strong proficiency in Python and AI/ML frameworks (PyTorch, TensorFlow).
Hands on experience using session and memory for building multi-agent systems along with using MCP tools.
Hands-on experience with LLMs, transformers, and Hugging Face ecosystem.
Knowledge and experience with vector databases and RAG technique for semantic search.
Familiarity with cloud AI services (AWS SageMaker, Azure OpenAI, Google Cloud Platform Vertex AI).
Understanding of MLOps practices for scalable AI deployment.
Strong experience in working with LLM fine-tuning with LoRA, QLoRA, PEFT,
Strong experience in Architected advanced RAG systems using Pinecone, FAISS, Weaviate, Chroma, hybrid retrieval, and custom embeddings,
Strong experience in Designing end-to-end LLMOps/MLOps pipelines using MLflow, DVC, SageMaker Pipelines, Vertex AI Pipelines, and GitHub Actions
Experience in using cloud-native AI systems on AWS (SageMaker, Lambda, EKS, EC2, Step Functions, S3, Glue) and Google Cloud Platform Vertex AI, supporting high-volume inference and secure enterprise operations
Experience in developing multi-agent orchestration workflows using LangGraph and CrewAI for tool-calling, validation agents, automated reasoning, and workflow supervision
Top 3 responsibilities you would expect the Subcon to shoulder and execute:
Strong experience in GEN AI, LLM, RAG,ML, DL,ML Ops, LLMOps, Cloud platform,Model servicing optimization, Python
Strong communication skills
Strong programming skills
For immediate consideration, please contact:
Aadhishivam
PRIMUS Global Services
Phone: Desk Ext. 221
Email: