Job Details
Title: Sr. Gen AI/ML Engineer (12+ years overall experience)
Location: Richardson, TX (hybrid, 2-3 days onsite per week)
Type: Contract (C2C/W2)
Duration: 18+ months, with possible extension
Mode of Interview: Face-to-face (F2F)
Job Description
We are looking for a highly skilled AI Systems Contractor with deep expertise in designing, building, and optimizing advanced AI solutions. The ideal candidate will have a strong background in Large Language Models (LLMs), agentic AI workflows, and hands-on experience managing high-performance computing environments. A key requirement for this role is proven experience leveraging NVIDIA GPUs within a Red Hat OpenShift platform to accelerate AI/ML workloads. You will be responsible for the entire lifecycle of AI model development, from prototyping in Jupyter to deploying scalable solutions.
Key Responsibilities
Develop & Fine-Tune AI Models: Design, develop, and fine-tune Large Language Models (LLMs) and other deep learning models to meet project requirements.
Build Agentic AI Systems: Create and implement multi-agent AI systems and workflows that can perform complex, autonomous tasks.
GPU Optimization on OpenShift: Deploy, manage, and optimize AI/ML workloads on our OpenShift cluster, ensuring efficient utilization of NVIDIA GPU resources.
Python & Jupyter Development: Utilize Python and Jupyter Notebooks for rapid prototyping, data analysis, model training, and experimentation.
Pipeline & Infrastructure Management: Develop and maintain robust data and model pipelines, and troubleshoot performance bottlenecks related to compute, memory, and networking in a containerized environment.
Collaboration: Work closely with data scientists, engineers, and project managers to translate business needs into technical solutions and deliver on project milestones.
Required Qualifications & Experience
Proven, hands-on experience developing and working with AI systems, including Large Language Models (LLMs) and agentic AI.
Advanced proficiency in Python and extensive experience using its AI/ML ecosystem (e.g., PyTorch, TensorFlow, LangChain, Hugging Face).
Demonstrable expertise with Jupyter Notebooks for data science and model development workflows.
Direct, hands-on experience deploying and managing containerized applications that utilize NVIDIA GPUs on a Red Hat OpenShift platform.
Strong understanding of containerization technologies (Docker, Kubernetes) and CI/CD principles.
Excellent problem-solving skills with the ability to diagnose and resolve complex technical issues in high-performance computing environments.
Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.