Overview
Remote
$120,000 - $140,000
Full Time
Accepts corp to corp applications
Skills
MLOps
Large Language Models (LLMs)
Multi-modal Large Language Models (MLLMs)
large-scale video AI
AI/ML
Artificial Intelligence
MLFlow
NVIDIA
Job Details
Position: Lead AI Engineer Video & Multimodal AI
Location: Remote
Duration: Full Time
Experience Level: 10+ Years Experience
Job Description:
About the Role:
We are hiring a Lead AI Engineer to spearhead the design, fine-tuning, and scalable deployment of cutting-edge AI systems, with a focus on deep learning, video intelligence, and multi-modal (vision + language) models. The ideal candidate has a strong academic foundation, preferably from Ivy League institutions and proven experience in driving innovative AI solutions from research to production.
Key Responsibilities:
- Architect and lead the development of large-scale video AI and vision-language models (VLMs).
- Fine-tune and optimize Large Language Models (LLMs) and Multi-modal Large Language Models (MLLMs) for task-specific applications.
- Scale model training and evaluation across distributed systems with an emphasis on GPU/accelerated environments.
- Build and maintain robust AI pipelines for training, evaluation, benchmarking, and deployment using state-of-the-art MLOps tools.
- Drive performance optimization of models for real-time inference using tools like TensorRT, ONNX, and NVIDIA Triton.
- Collaborate cross-functionally with data scientists, researchers, and platform engineers to align model development with business goals.
- Publish internal/external papers and contribute to IP creation and thought leadership in AI innovation.
Minimum Qualifications:
- MS or Postgraduate degree in Computer Science or related field (PhD preferred); strong preference for Ivy League graduates.
- 10+ years of industry or research experience in AI/ML, with a focus on Deep Learning, Video AI, and multi-modal systems.
- Advanced proficiency in Python and DL frameworks such as PyTorch and TensorFlow.
- Deep expertise in fine-tuning LLMs and MLLMs, including prompt engineering, transfer learning, and embedding-based techniques.
- Proven experience scaling AI model training and inference across multi-GPU and distributed compute platforms.
- Strong hands-on knowledge of MLOps practices, including Docker, Kubernetes, MLFlow, and model serving.
Preferred Skills:
- Familiarity with NVIDIA s AI ecosystem (TensorRT, Triton Inference Server, DeepStream SDK).
- Experience with retrieval-augmented generation (RAG), attention-based models, and real-time video inference.
- Prior experience in leading AI teams or projects and mentoring junior researchers/engineers.
- Publications, patents, or open-source contributions in the field of AI/ML.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.