Senior Software Engineer, Vision Language Models

Overview

Remote
$175 - $234
Full Time
No Travel Required
Able to Provide Sponsorship

Skills

VLM
Object Detection
Computer Vision
Edge Detection

Job Details

Protingent Staffing has an exciting Remote Direct Hire opportunity.

Position Title: Senior Software Engineer, Vision Language Models
Job Responsibilities:

  • Spearhead the development of cutting-edge data products by adapting and extending Vision-Language Models (VLMs) and other multimodal foundation models. This includes applying advanced techniques like fine-tuning, RAG, in-context learning, continual pre-training, and knowledge distillation.
  • Design and curate high-quality multimodal datasets crucial for training and evaluating multimodal foundation models. This includes developing innovative strategies for data curation, dataset creation, and synthetic data generation to optimize multimodal foundation models for long-tail event mining.
  • Drive the in-depth analysis of multimodal foundation models' performance, generalization, and robustness in diverse real-world settings

Job Qualifications:

  • MS/PhD in computer science or related fields with a strong emphasis on multimodal foundation models
  • Strong publication record in premier conferences (e.g., CVPR, ICCV, ECCV, NeurIPS, ICML, ICLR) demonstrating significant contributions to the field of vision-language understanding or multimodal foundation models
  • Proficiency in Python and deep learning frameworks such as PyTorch, with a demonstrated ability to write clean, efficient, and maintainable code

Preferred Skills:

  • Experience in the application of Vision-Language Models (VLMs) or other multimodal foundation models to data mining in real-world settings
  • Experience in production deployment of Vision-Language Models (VLMs) or other multimodal foundation models for real-world applications (e.g., image/video captioning, open-vocabulary image/video searching)
  • Experience with data from diverse sensor modalities (e.g., camera, lidar, radar)
  • Experience in applied machine learning for autonomous driving

Job Details:

  • Direct Hire
  • Location: Remote, in the US
  • Salary Range: $175 234k/year
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About Protingent, Inc.