Senior Software Engineer, Vision Language Models

Overview

Remote
$175,000 - $230,000
Full Time

Skills

Master degree
PyTorch
Machine Learning (ML)
Language Models
VLMs
Vision-Language Models
radar
autonomous

Job Details

Job Responsibilities:

  • Spearhead the development of cutting-edge data products by adapting and extending Vision-Language Models (VLMs) and other multimodal foundation models. This includes applying advanced techniques like fine-tuning, RAG, in-context learning, continual pre-training, and knowledge distillation.
  • Design and curate high-quality multimodal datasets crucial for training and evaluating multimodal foundation models. This includes developing innovative strategies for data curation, dataset creation, and synthetic data generation to optimize multimodal foundation models for long-tail event mining.
  • Drive the in-depth analysis of multimodal foundation models' performance, generalization, and robustness in diverse real-world settings

Job Qualifications:

  • MS/PhD in computer science or related fields with a strong emphasis on multimodal foundation models
  • Strong publication record in premier conferences (e.g., CVPR, ICCV, ECCV, NeurIPS, ICML, ICLR) demonstrating significant contributions to the field of vision-language understanding or multimodal foundation models
  • Proficiency in Python and deep learning frameworks such as PyTorch, with a demonstrated ability to write clean, efficient, and maintainable code

Preferred Skills:

  • Experience in the application of Vision-Language Models (VLMs) or other multimodal foundation models to data mining in real-world settings
  • Experience in production deployment of Vision-Language Models (VLMs) or other multimodal foundation models for real-world applications (e.g., image/video captioning, open-vocabulary image/video searching)
  • Experience with data from diverse sensor modalities (e.g., camera, lidar, radar)
  • Experience in applied machine learning for autonomous driving
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About Protingent, Inc.