Applied AI Scientist - Multimodal Intelligence

Washington, WA, US • Posted 21 hours ago • Updated 8 hours ago
Full Time
On-site
Fitment

Dice Job Match Score™

🔗 Matching skills to job...

Job Details

Skills

  • Innovation
  • Art
  • Data Collection
  • Modeling
  • Evaluation
  • Forms
  • Shipping
  • Sensors
  • Artificial Intelligence
  • Publications
  • Training
  • Language Models
  • Computer Hardware
  • Generative Artificial Intelligence (AI)
  • Hardware Development
  • Collaboration
  • Video
  • Python
  • PyTorch
  • JAX
  • Rapid Prototyping
  • Research
  • Computer Vision
  • Computer Science
  • Deep Learning
  • Storage
  • Reasoning
  • Privacy
  • Machine Learning (ML)

Summary

Imagine what you could do here. At Apple, new ideas have a way of becoming extraordinary products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. Multifaceted, amazing people and inspiring, innovative technologies are the norm here. The people who work here have reinvented entire industries with all Apple Hardware products. The same passion for innovation that goes into our products also applies to our practices, strengthening our commitment to leave the world better than we found it. Join us in this truly exciting era of Artificial Intelligence to help deliver the next groundbreaking Apple products & experiences! We are continuously advancing the state of the art in Computer Vision and Machine Learning, touching all aspects of language and multimodal foundation models, from data collection, data curation to modeling, evaluation and deployment. As a member of our dynamic group, you will have the unique and rewarding opportunity to craft upcoming research directions in the field of multimodal foundation models that will inspire future Apple products. You will be working alongside highly accomplished and deeply technical scientists and engineers to develop pioneering solutions for challenging problems. This is a unique opportunity to be part of what forms the future of Apple products that will touch the lives of many people. We (Multimodal Intelligence Team) are looking for an Applied AI Scientist to work on the field of Generative AI and multimodal foundation models. Our team has an established track record of shipping features that leverage multiple sensors, such as FaceID, RoomPlan and hand tracking in VisionPro, as well as a strong research presence in the multimodal AI community. Our publications span multimodal pre-training, vision-language models, video-language models, and multimodal alignment. We are focused on building experiences that demonstrate the power of our sensing hardware as well as large foundation models.

This position requires a highly motivated person who wants to help us bridge the gap between research advances and practical applications in generative AI and multimodal foundation models. You will be responsible for evaluating and adapting emerging research, conducting applied research experiments, and working with engineering teams to transform promising approaches into robust solutions, taking into account future hardware design and product needs. In addition, you will have an opportunity to engage and collaborate with several teams across Apple to deliver the best products.

Experience in deep learning with demonstrated work in at least one area of multimodal systems (e.g. vision, language, video, etc.).\nProficiency in Python and in a modern deep learning framework such as PyTorch or JAX.\nExperience with rapid prototyping, reproduction, and validation of research ideas.\nMaster's or equivalent practical experience, in Computer Science, Computer Vision, Machine Learning, or related technical field.

PhD, or equivalent practical experience, in Computer Science, Machine Learning, or a related technical field.\nDemonstrated expertise in deep learning, with either: A publication record in relevant conferences (e.g., NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV,COLM, etc), or a strong track record of applying deep learning techniques to real-world products.\nExperience with foundation models (language or multimodal).\nFamiliarity with large-scale data pipelines, including data curation, preprocessing, and efficient storage.\nExperience fine-tuning or optimizing large models for production deployment.\nExperience applying foundation models to build autonomous or semi-autonomous agents, including planning, task decomposition, and multi-step reasoning.\nFamiliarity with privacy-preserving or on-device machine learning.\nAbility to work effectively in a multi-functional, collaborative environment.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 90733111
  • Position Id: 6668ce50ece8874ccf31023198783a46
  • Posted 21 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Washington

Today

Full-time

Washington

Today

Full-time

Washington

Today

Full-time

Washington

Today

Full-time

Search all similar jobs