Do you have a passion for deep learning and computer vision problems? We are looking for someone who thrives on collaboration and wants to push the boundaries of what is possible today!\\n\\nJoin our team of committed deep learning engineers in the Video Computer Vision group! We are a centralized applied research and engineering organization responsible for developing real-time on-device Computer Vision, Machine Perception, and Generative technologies across Apple products. Our shipped technologies power features in ARKit, MeasureApp, RoomScan, Accessibility, and multiple VisionPro features. \\n\\nAs a member of the Video Computer Vision group you will develop new technologies in the area of scene understanding and for Apple's next generation products.
We are looking for a skilled Deep Learning Engineer for our team. In this role, you will perform research and development work to design algorithms for challenging real world problems in the domain of scene understanding.
BS in Computer Science or related field with a minimum of 3 years of relevant industry experience.\nExperience in designing and training deep learning networks for image understanding tasks, e.g. image classification, object detection, semantic segmentation, panoptic segmentation, etc.\nExperience in developing downstream perception algorithms with vision-language models, e.g. CLIP \nSolid mathematical foundation of machine learning and deep learning techniques.\nStrong coding skills in python (with pytorch) and C/C++. Solid mathematical foundation of machine learning and deep learning techniques.
PhD degree with focus on machine learning, computer vision, robotics or MS with a comparable industry career of 3+ years.\nConsistent track record of researching, inventing and/or shipping advanced machine learning algorithms.\nExperience in language-guided image understanding tasks e.g. open-vocabulary image classification, language-guided visual grounding, open-vocabulary semantic segmentation, etc.\nExperience with designing and training with pipelines which consume large (billion scale) data for training efficient vision language models for edge-devices. This includes data curation for training vision language models, writing efficient data loading pipelines, utilizing distributed GPU training framework.\nExperience with advanced task-specific quality optimization techniques (few-shot learning, meta-learning, domain adaptation, knowledge-distillation, fine-grained learning) for improving network performance and handling specific failure cases (long-tailed distributions/under-represented classes) for downstream tasks.\nExperience in designing and optimizing network towards inference efficiency.\nStrong coding skills in ObjectiveC.\nExcellent communication and collaboration skills.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
- Dice Id: 90733111
- Position Id: c0403a6e6d1dc2317d6f31f8e25b8a04
- Posted 12 hours ago