AIML - Sr Data Scientist, Data and ML Innovation

Cupertino, CA, US • Posted 12 hours ago • Updated 12 hours ago
Full Time
On-site

Job Details

Skills

  • Innovation
  • Data Mining
  • Generative Artificial Intelligence (AI)
  • Use Cases
  • Data Quality
  • Unstructured Data
  • Data Analysis
  • Pattern Recognition
  • Analytical Skill
  • Data Manipulation
  • SQL
  • Python
  • Apache Spark
  • Workflow
  • Statistics
  • Mathematics
  • Computer Science
  • Economics
  • Physics
  • Training And Development
  • Evaluation
  • Training
  • Optimization
  • Data Science
  • Machine Learning (ML)

Summary

We are a group of data scientists partnering with ML researchers and engineers who develop the foundation models that power Apple Intelligence features. We are pushing the boundaries of data science by developing novel techniques to evaluate the performance and capabilities of foundation models. In addition, we leverage data mining expertise to identify characteristics of training data that influence the performance of these models. If you are an accomplished data scientist who wants to expand your influence in the fast-evolving and exciting space of Generative AI, this is a great opportunity for you.

As a Sr Data Scientist partnering with ML researchers, you will bring your inquisitive mind and outstanding technical skills to unlock insights about the drivers that improve the effectiveness of foundation models. Some projects that you are likely to drive are:

  • Assessing existing foundation model evaluation techniques and improving them to suit Apple's use cases.
  • Studying loss patterns of models and attributing them to drivers across the model development life cycle, from pre-training to supervised fine-tuning (SFT) to reinforcement learning (RL).
  • Defining training data quality metrics and assessing the quality of training data to ensure it positively impacts model performance.
  • Driving interpretability of ablation studies, documenting takeaways, and driving a hypothesis-driven experimentation strategy.
  • Building tools and automation processes using LLMs to scale data science projects.

  • 5+ years of data science experience demonstrating strong impact on product or model performance.
  • Experience developing evaluation sets and metrics for foundation model performance measurement and diagnostics.
  • Proficiency in applying quantitative methods to structured and unstructured data for exploratory data analysis, pattern recognition, insights generation, metrics development, and scaling analytical tools.
  • Strong programming skills in large-scale data manipulation and processing, including SQL, Python, and Spark.
  • Experience leveraging LLMs in data science workflows.
  • Master's degree in a technical or quantitative field such as Statistics, Mathematics, Computer Science, Engineering, Economics, or Physics.

  • Deep understanding of the foundation model training and development life cycle.
  • Experience developing model evaluation simulation environments.
  • Experience generating synthetic data for foundation model training.
  • Experience with prompt optimization for improving the in-context learning performance of LLMs.
  • Experience building and deploying end-to-end Data Science/ML pipelines.
  • PhD degree preferred.
  • Dice Id: 90733111
  • Position Id: 7b6a10d4a6a7661e8c9c5f80104e03b7