Overview
On Site
USD 119,800.00 - 234,700.00 per year
Full Time
Skills
Innovation
Investments
Data Quality
Management
Workflow
Clustering
GitHub
Thought Leadership
Publishing
Econometrics
Electrical Engineering
Computer Engineering
Predictive Analytics
Research
Python
PyTorch
TensorFlow
scikit-learn
Data Analysis
Evaluation
Screening
PASS
Cloud Computing
Computer Science
Statistics
Training
Privacy
Regulatory Compliance
Artificial Intelligence
Apache Spark
Databricks
Microsoft Azure
Data Lake
Collaboration
Science
Internal Communications
IC
Integrated Circuit
SAP BASIS
Legal
Recruiting
Microsoft
Machine Learning (ML)
Data Science
Job Details
Join Microsoft's CoreAI team to build the AI Data Platform, the foundation for secure, scalable, reusable datasets that power model development .
The AI Data Platform team's mission is to build a central AI data platform that breaks down Microsoft's data silos and manages the full lifecycle of first-party, third-party, synthetic, and human-labeled data , a ccelerat ing AI model development with secure, reusable, and compliant datasets.
The AI Data Platform team is responsible for large-scale data infrastructure, automation tools, and intelligence services to transform how Microsoft collects, generates, manages, and shares AI training data.
We are seeking Applied Scientists to drive scientific innovation in data generation, validation, evaluation, and automation. You will set the vision for intelligent, ML-driven services that manage the end-to-end data lifecycle, and partner with leaders across Microsoft to ensure Microsoft's data investments deliver maximum AI impact.
Responsibilities:
Responsibilities
Qualifications:
Required Qualifications
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings:
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: ;br>
Microsoft posts positions for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form .
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
#DataPlatform, #AIJobs , #MachineLearning, #DataScience # CoreAI
The AI Data Platform team's mission is to build a central AI data platform that breaks down Microsoft's data silos and manages the full lifecycle of first-party, third-party, synthetic, and human-labeled data , a ccelerat ing AI model development with secure, reusable, and compliant datasets.
The AI Data Platform team is responsible for large-scale data infrastructure, automation tools, and intelligence services to transform how Microsoft collects, generates, manages, and shares AI training data.
We are seeking Applied Scientists to drive scientific innovation in data generation, validation, evaluation, and automation. You will set the vision for intelligent, ML-driven services that manage the end-to-end data lifecycle, and partner with leaders across Microsoft to ensure Microsoft's data investments deliver maximum AI impact.
Responsibilities:
Responsibilities
- Advancing machine learning and data science to improve data quality, automate dataset generation, and design intelligent agent-driven services that manage the end-to-end data lifecycle.
- Develop ML-based pipelines for data generation, validation, augmentation, and discovery (e.g., synthetic data, human-in-the-loop workflows).
- Design and train intelligent agents to automate key parts of the dataset lifecycle, including ingestion, validation, PII detection and handling, governance, discovery, and feedback loops.
- Build evaluation methods to measure dataset quality, coverage, and usefulness for large-scale model training.
- Leverage AI/ML techniques (e.g., classification, clustering, anomaly detection, embeddings, LLM-based evaluation) to improve data discovery, curation, and governance.
- Collaborate with engineers to integrate scientific methods and models into scalable pipelines and platform services.
- Partner with AI product and research teams ( CoreAI , MAI, M365, GitHub, MSR , and more ) to align datasets with model training needs and identify new opportunities.
- Contribute thought leadership by publishing or sharing insights internally and externally to shape Microsoft's data-centric AI practices.
Qualifications:
Required Qualifications
- Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience (e.g., statistics predictive analytics, research) OR Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research) OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 1+ year(s) related experience (e.g., statistics, predictive analytics, research) OR equivalent experience.
- 2+ years of experience applying machine learning or data science in practical settings.
- Programming skills in Python and ML frameworks (e.g., PyTorch , TensorFlow, Scikit-learn).
- Experience with data analysis, dataset design, or evaluation methodologies.
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings:
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years
- Master's degree or PhD in Computer Science, Machine Learning, Statistics, or related field, or equivalent experience.
- 4+ years of experience applying machine learning or data science in practical settings.
- Experience with LLM training pipelines, synthetic data generation, or data-centric AI approaches.
- Knowledge of PII detection, data privacy, fairness, or compliance in AI systems.
- Familiarity with distributed data systems (e.g., Spark, Databricks, Azure Data Lake).
- Strong collaboration skills with engineers, TPMs, and product partners across multiple orgs.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: ;br>
Microsoft posts positions for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form .
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
#DataPlatform, #AIJobs , #MachineLearning, #DataScience # CoreAI
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.