Job Responsibilities / Typical Day in the Role
l Build and iterate on predictive models to project content performance
• Test different model types (e.g., gradient boosting, regression) and iterate based on accuracy and user interpretability needs
• Complete model experiments to validate hypotheses
• Proactively identify opportunities to improve performance and contribute to model development roadmap
Explore and segment content types
• Use clustering, principal component analysis, and other unsupervised methods to identify patterns in performance drivers and content types
• Experiment with GenAI to enrich data (e.g., augmenting metadata tagging or generating synthetic attributes)
Collaborate with business users to refine models and drive adoption
• Infuse models and model approach with users’ domain expertise and decision-making workflows
• Present early model outputs in accessible ways to solicit feedback and identify gaps, overlooked variables, and implicit assumptions from users
• Interpret user feedback and adapt model design and underlying data to prioritize interpretability, usability, or explainability where required to build trust and drive adoption
Maintain and monitor models
• Work with ML Engineers to deploy models in production, including API endpoints, and establish ML pipeline
• Contribute to shared libraries and modeling best practices across the data science team
Partner with product and platform teams to deliver impact
• Contribute to design and implementation of new products where data science models will be embedded, built by agile product PODs
• Collaborate with data architects, data engineers, and broader platform team to facilitate technical discussions and enrich the data available for data science
Must Have Skills / Requirements
1) Proven experience as a Data Scientist
a. 4+ years of experience
2) Python and common ML library experience
a. 4+ years of experience; (e.g., scikit-learn, XGBoost, pandas).
3) Familiarity with AWS tools, especially SageMaker, or equivalent cloud-based ML environments
a. 4+ years of experience
Nice to Have Skills / Preferred Requirements
1) Experience in media & entertainment industry is a strong advantage but not required
2) SQL fluency is a plus
Soft Skills:
1) Curiosity and capability to work in an experimental stage of development to test hypotheses and adjust approaches to deliver the most value to business users
2) Experience building predictive models, especially with limited sample sizes
3) Understanding of clustering and dimensionality reduction techniques
4) Exposure to generative AI models (e.g., LLMs, diffusion models) and an interest in applying them to real-world data problems
5) Strong communication skills and the ability to translate data science work into business value as well as translate business user needs into data science
Technology Requirements:
1) Strong skills in Python and common ML libraries (e.g., scikit-learn, XGBoost, pandas).
2) Familiarity with AWS tools, especially SageMaker, or equivalent cloud-based ML environments
3) Experience building predictive models, especially with limited sample sizes
4) Understanding of clustering and dimensionality reduction techniques
5) Exposure to generative AI models (e.g., LLMs, diffusion models) and an interest in applying them to real-world data problems
Education / Certifications
1) None required
Interview Process / Next Steps
1) 1-2 rounds - Manager and VP