Apply Now

SR. Data Scientist

San Bruno, CA, US • Posted 2 days ago • Updated 2 days ago

Full Time

No Travel Required

On-site

Depends on Experience

Fitment

Dice Job Match Score™

👾 Reticulating splines...

Job Details

Skills

Python
Data Engineering
PySpark
Natural Language Processing
SQL
Machine Learning (ML)

Summary

We are seeking a highly skilled ML / NLP Evaluation Engineer with strong expertise in Python, large-scale data engineering, and NLP model evaluation. The ideal candidate will have hands-on experience building and validating data pipelines, evaluating ranking/search models using offline metrics, and working with modern cloud-based ML ecosystems.

Required Qualifications

5+ years of experience in Software Engineering, Data Engineering, or ML Engineering.

Strong programming expertise in Python.

Hands-on experience with data engineering frameworks such as PySpark or Google Dataflow.

Experience working with large-scale structured and unstructured datasets.

Strong understanding of NLP model evaluation methodologies and ranking systems.

Hands-on experience with offline evaluation metrics such as:

nDCG (Normalized Discounted Cumulative Gain)

MRR (Mean Reciprocal Rank)

Precision@K / Recall@K

Experience designing and executing A/B testing and experimentation frameworks.

Strong SQL skills and experience with distributed data processing systems.

Experience building ETL/data pipelines for ML workflows.

Familiarity with machine learning lifecycle, model validation, and performance monitoring.

Experience with REST APIs, data validation, and automation scripting.

Strong understanding of SDLC, Agile methodologies, and CI/CD practices.

Excellent analytical, debugging, and problem-solving skills.

Technologies & Tools

Python
PySpark / Google Dataflow
SQL
Google Cloud Platform (preferred)
BigQuery
Feast / Tecton
Airflow
Git / CI-CD
NLP & Ranking Evaluation Frameworks
Experimentation & A/B Testing Tools

Preferred Domain Experience

Search/Relevance Engineering
NLP / LLM Systems
Recommendation Engines
E-commerce Search
AI/ML Platforms
Information Retrieval Systems

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 91052859
Position Id: 8977052
Posted 2 days ago

Contact the job poster

Bheeshma Maha

Recruiter @ Data Capital Inc

View Profile

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Data Scientist

San Francisco, California

•

7d ago

Our Purpose Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest pot

Full-time

USD 138,000.00 - 221,000.00 per year

Senior ML Data Engineer, MLO

San Francisco, California

•

Today

Do you believe Machine Learning and AI can change the world? We truly believe it can! We are the ML Data Team of the Intelligent System Experience (ISE) group at Apple. We are responsible for building high quality datasets at scale. Every year, our team produces datasets used in the training of ML and AI-centric features for many Apple products, including iPhone, iPad, Mac, Apple Watch and even AirPods. Our work is used in very visible and critical features, from the wallpaper on your iPhone Loc

Full-time

Senior Machine Learning Engineer, Rider (Multiple Teams)

San Francisco, California

•

Today

About the Teams The Aura team powers a real-time ML engine personalizing the booking experience for millions of riders. By predicting preferences and recommending Rides products, it drives conversion using hundreds of marketplace and historical features and generates billions in incremental revenue. We stay at the forefront of innovation by employing cutting-edge techniques like multi-task learning, sequence modeling, and transformers, applying statistical and operations research principles glo

Full-time

USD 202,000.00 - 224,000.00 per year

Forward-Deployed Data Scientist II

San Francisco, California

•

Today

At Braze, we have found our people. We're a genuinely approachable, exceptionally kind, and intensely passionate crew. We seek to ignite that passion by setting high standards, championing teamwork, and creating work-life harmony as we collectively navigate rapid growth on a global scale while striving for greater equity and opportunity - inside and outside our organization. To flourish here, you must be prepared to set a high bar for yourself and those around you. There is always a way to con

Full-time

USD 98,000.00 - 164,000.00 per year

Search all similar jobs