ML Data Engineer IV - onsite in Redmond, WA

Overview

On Site
BASED ON EXPERIENCE
Full Time
Contract - W2
Contract - Independent
Contract - 12+ mo(s)

Skills

DATA
ENGINEER

Job Details

Job Title - ML Data Engineer IV
Location - Onsite in Redmond, WA
Contract Duration - 12 months contract
Pay Range - $90-95/hour on W2 (DOE)


We are looking for a contract ML Data Engineer to work at the intersection of data engineering and applied machine learning. You will be helping to process and transform our complex multimedia data into complete machine learning datasets suitable for consumption by researchers. We require someone who can take a hands-on approach to building data-processing pipelines, ensuring the data is robust and well-prepared for our machine learning workflows.

Responsibilities
  • Design, develop, and maintain scalable data-processing pipelines for large volumes of multimedia (audio, video) and sensor data (e.g. IMU), ensuring reliability and reproducibility.
  • Gather and interpret processing requirements from stakeholders, translating them into practical technical solutions and devising Client approaches where needed.
  • Perform diverse data-processing operations, from mathematical transformations and filtering to feature extraction, synchronisation, and inference through ML models.
  • Interface with various internal tooling at Meta such as dataset management systems and training frameworks to prepare raw data for machine learning, including validation, transformation, and quality assurance.
  • Collaborate with machine learning researchers to integrate research prototypes into production pipelines.
  • Ensure compliance with data governance, security, and relevant standards.

Minimum requirements
  • Bachelor s degree in a relevant technical field (e.g. Computer Science, Data Science) with 3+ years of industry experience in machine learning or data engineering; or equivalent combination of education and experience.
  • Demonstrable programming experience in Python using common ML and data libraries, i.e. numpy, scipy, pandas.
  • Proficiency in Linux and shell scripting.
  • Working knowledge of audio, image and video formats.

Preferred experience
  • Experience using PyTorch or other Python machine-learning frameworks.
  • Experience with relational and graph / NoSQL databases.
  • Experience using REST APIs for data interactions.
  • Experience working in a research environment.
  • Strong mathematical background.

Top must-have HARD skills:
  • Strong knowledge of Python in the context of data engineering and data processing (SQL, data cleaning, anomaly detection)
  • Working understanding of ML training - specifically how data quality impacts ML training outcomes from PyTorch workflow

Good to have skills:
  • Knowledge of multimodal data sets (Audio/Video, Optitrack, multi-Camera/Sensor)
  • Beginner knowledge of audio (DSP, acoustics)


Russell Tobin offers eligible employee s comprehensive healthcare coverage (medical, dental, and vision plans), supplemental coverage (accident insurance, critical illness insurance and hospital indemnity), 401(k)-retirement savings, life & disability insurance, an employee assistance program, legal support, auto, home insurance, pet insurance and employee discounts with preferred vendors.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.