Machine Learning Engineer

Mountain View, CA, US • Posted 1 day ago • Updated 1 day ago
Contract W2
Contract Independent
Contract Corp To Corp
12 Months
No Travel Required
Able to Sponsor
On-site
Depends on Experience
Fitment

Dice Job Match Score™

🫥 Flibbertigibetting...

Job Details

Skills

  • Prompt injecttion
  • Prompt injection defense
  • Machine learning
  • LLM
  • Python
  • Pytorch
  • safety
  • Guardrails
  • rlhf

Summary

Job Title: Machine Learning Engineer 
Only on W2 
Duration: 12 months
Location: Mountain View, CA (Local candidates Required)
 
Position Summary
We are looking for an experienced Machine Learning Engineer to lead the development of prompt injection and prompt safety models that protect Client''''s downstream agentic AI systems across phone, cloud, and XR/AR. You will design, train, and deploy classifier and guardrail models (both cloud-based and hybrid on-device) that screen agent inputs and outputs for injection attacks, unsafe content, and policy violations. A core part of the role is post-training these models with RLHF, DPO, and related optimization techniques to push detection accuracy and false-positive rates beyond what off-the-shelf solutions provide.
 
Role and Responsibilities
  • Design and train prompt injection detection models and prompt safety classifiers that operate on both inputs to and outputs from Samsung''''s agentic AI systems.
  • Build hybrid deployment pipelines that split safety inference between on-device (phone, XR/AR) and cloud, optimizing for latency, privacy, and detection coverage.
  • Apply post-training techniques (e.g. RLHF, reward modeling, policy optimization) to optimize guardrail model performance, calibration, and robustness against adaptive adversaries.
  • Curate and generate adversarial training data: direct and indirect prompt injections, jailbreaks, tool-use exploits, and unsafe-output cases drawn from red-teaming and production signals.
  • Build evaluation harnesses that measure attack success rate, false-positive rate, latency, and on-device footprint across model iterations and threat categories.
  • Partner with agent, device, and platform teams to integrate safety models into mobile-use agents, XR/AR assistants, and cloud agentic workflows, and to close the loop from production incidents back into training data.
  • Work cross-functionally with security researchers, modeling teams, and product engineers; document methods and, where appropriate, contribute to patents and publications.
 
Required Qualifications
  • M.S. or Ph.D. in Computer Science, Machine Learning, Electrical Engineering, or a related field; or B.S. with equivalent industry experience.
  • 3+ years of industry experience in ML engineering or applied AI research, with demonstrated ownership of production ML systems.
  • 2+ years of industry experience in software engineering.
  • Strong proficiency in Python and PyTorch (or JAX/TensorFlow), with solid software engineering fundamentals (version control, testing, and reproducible experimentation).
  • Hands-on experience post-training LLMs with RLHF, DPO, RLAIF, or reward modeling including reward design, preference data curation, and training stability.
  • Hands-on experience training and deploying classifier or guardrail models for safety, content moderation, abuse detection, or adversarial robustness.
  • Familiarity with prompt injection, jailbreak, and agentic AI threat models, and with distributed training frameworks (DeepSpeed, FSDP, Accelerate).
 
Preferred Qualifications:
  • Experience building safety or moderation systems for agentic AI: tool-use guardrails, indirect prompt injection defenses, or output filtering for autonomous agents.
  • Experience with red-teaming, adversarial data generation, or automated attack pipelines (e.g., GCG)
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10427688
  • Position Id: 8976416
  • Posted 1 day ago
Contact the job poster
Rahul Variampallil

Rahul Variampallil

Recruitement Manager @ Prospance Inc.
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Mountain View, California

Today

Easy Apply

Contract, Third Party

$110 - $110

Hybrid in Sunnyvale, California

3d ago

Easy Apply

Contract

Depends on Experience

San Jose, California

Yesterday

Easy Apply

Contract

Depends on Experience

Hybrid in Santa Clara, California

2d ago

Easy Apply

Contract

Depends on Experience

Search all similar jobs