Apply Now

Software Engineer - Machine Learning III

Mountain View, CA, US • Posted 1 hour ago • Updated 1 hour ago

Contract W2

Contract Corp To Corp

No Travel Required

On-site

$110 - $110/hr

Fitment

Dice Job Match Score™

🎯 Assessing qualifications...

Job Details

Skills

LLM safety & alignment
RLHF
DPO
RLAIF
jailbreak/prompt injection defense
Production ML engineering
Python + PyTorch
deployment infrastructure
evaluation pipelines
Agentic AI security
mobile/XR deployment
hybrid cloud-device inference
publications/patents/open source
advanced safety modeling

Summary

Job Title: Software Engineer - Machine Learning III

Duties: Machine Learning Engineer, Prompt Safety & Agent Security Lab Summary The Developer Quality Innovation Lab at Samsung Research America builds the automation and tooling that powers data acquisition, safety, and evaluation for Samsung's mobile platform products. Our systems collect, curate, augment data and develop intelligent solution to protect models that fuels the foundation models and AI features shipping across Galaxy devices — and operate the evaluation pipelines that gate their quality before and after launch. We work closely with modeling, device, and product teams to close the loop from on-device signals and user feedback back into training data, faster and at higher quality.

Position Summary

We are looking for an experienced Machine Learning Engineer to lead the development of prompt injection and prompt safety models that protect Samsung's downstream agentic AI systems across phone, cloud, and XR/AR. You will design, train, and deploy classifier and guardrail models (both cloud-based and hybrid on-device) that screen agent inputs and outputs for injection attacks, unsafe content, and policy violations. A core part of the role is post-training these models with RLHF, DPO, and related optimization techniques to push detection accuracy and false-positive rates beyond what off-the-shelf solutions provide.

Role and Responsibilities

1. Design and train prompt injection detection models and prompt safety classifiers that operate on both inputs to and outputs from Samsung's agentic AI systems.

1. Build hybrid deployment pipelines that split safety inference between on-device (phone, XR/AR) and cloud, optimizing for latency, privacy, and detection coverage.

1. Apply post-training techniques (e.g. RLHF, reward modeling, policy optimization) to optimize guardrail model performance, calibration, and robustness against adaptive adversaries.

1. Curate and generate adversarial training data: direct and indirect prompt injections, jailbreaks, tool-use exploits, and unsafe-output cases drawn from red-teaming and production signals.

1. Build evaluation harnesses that measure attack success rate, false-positive rate, latency, and on-device footprint across model iterations and threat categories.

1. Partner with agent, device, and platform teams to integrate safety models into mobile-use agents, XR/AR assistants, and cloud agentic workflows, and to close the loop from production incidents back into training data.

1. Work cross-functionally with security researchers, modeling teams, and product engineers; document methods and, where appropriate, contribute to patents and publications.

Required Qualifications

1. M.S. or Ph.D. in Computer Science, Machine Learning, Electrical Engineering, or a related field; or B.S. with equivalent industry experience.

1. 3+ years of industry experience in ML engineering or applied AI research, with demonstrated ownership of production ML systems.

1. 2+ years of industry experience in software engineering.

1. Strong proficiency in Python and PyTorch (or JAX/TensorFlow), with solid software engineering fundamentals (version control, testing, and reproducible experimentation).

1. Hands-on experience post-training LLMs with RLHF, DPO, RLAIF, or reward modeling including reward design, preference data curation, and training stability.

1. Hands-on experience training and deploying classifier or guardrail models for safety, content moderation, abuse detection, or adversarial robustness.

1. Familiarity with prompt injection, jailbreak, and agentic AI threat models, and with distributed training frameworks (DeepSpeed, FSDP, Accelerate).

Preferred Qualifications

1. Experience building safety or moderation systems for agentic AI: tool-use guardrails, indirect prompt injection defenses, or output filtering for autonomous agents.

1. Experience with red-teaming, adversarial data generation, or automated attack pipelines (e.g., GCG,

Skills: PAIR, generator–critic frameworks). 1. Experience with on-device or edge ML deployment (ExecuTorch, Core ML, TFLite, MLC-LLM, vendor NPU toolchains) and model compression (quantization, distillation, pruning) for safety models. 1. Experience with telemetry, logging, or user-facing data systems on mobile, XR/AR, or consumer platforms, including privacy-preserving handling of user data (e.g., anonymization, on-device processing, federated approaches). 1. Publications at top-tier ML/NLP/security venues (NeurIPS, ICML, ICLR, ACL, EMNLP, USENIX Security, IEEE S&P), patents, or open-source contributions in the safety, alignment, or AI security space.

Keywords:

Education:

Skills and Experience:

Required Skills:

MACHINE LEARNING ENGINEERING

APPLIED AI RESEARCH

SOFTWARE ENGINEERING

PYTHON

PYTORCH

Additional Skills:

JAX

TENSORFLOW

VERSION CONTROL

TESTING

REPRODUCIBLE EXPERIMENTATION

POST-TRAINING LLMS

RLHF

DPO

RLAIF

REWARD MODELING

REWARD DESIGN

PREFERENCE DATA CURATION

TRAINING STABILITY

CLASSIFIER TRAINING

GUARDRAIL MODEL TRAINING

SAFETY MODEL DEPLOYMENT

CONTENT MODERATION

ABUSE DETECTION

ADVERSARIAL ROBUSTNESS

PROMPT INJECTION DETECTION

JAILBREAK DETECTION

AGENTIC AI THREAT MODELING

DISTRIBUTED TRAINING FRAMEWORKS

DEEPSPEED

FSDP

ACCELERATE

SAFETY SYSTEM DEVELOPMENT FOR AGENTIC AI TOOL-USE GUARDRAILS INDIRECT PROMPT INJECTION DEFENSES OUTPUT FILTERING FOR AUTONOMOUS AGENTS RED-TEAMING ADVERSARIAL DATA GENERATION AUTOMATED ATTACK PIPELINES GCG PAIR

Languages:

English

Read

Write

Speak

Minimum Degree Required: Master's Degree

Patents: No

Publications: No

Veteran Status: No

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 10110849
Position Id: 1655-10990-2865
Posted 1 hour ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Hybrid in Palo Alto, California

•

12d ago

Job Title: AI Engineer (Applied AI / Machine Learning) Location: In person in Palo Alto CA only local Rate: As per Market Standard The Opportunity We are seeking an AI Engineer to help bring machine learning and generative AI capabilities into real-world products and platforms. You will work at the intersection of data, models, and systems to deliver scalable, production-ready AI solutions. Role Summary Develop and operationalize machine learning and generative AI solutions, with a focus on mod

Easy Apply

Contract, Third Party

Depends on Experience

AI Developer

Palo Alto, California

•

13d ago

Job Title: AI Developer Location: Palo Alto, CA - Onsite Duration: 12 Months Contract About the Role We are seeking an experienced AI Developer / Machine Learning Engineer to join our client, in Palo Alto. This role focuses on building intelligent systems that enhance data security, automation, and analytics capabilities across enterprise platforms. Key Responsibilities Design, develop, and deploy AI/ML models for real-world enterprise use casesBuild and optimize machine learning pipelines for

Easy Apply

Contract

Depends on Experience

AI Developer/Engineer

Palo Alto, California

•

20d ago

Job Title: AI Developer/Engineer Job location: Palo Alto, CA Duration: 12 Months Contract Summary: We are seeking an AI Engineer to join our dynamic team and contribute to the development and enhancement of our AI-driven platforms. The ideal candidate will possess deep technical expertise in machine learning and artificial intelligence, with a proven track record of developing scalable AI solutions. Your role will involve everything from data analysis and model building to integration and deplo

Easy Apply

Contract

75 - 80

Senior MLOps Engineer

Cupertino, California

•

Today

Hello, My name is Sreeja and I represent TestingXperts Inc. TestingXperts is a Specialist QA & Software Testing Company, and an Independent Software Testing division of Damco Group, which is a leading IT Solutions and Services company working with Fortune Enterprises globally. Inheriting the virtues of job quality and optimal user satisfaction from Damco Group, TestingXperts aims at promoting the ethics of connected innovation, thereby seeding the integral values in our employees and achieving u

Easy Apply

Third Party, Contract

Search all similar jobs

Software Engineer - Machine Learning III

Dice Job Match Score™

Job Details

Skills

Summary

Similar Jobs