Apply Now

Machine Learning Engineer - LLM Evaluation & Automation

Seattle, WA, US • Posted 8 hours ago • Updated 8 hours ago

Full Time

On-site

USD $60.00 - 70.00 per hour

TEKsystems c/o Allegis Group

Fitment

Dice Job Match Score™

🫥 Flibbertigibetting...

Job Details

Skills

Product Engineering
Research
Optimization
ROOT
FOCUS
Workflow
KPI
Systems Design
PySpark
Data Processing
Dashboard
Visualization
Python
SQL
Natural Language Processing
Evaluation
Prompt Engineering
Machine Learning (ML)
Insurance
Taxes
Life Insurance
Partnership
Collaboration
Business Transformation
Law
Sourcing
Screening
Recruiting
Artificial Intelligence

Summary

Overview:
We are seeking a Machine Learning Engineer to join a high-impact team focused on advancing LLM evaluation, NLP, and AI-driven automation. This role centers on designing scalable evaluation frameworks, optimizing prompt strategies, and building systems that ensure high-quality, consistent model outputs across product domains. You will partner closely with product, engineering, and research teams to drive measurable improvements in AI performance. This is a hands-on role with a strong emphasis on LLM evaluation systems, prompt engineering, and data-driven model optimization.

Job Details:

Location: Seattle, WA (Hybrid with 3 days a week onsite)
Pay Rate: $60-70 hr/w2
Job Type: Contract
Contract Length: 6 months
Experience Level: Mid-level to Senior

Key Responsibilities:

Design and build LLM-based evaluation frameworks, including automated scoring pipelines and rubric-based grading systems
Build and maintain data pipelines for evaluation datasets using Python, SQL, and scalable processing tools
Translate complex evaluation results into clear, actionable insights for technical and non-technical stakeholders
Implement automation workflows and agentic evaluation systems to improve efficiency and reduce manual efforts
Develop prompt engineering strategies to evaluate output quality, accuracy, and consistency
Create and maintain metrics, KPIs, and dashboards to track and communicate model performance
Conduct error analysis, root-cause investigations, and quality deep dives to guide model improvements
Partner cross-functionally to define evaluation methodologies and integrate them into production workflows

Must-Have Qualifications:

5+ years of experience in ML engineering, NLP, or AI/ML automation
Strong programming skills in Python and SQL
Deep understanding of machine learning concepts with a focus on NLP and advanced LLM capabilities (e.g., Chain-of-Thought, agentic workflows)
Experience working with large-scale datasets and data pipelines
Strong experience with LLM evaluation, prompt engineering, or auto grading systems
Experience developing metrics and KPIs to measure model output quality and consistency

Nice-to-Have:

Experience with LLM-as-judge systems or human + model evaluation frameworks
Background in inter-rater reliability, evaluation calibration, or judged systems design
Experience with PySpark or distributed data processing tools
Exposure to building dashboards or visualization tools for model performance tracking

Technical Skills
Python, SQL, NLP, LLM Evaluation, Prompt Engineering, Machine Learning, Data Pipelines, Automation Systems
NOTE: This posting is for an existing vacancy.
We reserve the right to pay above or below the posted wage based on factors unrelated to sex, race, or any other protected
classification. Eligibility requirements apply to some benefits
and may depend on your job classification and length of employment. Benefits are subject to change and may be subject to specific elections, plan, or program terms. This temporary role may be eligible for the following:

Medical, dental & vision
401(k)/Roth
Insurance (Basic/Supplemental Life & AD&D)
Short and long-term disability
Health and Dependent Care Spending Accounts (HAS & DCFSA)
Transportation benefits
Employee Assistance Program
Time off/Leave (PTO, Vacation, or Sick Leave)

Job Type & Location
This is a Contract position based out of Seattle, WA.
Pay and Benefits
The pay range for this position is $60.00 - $70.00/hr.
Eligibility requirements apply to some benefits and may depend on your job classification and length of employment. Benefits are subject to change and may be subject to specific elections, plan, or program terms. If eligible, the benefits available for this temporary role may include the following:
Medical, dental & vision
Critical Illness, Accident, and Hospital
401(k) Retirement Plan - Pre-tax and Roth post-tax contributions available
Life Insurance (Voluntary Life & AD&D for the employee and dependents)
Short and long-term disability
Health Spending Account (HSA)
Transportation benefits
Employee Assistance Program
Time Off/Leave (PTO, Vacation or Sick Leave)
Workplace Type
This is a fully remote position.
Application Deadline
This position is anticipated to close on Jun 3, 2026.

About TEKsystems

We're partners in transformation. We help clients activate ideas and solutions to take advantage of a new world of opportunity. We are a team of 80,000 strong, working with over 6,000 clients, including 80% of the Fortune 500, across North America, Europe and Asia. As an industry leader in Full-Stack Technology Services, Talent Services, and real-world application, we work with progressive leaders to drive change. That's the power of true partnership. TEKsystems is an Allegis Group company.

The company is an equal opportunity employer and will consider all applications without regards to race, sex, age, color, religion, national origin, veteran status, disability, sexual orientation, gender identity, genetic information or any characteristic protected by law.

About TEKsystems and TEKsystems Global Services

We're a leading provider of business and technology services. We accelerate business transformation for our customers. Our expertise in strategy, design, execution and operations unlocks business value through a range of solutions. We're a team of 80,000 strong, working with over 6,000 customers, including 80% of the Fortune 500 across North America, Europe and Asia, who partner with us for our scale, full-stack capabilities and speed. We're strategic thinkers, hands-on collaborators, helping customers capitalize on change and master the momentum of technology. We're building tomorrow by delivering business outcomes and making positive impacts in our global communities. TEKsystems and TEKsystems Global Services are Allegis Group companies. Learn more at TEKsystems.com.

The company is an equal opportunity employer and will consider all applications without regard to race, sex, age, color, religion, national origin, veteran status, disability, sexual orientation, gender identity, genetic information or any characteristic protected by law.

San Francisco Fair Chance Ordinance: Pursuant to the San Francisco Fair Chance Ordinance, for all positions located in the city and county of San Francisco, we will consider for employment qualified applicants with arrest and conviction records.

Massachusetts Lie Detector: It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.

Use of Artificial Intelligence (AI): We may use Artificial Intelligence (AI) to support parts of our hiring process, including sourcing, screening, and evaluating candidates. AI helps assess applications and qualifications, but final decisions are made by our hiring team. By applying, you acknowledge and agree that your application may be reviewed using AI tools.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 101054TS
Position Id: JP-006039998
Posted 8 hours ago

Company Info

About TEKsystems c/o Allegis Group

We're partners in transformation. We help clients activate ideas and solutions to take advantage of a new world of opportunity. We are a team of 80,000 strong, working with over 6,000 clients, including 80% of the Fortune 500, across North America, Europe and Asia. As an industry leader in strategy, implementation and talent, we work with progressive leaders who drive change. That s the power of true partnership. TEKsystems is an Allegis Group company.

Go to company profile

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Full Stack Developer

Bellevue, Washington

•

Today

Description Onsite 5x a week at Bellevue, WA location. Can go down to 4x a week after building trust with the manager and team Role summary We're looking for a Full Stack Developer to build end-to-end systems around ML workflows, with a focus on enabling LLM/model training and fine-tuning. This is a full-stack ML engineering role: you'll work across data (SQL + performance fundamentals), backend services in Python, ML training workflows, and TypeScript UIs that help researchers and engineers run

Full-time

USD 70.00 - 90.00 per hour

Full Stack Developer

Bellevue, Washington

•

Today

Full-time

USD 70.00 - 90.00 per hour

Full Stack Developer

Bellevue, Washington

•

Today

Full Stack ML Engineer (Onsite - Bellevue, WA) Build the systems that power large-scale machine learning training. We're partnering with a highly respected AI research and engineering organization to hire a Full Stack Engineer focused on building end-to-end systems that support machine learning model training, experimentation, and evaluation - including work on large language models. This role sits at the intersection of software engineering and applied ML. You'll design and ship internal tools

Full-time

USD 70.00 - 90.00 per hour

Data Scientist

Seattle, Washington

•

Today

Overview We are seeking a Data Scientist to join a high-impact team focused on using data to inform product decisions and improve customer experience at scale. This role centers on experimentation, causal analysis, and predictive modeling, partnering closely with product and engineering teams to uncover insights and drive measurable business outcomes. This is a hands-on role with a strong emphasis on experimentation frameworks, causal analysis, and data-driven product decision-making. Job Deta

Full-time

USD 60.00 - 65.00 per hour

Search all similar jobs