Evaluation Engineer (AI Models)- Full Time Role

New York, NY, US • Posted 11 hours ago • Updated 11 hours ago
Full Time
No Travel Required
On-site
Depends on Experience
Company Branding Image
Fitment

Dice Job Match Score™

🔗 Matching skills to job...

Job Details

Skills

  • API QA
  • Artificial Intelligence
  • Generative Artificial Intelligence (AI)
  • Testing
  • Quality Assurance
  • Test Cases

Summary

Role: Evaluation Engineer (AI Models)

Location: New York City, NY / Fort Mill, SC / San Diego, CA

Duration: Full Time

 

Role Overview

We are seeking an experienced Evaluation Engineer – AI Models to join our growing AI and Digital Engineering team. The ideal candidate will have a strong background in Quality Engineering and hands-on experience evaluating AI/ML and Generative AI model performance across business and technical use cases.
This role requires a combination of analytical thinking, testing expertise, data-driven evaluation, and strong communication skills to collaborate effectively with engineering, product, and business stakeholders.

Key Responsibilities

  • Design, develop, and execute evaluation strategies for AI/ML and Generative AI models.
  • Validate model outputs for accuracy, relevance, consistency, hallucination detection, bias, safety, and performance.
  • Create automated and manual evaluation frameworks for LLM-based applications and AI systems.
  • Develop test cases, benchmarking approaches, and quality metrics for AI model validation.
  • Work closely with Data Scientists, Product Managers, and Engineering teams to improve model quality and reliability.
  • Analyze model behavior using structured and unstructured datasets.
  • Perform regression testing and continuous validation for model updates and releases.
  • Document evaluation findings, defects, risks, and recommendations clearly for technical and business audiences.
  • Support UAT and production validation activities for AI-enabled products and platforms.
  • Contribute to QA best practices, test automation strategies, and AI quality governance initiatives.

Required Qualifications

  • 10+ years of experience in Quality Assurance / Quality Engineering / Software Testing.
  • 2+ years of hands-on experience in AI model evaluation, Generative AI testing, or ML validation.
  • Strong understanding of AI/ML concepts, LLM behavior, prompt evaluation, and model testing methodologies.
  • Experience with API testing, test automation frameworks, and data validation techniques.
  • Familiarity with evaluation metrics such as precision, recall, accuracy, grounding, relevance, and hallucination detection.
  • Experience testing AI-powered applications, conversational AI, or GenAI platforms.
  • Strong analytical and problem-solving skills.
  • Excellent verbal and written communication skills.
  • Ability to work independently in a remote and cross-functional environment.

Preferred Skills

  • Experience with Python and AI/ML testing tools/frameworks.
  • Exposure to prompt engineering and Retrieval-Augmented Generation (RAG) validation.
  • Knowledge of cloud platforms such as AWS, Azure, or Google Cloud Platform.
  • Experience working in Agile/Scrum environments.
  • Financial Services or Wealth Management domain experience is a plus.

Education

Bachelor’s degree in Computer Science, Engineering, Information Systems, or related field preferred.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 91020323
  • Position Id: 8981695
  • Posted 11 hours ago

Company Info

About Visionary Innovative Technology Solutions

VITS provide staffing and recruitment services along with technology consulting to more than 50+ clients globally Our skilled & expertise professionals help clients to manage varying skill needs, skills gaps and changing staffing needs to encounter project deadlines. VITS staff augmentation services provide skilled resources which assist clients to develop, maintain, manage and support their applications. Our vigorous pursuit for excellence in hiring, delivery model, work ethics, and approach has enabled us to become a highly trusted & preferred recruitment solution provider.



Contact the job poster
Deepak Kumar

Deepak Kumar

Talent Acquisition Specialist @ Visionary Innovative Technology Solutions
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Jersey City, New Jersey

7d ago

Easy Apply

Contract, Third Party

Depends on Experience

Atlanta, Georgia

6d ago

Easy Apply

Contract, Third Party

Depends on Experience

Hybrid in Georgetown, Texas

9d ago

Easy Apply

Contract, Third Party

Depends on Experience

Search all similar jobs