San Francisco, California • Today
Description

## **LLM Evaluation Analyst**

About the Role

MUST HAVE 3-4 YEARS OF PLAYWRIGHT EXPERIENCE AND A STRONG UNDERSTANDING OF LLMS

We are seeking **3 Evaluation Analysts** to assess the performance of AI models tasked with implementing web features. Your work directly informs whether AI-generated code is correct, whether the instructions given to the models are clear, and whether the testing frameworks used to evaluate them are fair and reliable.

Core Responsibilities

You will analyz
Full-time
USD 35.00 - 50.00 per hour