Senior Software Engineer: Agentic Evaluation

Cupertino, CA, US • Posted 7 days ago • Updated 3 days ago
Full Time
On-site

Job Details

Skills

  • Systems Engineering
  • C++
  • Objective-C
  • Scripting
  • Communication
  • Collaboration
  • Computer Science
  • Evaluation
  • Machine Learning (ML)
  • Provisioning
  • Recovery
  • Continuous Integration and Development
  • Continuous Integration
  • Python
  • Swift

Summary

At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish.

Do you want to help measure and improve the quality of Siri across the devices, features, and experiences people rely on every day? Apple's Agentic Evaluation Engineering organization builds the infrastructure that determines how Siri's quality is measured, trusted, and improved. You'll join a team focused on expanding what that platform can reach: the devices and environments we evaluate on, the features and interaction modalities we exercise, and the realistic, repeatable conditions we stage to ground each evaluation. The surface area is large and growing. You'll have real autonomy in how you tackle it, and you'll build infrastructure the team can rely on as priorities shift.

In this role you'll contribute to the infrastructure, tooling, and pipelines that let us evaluate Siri reliably and at scale. You'll have meaningful autonomy in how you get there, and the work will move across several areas of expansion as priorities evolve. The specific platforms, frameworks, and components will change over time, so we're looking for someone who can transition smoothly across them and bring strong evaluation and systems engineering fundamentals to whatever the team needs next.

  • Strong programming skills in one or more compiled languages (Swift, C++ or Objective-C)
  • Python scripting skills for tooling and automation
  • Solid understanding of computer science fundamentals
  • Ability to quickly learn new technologies and adapt to evolving requirements
  • Excellent communication skills and ability to collaborate across teams
  • M.S. or B.S. in Computer Science, Machine Learning, or related field (or equivalent experience)

  • Experience staging, provisioning, or controlling test or evaluation environments to produce repeatable, deterministic conditions
  • Experience evaluating ML, LLM or agent-based systems, including familiarity with metrics, scoring methodology, or trajectory and outcome analysis
  • Experience designing or operating test infrastructure at scale, such as device provisioning, environment restore, warm pools, or continuous integration systems
  • Proficiency with Python and Swift in a production setting
  • A track record of approaching problems flexibly and cutting through ambiguity, adapting your approach to reach the right outcome and setting a clear path when requirements are not yet defined
  • A talent for focusing and simplifying, stripping away what is not essential and distilling complex decisions down to the factors that matter
  • A history of collaborating across teams and communicating effectively with both technical and program audiences

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 90733111
  • Position Id: 5ac09aa13bb69f0cfd3274b4291419a