Freelance Ai Evaluation Architect (Talca)

Reconocida empresa
📍 talca, región del maule, Chile 💼 Full-time 🕒 Posted June 30, 2026

Job Description

Empresa confidencial connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems.

Participation is project-based, not permanent employment.

What this opportunity involves

  • Build a dataset to evaluate AI coding agents – how well a model handles real-world developer tasks.
  • Create challenging tasks and evaluation criteria within realistic simulated environments, including:
  • Build virtual companies following a high-level plan – codebase, infrastructure, and context (conversations, documentation, tickets) that reflect a realistic environment with development history.
  • Assemble and calibrate tasks from intermediate states of the virtual company: craft the prompt, define evaluation criteria, and ensure the task is solvable and the evaluation is fair.
  • Design tasks set in isolated environments – emulations of a developer's workstation: a Linux mach...

Ready to Apply?

Submit your application today and join our talented team at Reconocida empresa.

Submit Application

Job Details

  • Location talca, región del maule
  • Job Type Full-time
  • Category Asistencia
  • Posted Date June 30, 2026
  • Application Deadline August 09, 2026