Freelance Agent Evaluation Engineer

Mindrift
📍 Canada, Quebec, Canada 💼 Part time 🕒 Posted June 04, 2026

Job Description

Please submit your CV in English and indicate your level of English proficiency.

Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.

What this opportunity involves 

We're building a dataset to evaluate AI coding agents - how well a model handles real-world developer tasks.

You'll create challenging tasks and evaluation criteria within realistic simulated environments:

  • Build realistic developer environments - a virtual company with codebase, infrastructure, and context (tickets, docs, conversations) that forms a believable development history
  • Design tasks from intermediate states of these environments - craft the prompt, define what solved means, and ensure the task is solvable by an AI agent
  • Write tests t...

Ready to Apply?

Submit your application today and join our talented team at Mindrift.

Submit Application

Job Details

  • Location Canada, Quebec
  • Job Type Part time
  • Category Computer Occupations
  • Posted Date June 04, 2026
  • Application Deadline July 14, 2026