Job Description
We are sharing a specialised part-time consulting opportunity for professors, PhD students, and advanced academic researchers experienced in domain-specific problem design, Python-based evaluation, benchmark task development, and structured reasoning assessment.
This role supports current and upcoming remote consulting opportunities focused on academic benchmark task design, Python-based evaluation workflows, domain-specific problem development, golden solution preparation, model behavior analysis, and high-quality project execution. Selected professionals will apply their academic expertise to create challenging real-world tasks, define precise expected outputs, develop executable tests, and evaluate reasoning or problem-solving performance across advanced subject areas.
Key Responsibilities
Professionals in this role may contribute to:
Academic Task Design & Development
- ...
Ready to Apply?
Submit your application today and join our talented team at 24-MAG.
Submit ApplicationJob Details
- Location New York, New York
- Job Type Full-time
- Category computer-and-mathematical
- Posted Date June 20, 2026
- Application Deadline July 30, 2026