Job Description
We are building a benchmark dataset to evaluate AI models on professional document understanding and instruction following within the Business & Professional Services domain.
Tasks consist of complex, multi-step requests grounded in real-world workspace files (business plans, reports, presentations), web search, and code execution — each paired with a clearly defined ground truth output and an objective evaluation rubric. You will be responsible for authoring tasks that require rigorous business reasoning, precise instruction following, and well-structured professional outputs.
We expect a minimum commitment of 15–20 hours per week.
Ideal candidates have 3+ years of hands‑on experience in one or more of the following sub‑domains:
- Management consulting & business strategy
- Marketing & sales
- Human resources & organizational development
- Business operations & general management
Ready to Apply?
Submit your application today and join our talented team at Obsidian.
Job Details
- Location: Toronto, ON
- Job Type: Full-time
- Category: Other-General
- Posted Date: June 16, 2026
- Application Deadline: July 26, 2026