Job Evaluation Scenario Writer - AI Agent Testing Specialist, Québec

ID: 4080842 (job is in archives)

Mindrift
Up to $45 per hour

Summary information

Published: 2026-02-09
Valid until: 2026-02-24
Categories: Information Tech/Computer
Job type: full time
Gender: any
Company: Mindrift
City: Québec
Job from partner: adzuna.com

Please submit your CV in English and indicate your level of English proficiency.

Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.

What this opportunity involves

While each project involves unique tasks, contributors may:

  • Create structured test cases that simulate complex human workflows
  • Define gold-standard behavior and scoring logic to evaluate agent actions
  • Analyze agent logs, failure modes, and decision paths
  • Work with code repositories and test frameworks to validate your scenarios
  • Iterate on prompts, instructions, and test cases to improve clarity and difficulty
  • Ensure that scenarios are production-ready, easy to run, and reusable
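As a rough illustration of what a structured test case with gold-standard behavior and scoring logic might look like, here is a minimal Python sketch. The scenario fields (`gold_actions`, `forbidden_actions`), the action names, and the `score_run` function are all hypothetical; actual project formats and scoring rules are defined per project and may differ substantially.

```python
import json

# Hypothetical scenario description; every field name here is an assumption
# made for illustration, not a format taken from the listing.
SCENARIO_JSON = """
{
  "id": "refund-request-001",
  "instruction": "Refund order 1234 and notify the customer.",
  "gold_actions": ["lookup_order", "issue_refund", "send_email"],
  "forbidden_actions": ["delete_order"]
}
"""


def score_run(scenario: dict, agent_actions: list[str]) -> float:
    """Score an agent's action log against the scenario's gold standard.

    Returns 0.0 if any forbidden action appears; otherwise returns the
    fraction of gold actions that the agent performed in the expected order.
    """
    if any(action in scenario["forbidden_actions"] for action in agent_actions):
        return 0.0

    gold = scenario["gold_actions"]
    matched = 0
    for action in agent_actions:
        # Advance only when the next expected gold action is observed.
        if matched < len(gold) and action == gold[matched]:
            matched += 1
    return matched / len(gold)


scenario = json.loads(SCENARIO_JSON)
full_score = score_run(scenario, ["lookup_order", "issue_refund", "send_email"])
unsafe_score = score_run(scenario, ["lookup_order", "delete_order", "issue_refund"])
```

Keeping the scenario as data and the scoring as a pure function is one way to make a scenario easy to rerun and reuse; other designs, such as rubric-based or partial-credit scoring, are equally plausible.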

What we look for

This opportunity is a good fit for software engineers open to part-time, non-permanent projects. Ideally, contributors will have:

  • 3+ years of software development experience with a strong Python focus
  • Experience with Git and code repositories
  • Comfort with structured formats like JSON/YAML for scenario description
  • Understanding of core LLM limitations (hallucinations, bias, context limits) and how these affect evaluation design
  • Familiarity with Docker
  • English proficiency - B2

How it works

Apply → Pass qualification(s) → Join a project → Complete tasks → Get paid

Project time expectations

Tasks for this project are estimated to take 6-10 hours to complete, depending on complexity. This is an estimate and not a schedule requirement; you choose when and how to work. Tasks must be submitted by the deadline and meet the listed acceptance criteria to be accepted.

Payment

  • Paid contributions, with rates up to $45/hour*
  • Fixed project rate or individual rates, depending on the project
  • Some projects include incentive payments

*Note: Rates vary based on expertise, skills assessment, location, project needs, and other factors. Higher rates may be offered to highly specialized experts. Lower rates may apply during onboarding or non-core project phases. Payment details are shared per project.
