×
Register Here to Apply for Jobs or Post Jobs. X

Evaluation Scenario Writer - AI Agent Testing Specialist

Job in Town of Belgium, Belgium, Ozaukee County, Wisconsin, 53004, USA
Listing for: Mindrift
Full Time position
Listed on 2026-02-18
Job specializations:
  • Software Development
    Software Engineer, AI Engineer, Full Stack Developer
Salary/Wage Range or Industry Benchmark: 50 USD Hourly USD 50.00 HOUR
Job Description & How to Apply Below
Location: Town of Belgium

Please submit your CV in English and indicate your level of English proficiency.

Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.

What This Opportunity Involves
  • Review and refine realistic coding tasks based on provided production codebases with realistic scope, requirements and information sources
  • Write comprehensive functional tests that validate actual end-to-end behavior and edge-cases, not just superficial checks
  • Craft "fair but hard" challenges where the AI has all the context it needs, but has to work for it (information scattered across files and external sources, complex reasoning required)
  • Analyze AI failures to understand what the model struggles with vs. what it masters
  • Iterate based on feedback from expert QA reviewers who score your work on 7 quality criteria
What We Look For
  • Degree in Computer Science, Software Engineering or related fields
  • 5+ years in software development, primarily Python (pytest, async/await, subprocess, file operations)
  • Background in Full-Stack development, with an equal focus on building React-based interfaces and robust Back-end systems
  • Experience writing tests (functional, integration - not just running them)
  • Docker containers (running evaluations locally in containers)
  • CI/CD understanding (Git Hub Actions as a user: triggers, labels, reading results)
  • English proficiency - B2
How It Works

Apply → Pass qualification(s) → Join a project → Complete tasks → Get paid

Effort estimate

Tasks for this project are estimated to take 20 hours to complete, depending on complexity. This is an estimate and not a schedule requirement; you choose when and how to work. Tasks must be submitted by the deadline and meet the listed acceptance criteria to be accepted.

Payment
  • Paid contributions, with rates up to $50/hour*
  • Fixed project rate or individual rates, depending on the project
  • Some projects include incentive payments
  • Note:

    Rates vary based on expertise, skills assessment, location, project needs, and other factors. Higher rates may be offered to highly specialized experts. Lower rates may apply during onboarding or non-core project phases. Payment details are shared per project
#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary