Loading...

Freelance Agent Evaluation Engineer

  • Full Time
  • Anywhere

Mindrift

We’re building a dataset to evaluate AI coding agents by creating challenging tasks and evaluation criteria within realistic simulated environments. You’ll work on a part-time, non-permanent project, creating tasks for AI agents to evaluate and improve their coding abilities.

Requirements

  • Degree in Computer Science, Software Engineering, or related fields
  • 5+ years in software development, primarily Python
  • Background in full-stack development, with experience building React-based interfaces and robust back-end systems
  • Experience writing tests, familiarity with Docker containers, CI/CD tools, and infrastructure tools

Benefits

  • Opportunity to work on a challenging project, Flexible schedule, Compensation up to $45 per hour

Originally posted on Himalayas

To apply for this job please visit himalayas.app.

Keep exploring on Get A Job.ai

Not quite the right fit? Your next opportunity is a click away.

Hiring instead? Post a job and reach candidates searching right now.