Loading...

Freelance Agent Evaluation Engineer

  • Full Time
  • Anywhere

Mindrift

Freelance Agent Evaluation Engineer needed to create challenging tasks and evaluation criteria for AI coding agents, focusing on real-world developer tasks and realistic simulated environments.

Requirements

  • Degree in Computer Science, Software Engineering, or related fields
  • 5+ years in software development, primarily Python (FastAPI, pytest, async/await, subprocess, file operations)
  • Background in full-stack development, with experience building React-based interfaces (JavaScript/TypeScript) and robust back-end systems
  • Experience writing tests (functional, integration — not just running them)
  • Docker containers, and familiarity with infrastructure tools (Postgres, Kafka, Redis)
  • CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results)
  • English proficiency – B2

Benefits

  • Up to $45 per hour equivalent
  • Flexible project-based work

Originally posted on Himalayas

To apply for this job please visit himalayas.app.

Keep exploring on Get A Job.ai

Not quite the right fit? Your next opportunity is a click away.

Hiring instead? Post a job and reach candidates searching right now.