Loading...

Freelance Agent Evaluation Engineer

  • Full Time
  • Anywhere

Mindrift

We’re looking for a Freelance Agent Evaluation Engineer to build a dataset to evaluate AI coding agents. The ideal candidate should have experience in software development, test automation, and familiarity with infrastructure tools.

Requirements

  • Degree in Computer Science, Software Engineering, or related fields
  • 5+ years in software development, primarily Python (FastAPI, pytest, async/await, subprocess, file operations)
  • Background in full-stack development, with experience building React-based interfaces (JavaScript/TypeScript) and robust back-end systems
  • Experience writing tests (functional, integration — not just running them)
  • Docker containers, and familiarity with infrastructure tools (Postgres, Kafka, Redis)
  • CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results)
  • English proficiency – B2

Benefits

  • Project-based work, flexible schedule
  • Opportunity to work on challenging AI-related projects
  • Potential earnings up to $21 per hour equivalent

Originally posted on Himalayas

To apply for this job please visit himalayas.app.

About this role & career path

Working in Mexico

Mexico, officially the United Mexican States, is a country in North America. It is the northernmost country in Latin America and borders the United States of America to the north, and Guatemala and Belize to the southeast; while having maritime boundaries with the Pacific Ocean to the west, the Caribbean Sea to the southeast, and the Gulf of Mexico to the east. Mexico covers 1,972,550 km2, and is the thirteenth-largest country in the world by land area. With a population exceeding 134 million as of 2026, Mexico is the tenth-most populous country in the world and is home to the largest number o

    More jobs at Mindrift

    Keep exploring on Get A Job.ai

    Not quite the right fit? Your next opportunity is a click away.

    Hiring instead? Post a job and reach candidates searching right now.