Loading...

Freelance Agent Evaluation Engineer

  • Full Time
  • Anywhere

Mindrift

We’re building a dataset to evaluate AI coding agents by creating challenging tasks and evaluation criteria within realistic simulated environments. This part-time opportunity involves building virtual companies, assembling and calibrating tasks, and designing tests to evaluate AI agent performance.

Requirements

  • Degree in Computer Science, Software Engineering, or related fields
  • 5+ years in software development, primarily Python
  • Background in full-stack development with experience building React-based interfaces and robust back-end systems
  • Experience writing tests (functional, integration) and familiarity with Docker containers and infrastructure tools

Benefits

  • Flexible part-time work
  • Opportunity to work on challenging projects with leading tech companies
  • Variety of projects to choose from with different scope, complexity, and required expertise

Originally posted on Himalayas

To apply for this job please visit himalayas.app.

About this role & career path

Working in Germany

Germany, officially the Federal Republic of Germany, is a country in Western and Central Europe. It lies between the Baltic Sea and the North Sea to the north with the Alps to the south. Its sixteen constituent states have a total population of over 82 million, making it the most populous member state of the European Union (EU). Germany borders Denmark to the north; Poland and the Czech Republic to the east; Austria and Switzerland to the south; and France, Luxembourg, Belgium, and the Netherlands to the west. The nation's capital and most populous city is Berlin and its main financial centre

    More jobs at Mindrift

    Keep exploring on Get A Job.ai

    Not quite the right fit? Your next opportunity is a click away.

    Hiring instead? Post a job and reach candidates searching right now.