Loading...

AI Evaluation Engineer (Data Analysis & Multi-Agent Systems)

  • Full Time
  • Anywhere

Gramian Consulting Group

Gramian Consultancy is seeking an AI Evaluation Engineer to design benchmark tasks for complex data analysis workflows. The ideal candidate has 5+ years of experience in data analysis and strong proficiency in Python and SQL.

Requirements

  • 5+ years of experience in data analysis or analytics-heavy roles
  • Strong proficiency in Python (pandas, NumPy) and SQL
  • Experience working with real-world, messy datasets (CSV, JSON, logs, reports)
  • Ability to design analytical problems with clear, verifiable answers
  • Solid understanding of statistics (distributions, correlations, outliers)
  • Familiarity with AI benchmarks or evaluation environments (e.g., SWE-bench or similar)
  • Hands-on experience with Docker (Dockerfiles, image builds, debugging)

Benefits

  • Flexible work arrangements

Originally posted on Himalayas

To apply for this job please visit himalayas.app.

Keep exploring on Get A Job.ai

Not quite the right fit? Your next opportunity is a click away.

Hiring instead? Post a job and reach candidates searching right now.