Full Time
Anywhere
Posted 1 month ago

Gramian Consulting Group

Gramian Consultancy is seeking an AI Evaluation Engineer to design benchmark tasks for complex data analysis workflows. The ideal candidate has 5+ years of experience in data analysis and strong proficiency in Python and SQL.

Requirements

5+ years of experience in data analysis or analytics-heavy roles
Strong proficiency in Python (pandas, NumPy) and SQL
Experience working with real-world, messy datasets (CSV, JSON, logs, reports)
Ability to design analytical problems with clear, verifiable answers
Solid understanding of statistics (distributions, correlations, outliers)
Familiarity with AI benchmarks or evaluation environments (e.g., SWE-bench or similar)
Hands-on experience with Docker (Dockerfiles, image builds, debugging)

Benefits

Flexible work arrangements

Originally posted on Himalayas

To apply for this job please visit himalayas.app.

Keep exploring on Get A Job.ai

Not quite the right fit? Your next opportunity is a click away.

Browse all jobs
More jobs by category
Remote jobs you can do from anywhere
Research typical pay for this role
Set a job alert so new matches reach you first
Upload your resume to apply faster

Hiring instead? Post a job and reach candidates searching right now.

Get A Job.ai

AI Evaluation Engineer (Data Analysis & Multi-Agent Systems)

Requirements

Benefits

Keep exploring on Get A Job.ai

Lead Product Manager, Safety

Oracle Record to Report Lead

Data Analyst (F/H)

Board Advisor (Volunteer)

Accounts Receivable Specialist – Freelance, Remote

People Operations Specialist – Contracts & HR Administration

Senior Information Security Engineer

Senior Windows Administator / Platform Engineer IV

Community Manager (Senior Level Considered)

Growth Marketing Lead | $125K-$150K USD + Bonus + Equity + Remote | Award Winnin