AI Safety Expert – Red Team

  • Contract
  • Anywhere

Mercor

About the job

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D’Angelo, Larry Summers, and Jack Dorsey.

Position: AI Safety Experts — English & Tamil
Type: Contract
Compensation: $20–$22/hour
Location: Remote

Role Responsibilities

  • Red team conversational AI models and agents by conducting jailbreaks, prompt injections, and bias exploitation.
  • Generate high-quality human data by annotating failures, classifying vulnerabilities, and flagging systemic risks.
  • Apply structure by following taxonomies, benchmarks, and playbooks to maintain consistent testing.
  • Document findings reproducibly by producing reports, datasets, and attack cases that customers can act on.
  • Collaborate on sensitive projects with clear guidelines and wellness resources.

Qualifications

Must-Have

  • Prior red teaming experience in AI adversarial work, cybersecurity, or socio-technical probing.
  • Native fluency in English and Tamil.
  • Strong communication skills to explain risks to technical and non-technical stakeholders.

Preferred

  • Experience in adversarial ML, cybersecurity, or socio-technical risk analysis.
  • A background that supports creative probing, such as psychology, acting, or writing.

Resources & Support

  • For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome
  • For any help or support, reach out to: support@mercor.com

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.

Originally posted on Himalayas

To apply for this job, please visit himalayas.app.
