Epoch AI

Researcher, Benchmark Reviews

Remote / Full Time

Role brief

What this role is asking for.

Epoch AI is looking for a Researcher to develop and publish critiques and reviews of AI benchmarks. About the role We are looking for a Researcher to produce a steady stream of benchmark reviews. You will closely analyze a wide variety of new benchmarks, evaluate their methodologies, and write up your findings in public-facing research. You should be comfortable using coding agents to help you, without delegating your judgment. Examples of the kind of reports you would produce include our reviews of SWE-bench Verified, OSWorld, and economic value benchmarks. This role is fully remote; we are able to hire in many countries. We invite anyone who is interested to apply, regardless of background, experience, or credentials. Please do not include a cover letter, photograph, or headshot of yourself, or any personal information that is not relevant to the role for which you're applying (including marital status, age, identity traits, etc.). If this role sounds interesting, we are also looking for researchers on multiple other teams. Applications are rolling.

Company role signals

Epoch AI role signals.

Repeated tags across 8 active roles show the current hiring pattern.