Epoch AI
Researcher, Benchmark Reviews
Remote / Full Time
Role brief
What this role is asking for.
Epoch AI is looking for a Researcher to develop and publish critiques and reviews of AI benchmarks. About the role We are looking for a Researcher to produce a steady stream of benchmark reviews. You will closely analyze a wide variety of new benchmarks, evaluate their methodologies, and write up your findings in public-facing research. You should be comfortable using coding agents to help you, without delegating your judgment. Examples of the kind of reports you would produce include our reviews of SWE-bench Verified, OSWorld, and economic value benchmarks. This role is fully remote; we are able to hire in many countries. We invite anyone who is interested to apply, regardless of background, experience, or credentials. Please do not include a cover letter, photograph, or headshot of yourself, or any personal information that is not relevant to the role for which you're applying (including marital status, age, identity traits, etc.). If this role sounds interesting, we are also looking for researchers on multiple other teams. Applications are rolling.
Company role signals