METR Blog

METR Blog May 11, 2026 07:00

Measuring the Self-Reported Impact of Early-2026 AI on Technical Worker Productivity

A survey of 349 technical workers finds a median 1.4–2x self-reported change in value of work due to AI tools, expected to grow over time, though there are reasons to be skeptical of the magnitude.

METR Blog May 08, 2026 07:00

Review of the "Risks from automated R&D" section in the Anthropic Risk Report (February 2026)

External review from METR of the "Risks from automated R&D" section in Anthropic's February 2026 Risk Report

Safety Evals

Anthropic, Safety Evals

METR Blog May 08, 2026 07:00

Task Substitution and Uplift

We distinguish three measures of AI uplift (on old tasks, on new tasks, and in value) and show that task substitution can cause these to diverge substantially.

METR Blog April 21, 2026 07:00

Evidence on AI R&D Progress from NanoGPT

Classifying human and agent contributions to the NanoGPT speedrun, and what publicly tracked challenges can tell us about AI R&D acceleration.

METR Blog April 10, 2026 07:00

MirrorCode: Evidence that AI can already do some weeks-long coding tasks

This is a linkpost for MirrorCode, a project that METR funded and co-developed with Epoch AI. See Epoch AI’s blog post for more detail: https://epoch.ai/blog/mirrorcode-preliminary-results/

METR Blog April 01, 2026 07:00

Fine-tuning experiments on CoT controllability

We find that a small amount of fine-tuning on instruction following in the CoT generalizes to meaningful increases in CoT controllability on an out-of-distribution set of tasks. We fine-tune four reasoning models on small datasets of instruction-following...

METR Blog March 26, 2026 07:00

Red-Teaming Anthropic's Internal Agent Monitoring Systems

A METR staff member spent three weeks red-teaming a subset of Anthropic's internal agent monitoring and security systems, discovering several novel vulnerabilities.

Anthropic

METR Blog March 20, 2026 07:00

Impact of modelling assumptions on time horizon results

Alexander Barry examines how different modelling choices affect METR's time horizon estimates.

METR Blog March 19, 2026 07:00

We spent 2 hours working in the future

Thomas Kwa describes a tabletop exercise where METR researchers simulated having access to ~200-hour time horizon AIs.

METR Blog March 12, 2026 07:00

Review of the Anthropic Sabotage Risk Report: Claude Opus 4.6

External review from METR of Anthropic's Sabotage Risk Report for Claude Opus 4.6

Safety Evals

Anthropic, Safety Evals, Claude, Claude Opus

METR Blog March 10, 2026 07:00

Many SWE-bench-Passing PRs Would Not Be Merged into Main

We find that roughly half of test-passing SWE-bench Verified PRs written by recent AI agents would not be merged into main by repo maintainers. A naive interpretation of benchmark scores may lead one to overestimate how useful agents are without more...

Benchmarks

METR Blog March 03, 2026 08:00

Observations from two CLI game reimplementation runs with Opus 4.6

Nikola Jurkovic describes observations from tasking Opus 4.6 with reimplementing Slay the Spire and Balatro in the CLI.