evald.ai Sources

METR Blog

Time Horizon 1.1

We’re releasing a new version of our time horizon estimates (TH1.1), using more tasks and new eval infrastructure.

METR Blog

Early work on monitorability evaluations

We show preliminary results on a prototype evaluation that tests monitors' ability to catch AI agents doing side tasks, and AI agents' ability to bypass this monitoring.

METR Blog

Clarifying limitations of time horizon

Thomas Kwa responds to some misinterpretations of our time horizon work and explains its limitations and core finding.