ML Engineers Needed for New AI R&D Evals Project
METR is hiring ML engineers and researchers.
Community feed
A focused stream of recent stories from the sources curated for this community. Latest: ML Engineers Needed for New AI R&D Evals Project, Introducing the Open Arabic LLM Leaderboard, and Introducing the Open Leaderboard for Hebrew LLMs!. Page 25.
METR is hiring ML engineers and researchers.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Emma moves from President to Executive Director, Beth moves to Head of Research.
Amazon Bedrock model evaluation is now generally available Amazon Web Services (AWS)
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
A collection of resources for evaluating potentially dangerous autonomous capabilities of frontier models.
An example protocol for the whole evaluation process, based on our task suite, elicitation protocol, and scoring methods.
Contribute to METR/public-tasks development by creating an account on GitHub.
More stories load automatically as you scroll.