An Introduction to AI Secure LLM Safety Leaderboard
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Community feed
A focused stream of recent stories from the sources curated for this community. Latest: An Introduction to AI Secure LLM Safety Leaderboard, A guide to setting up your own Hugging Face leaderboard: an end-to-end example with Vectara's hallucination leaderboard, and Bounty: Diverse hard tasks for LLM agents.
METR (formerly ARC Evals) is looking for (1) ideas, (2) detailed specifications, and (3) well-tested implementations for tasks to measure performance of autonomous LLM agents.
ARC Evals is wrapping up its incubation period at ARC and spinning off into its own standalone nonprofit.
To support the safety of highly-capable AI systems, we are developing our approach to catastrophic risk preparedness, including building a Preparedness team and launching a challenge.
We describe the basic components of Responsible Scaling Policies (RSPs) as well as why we find them promising for reducing catastrophic risks from AI.
ARC Evals plans to spin out from the Alignment Research Center (ARC) in the coming months and become its own standalone organization.
We have just released our first public report. It introduces methodology for assessing the capacity of LLM agents to acquire resources, create copies of themselves, and adapt to novel challenges they encounter in the wild.