Autonomy Evaluation Resources
A collection of resources for evaluating potentially dangerous autonomous capabilities of frontier models.
A collection of resources for evaluating potentially dangerous autonomous capabilities of frontier models.