CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Topic feed
Evaluation frameworks, graders, and AI testing infrastructure.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.