Standardizing Generative AI Service Evaluation: An API-Centric Benchmarking Approach - MLCommons
MLPerf® Endpoints brings API-native benchmarking, Pareto curve visualizations, and rolling submissions to generative AI infrastructure evaluation.
MLCommons · David Kanter