A new GPT-OSS benchmark and DeepSeek R1 updates for latency-optimized reasoning - MLCommons

MLPerf Inference v6.0 introduces GPT-OSS 120B, a new open-weight LLM benchmark, plus a DeepSeek-R1 interactive scenario with support for speculative decoding.

MLCommons · MLCommons

Benchmarks

Open original