evald.ai · OpenAI Evaluation Filter BrowseComp: a benchmark for browsing agents April 10, 2025 10:00 BrowseComp: a benchmark for browsing agents. OpenAI Open original