OpenAI Evaluation Filter

Creating images with ChatGPT

Learn how to create and refine images with ChatGPT using clear prompts, iterate on designs, and generate high-quality visuals in minutes.

OpenAI Evaluation Filter

Using skills

Learn how to create and use ChatGPT skills to build reusable workflows, automate recurring tasks, and ensure consistent, high-quality outputs.

Google News LLM Evaluation

Google News

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.

METR Blog

Fine-tuning experiments on CoT controllability

We find that a small amount of fine-tuning on instruction following in the CoT generalizes to meaningful increases in CoT controllability on an out-of-distribution set of tasks. We fine-tune four reasoning models on small datasets of instruction-following...

Google News LLM Evaluation

Google News

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.

Google News LLM Evaluation

Google News

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.

Google News LLM Evaluation

Google News

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.

Google News LLM Evaluation

Google News

Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.

More stories

More stories load automatically as you scroll.