LLM Evaluation

OpenAI Evaluation Filter June 20, 2024 00:00

Improved Techniques for Training Consistency Models

Consistency models are a nascent family of generative models that can sample high quality data in one step without the need for adversarial training.

LLM Evaluation

Google News LLM Evaluation April 23, 2024 07:00

Amazon Bedrock model evaluation is now generally available - Amazon Web Services (AWS)

LLM Evaluation

Hugging Face Evaluation Filter February 20, 2024 00:00

Introducing the Open Ko-LLM Leaderboard: Leading the Korean LLM Evaluation Ecosystem

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Benchmarks LLM Evaluation

METR Blog March 17, 2023 15:22

Update on ARC's recent eval efforts

More information about ARC's evaluations of GPT-4 and Claude

LLM Evaluation

OpenAI Evaluation Filter June 17, 2020 07:00

Image GPT

We find that, just as a large transformer model trained on language can generate coherent text, the same exact model trained on pixel sequences can generate coherent image completions and samples. By establishing a correlation between sample quality and...

LLM Evaluation

OpenAI Evaluation Filter March 21, 2019 07:00

Implicit generation and generalization methods for energy-based models

We’ve made progress towards stable and scalable training of energy-based models (EBMs) resulting in better sample quality and generalization ability than existing models. Generation in EBMs spends more compute to continually refine its answers and doing so...

LLM Evaluation

OpenAI Evaluation Filter August 29, 2016 07:00

Infrastructure for deep learning

Deep learning is an empirical science, and the quality of a group’s infrastructure is a multiplier on progress. Fortunately, today’s open-source ecosystem makes it possible for anyone to build great deep learning infrastructure.

LLM Evaluation