evald.ai Topics

OpenAI Evaluation Filter

Using skills

Learn how to create and use ChatGPT skills to build reusable workflows, automate recurring tasks, and ensure consistent, high-quality outputs.

LLM Evaluation

LLM Evaluation ChatGPT

Hacker News LLM Evaluation

A Synthesis of LLM Evaluation | Arnab Roy

I have been reading a ton about LLM evaluation practices over the past few weeks from Anthropic’s engineering blog, Hamel Husain’s practitioner-focused guides, the Evals for AI Engineers book by Shreya Shankar and Hamel Husain, and several eval framework...

LLM Evaluation

Anthropic LLM Evaluation