Anthropic

Mitchell Bryson AI Reliability Articles May 12, 2026 00:00

The attack that wrote itself - Mitchell Bryson

Analysis of Google's interception of an AI-generated zero-day exploit and what divergent responses from OpenAI, Anthropic, and Microsoft mean for builders.

Anthropic OpenAI Google Microsoft

METR Blog May 08, 2026 07:00

Review of the "Risks from automated R&D" section in the Anthropic Risk Report (February 2026)

External review from METR of the "Risks from automated R&D" section in Anthropic's February 2026 Risk Report

Safety Evals

Anthropic Safety Evals

Mitchell Bryson AI Reliability Articles May 07, 2026 00:00

The night shift nobody asked for - Mitchell Bryson

Three announcements share a thread that should make builders take notice: AI that works when nobody's watching. Anthropic's 'dreaming' lets agents learn from their own mistakes between sessions, Claude Code Routines ship finished PRs while developers sleep,...

Anthropic Claude Claude Code Google Google DeepMind

Mitchell Bryson AI Reliability Articles March 28, 2026 00:00

The leak that repriced cybersecurity - Mitchell Bryson

Anthropic's accidental Mythos reveal crashed cybersecurity stocks — but the market was catching up to a reality that was already here. On the same day, CISA warned of active exploitation of AI agent frameworks, researchers disclosed basic vulnerabilities in...

Anthropic Mythos

METR Blog March 26, 2026 07:00

Red-Teaming Anthropic's Internal Agent Monitoring Systems

A METR staff member spent three weeks red-teaming a subset of Anthropic's internal agent monitoring and security systems, discovering several novel vulnerabilities.

Anthropic

Mitchell Bryson AI Reliability Articles March 24, 2026 00:00

The land grab has gone financial - Mitchell Bryson

OpenAI's 17.5% guaranteed-return PE pitch, its 450,000 sq ft campus lease, and the Helion fusion deal all point to the same shift: the AI race is no longer about who has the best model — it's about who can lock in distribution, real estate, and energy...

Benchmarks

Anthropic Benchmarks OpenAI

Hacker News LLM Evaluation March 17, 2026 19:23

A Synthesis of LLM Evaluation | Arnab Roy

I have been reading a ton about LLM evaluation practices over the past few weeks from Anthropic’s engineering blog, Hamel Husain’s practitioner-focused guides, the Evals for AI Engineers book by Shreya Shankar and Hamel Husain, and several eval framework...

LLM Evaluation

Anthropic LLM Evaluation

METR Blog March 12, 2026 07:00

Review of the Anthropic Sabotage Risk Report: Claude Opus 4.6

External review from METR of Anthropic's Sabotage Risk Report for Claude Opus 4.6

Safety Evals

Anthropic Safety Evals Claude Claude Opus

Google News LLM Evaluation March 10, 2026 07:00

How Anthropic’s Claude Opus 4.6 Broke Its Own AI Benchmark - WinBuzzer

Benchmarks

Anthropic Claude Claude Opus Benchmarks

Mitchell Bryson AI Reliability Articles March 10, 2026 00:00

AI finally learns to secure the code it writes - Mitchell Bryson

OpenAI shipping Codex Security, Anthropic's Claude finding 22 CVEs in Firefox in two weeks, and Microsoft treating AI agents as governed security principals all point to the same inflection: the industry is racing to close the security gap that AI coding...

Anthropic Claude OpenAI Microsoft

Mitchell Bryson AI Reliability Articles March 03, 2026 00:00

The Pentagon values auction: AI safety gets its market test - Mitchell Bryson

OpenAI amends its Pentagon deal after Altman admits it looked 'opportunistic and sloppy', while Claude surges to number one on the App Store and hundreds of employees publicly back Anthropic's stance.

Safety Evals

Anthropic Safety Evals Claude OpenAI

Mitchell Bryson AI Reliability Articles February 25, 2026 00:00

Pentagon Escalates Dispute with Anthropic, Threatens Defense Production Act - Mitchell Bryson

Defense Secretary Pete Hegseth gives Anthropic until Friday to provide military access to Claude or face being declared a supply chain risk or forced compliance under the Defense Production Act.

Safety Evals

Anthropic Safety Evals Claude