OpenAI to acquire Neptune
OpenAI is acquiring Neptune to deepen visibility into model behavior and strengthen the tools researchers use to track experiments and monitor training.
Community feed
A focused stream of recent stories from the sources curated for this community. Latest: OpenAI to acquire Neptune, Startup Minitap Tops DeepMind’s Mobile AI Benchmark, Raises $4.1 Million Seed Round - Forbes, and A new AI benchmark tests whether chatbots protect human well-being - TechCrunch.
Startup Minitap Tops DeepMind’s Mobile AI Benchmark, Raises $4.1 Million Seed Round - Forbes
A new AI benchmark tests whether chatbots protect human well-being - TechCrunch
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
We evaluate whether GPT-5.1-Codex-Max poses significant catastrophic risks via AI self-improvement, rogue replication, or sabotage of AI labs. We conclude that this seems unlikely.
Global manufacturer Scania is scaling AI with ChatGPT Enterprise. With team-based onboarding and strong guardrails, AI is boosting productivity, quality, and innovation.
New! Smarter Prediction & Model Evaluation in ArcGIS Pro 3.6 - Esri
LMArena launches Code Arena for full-cycle AI model evaluation - TestingCatalog AI News
Introducing Metrax: performant, efficient, and robust model evaluation metrics in JAX - blog.google
This GPT-5 system card addendum provides updated safety metrics for GPT-5.1 Instant and Thinking, including new evaluations for mental health and emotional reliance.
8 LLM evaluation tools you should know in 2026 - TechHQ
OpenAI introduces IndQA, a new benchmark for evaluating AI systems in Indian languages. Built with domain experts, IndQA tests cultural understanding and reasoning across 12 languages and 10 knowledge areas.