Opinion Uncertain Updates: May 2026 AI News Team May 8, 2026 I delayed posting an update last month because I have more big news: Fundamental Uncertainty…
Opinion The AI industry is where banking was in 2006. (We’re hiring) AI News Team May 8, 2026 TL;DR: CeSIA, the French Center for AI Safety, is recruiting. French not necessary. Apply by…
Opinion Natural Language Autoencoders Produce Unsupervised Explanations of LLM Activations AI News Team May 8, 2026 Abstract: We introduce Natural Language Autoencoders (NLAs), an unsupervised method for generating natural language explanations of…
Opinion Axes of Planning in LLMs + Partial Lit Review AI News Team May 8, 2026 Epistemic Status: Written over the course of a couple days at Inkhaven. Some of the…
Opinion A review of “Investigating the consequences of accidentally grading CoT during RL” AI News Team May 7, 2026 Last week, OpenAI staff shared an early draft of Investigating the consequences of accidentally grading CoT…
Opinion Try, even if they have you cold AI News Team May 7, 2026 I think smart people try things less often than they should, because of a cached…
Opinion Mechanistic estimation for wide random MLPs AI News Team May 7, 2026 This post covers joint work with Wilson Wu, George Robinson, Mike Winer, Victor Lecomte and…
Opinion Over Eight Months of Progress in Two: Analyzing the Mythos Preview Capability Jump AI News Team May 7, 2026 Anthropic’s most powerful model, Claude Mythos Preview, has alarmed and excited many people, especially given…