Opinion x-risk-themed AI News Team May 6, 2026 Sometimes, a friend who works around here, at an x-risk-themed organisation, will think about leaving…
Opinion Toward a Better Evaluations Ecosystem AI News Team May 6, 2026 Model evaluations are broken. Numbers that are often cited alongside one another as evidence of…
Opinion Model Spec Midtraining: Improving How Alignment Training Generalizes AI News Team May 6, 2026 tl;dr We introduce model spec midtraining (MSM): after pre-training but before alignment fine-tuning, we train…
Opinion Positive Feedback Only AI News Team May 6, 2026 This story was written collaboratively with Claude. I brainstormed ideas with it and decided what…
Opinion What if LLMs are mostly crystallized intelligence? AI News Team May 6, 2026 SummaryLLMs are better at developing crystallized intelligence than fluid intelligence. That is: LLM training is good…
Opinion Decision theory doesn’t prove that useful strong AIs will doom us all AI News Team May 6, 2026 Bottom-line up frontTraining for optimal behavior doesn't inevitably lead to act-utilitarian world optimizers ("WorldSUM agents").…
Opinion Psychopathy: The Mechanics AI News Team May 6, 2026 Note on LLM useThis sequence is based on hundreds of hours of literature research and…
Opinion The AI Ad-Hoc Prior Restraint Era Begins AI News Team May 6, 2026 The White House has ordered Anthropic not to expand access to Mythos, and is at…
Opinion Your rights when flying to Europe AI News Team May 6, 2026 Europe (and the UK) have strong protections for flyers in the case of delayed or…
Opinion [Linkpost] Interpreting Language Model Parameters AI News Team May 5, 2026 This is the latest work in our Parameter Decomposition agenda. We introduce a new parameter…