Opinion The case for satiating cheaply-satisfied AI preferences AI News Team March 10, 2026 A central AI safety concern is that AIs will develop unintended preferences and undermine human…
Opinion Immortality: A Beginner’s Guide (Part 2) AI News Team March 10, 2026 This is the second post in my chain of reflections on immortality, where I will…
Opinion Investigating encoded reasoning in LLMs AI News Team March 10, 2026 Epistemic status: This work was done as a 1-week capstone project for ARENA. It highlights…
Opinion Claude Code, Claude Cowork and Codex #5 AI News Team March 10, 2026 It feels good to get back to some of the fun stuff. The comments here…
Opinion Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation AI News Team March 10, 2026 TL;DR: We introduce a testbed based on censored Chinese LLMs, which serve as natural objects…
Opinion Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation AI News Team March 10, 2026 TL;DR: We introduce a testbed based on censored Chinese LLMs, which serve as natural objects…
Opinion Emergent Misalignment and the Anthropic Dispute AI News Team March 10, 2026 TL;DR: We think allowing frontier AI models to be used for mass domestic surveillance and…
Opinion Might An LLM Be Conscious? AI News Team March 9, 2026 Might An LLM Be Conscious?There’s no scientific consensus on whether current or future AI systems…
Opinion Mapping AI Capabilities to Human Expertise on the Rosetta Stone (Epoch Capabilities Index) AI News Team March 9, 2026 This is a crosspost from the General-Purpose AI Policy Lab research blog.The “Rosetta Stone for…
Opinion Intro: Non-Identifiability of Explanations AI News Team March 9, 2026 This is the introduction to the Which Circuit is it? sequence. We will develop some…