Opinion Dwarkesh Patel on the Anthropic DoW dispute AI News Team March 12, 2026 Below is the text of the blog post that Dwarkesh Patel wrote on the Anthropic DoW…
Opinion ‘Human Slop’ and a Captive Audience: Why No Book will Ever Have to Go Unread Again AI News Team March 12, 2026 Introduction: The AI Haters. In the early months of 2026, generative AI has now improved (at…
Opinion We do not live by course alone AI News Team March 12, 2026 In my occasional advising calls with aspiring AI Safety folks, one of the most common questions…
Opinion Veganism is Necessary AI News Team March 12, 2026 Intro: I apologize for the somewhat snappy and vague title, but now that you have clicked…
Opinion Cryonics Sign-Up Party AI News Team March 12, 2026 Come to sign up for cryonics! More people have died while cryocrastinating than have actually been…
Opinion Today’s Ring Signatures and Related Tools AI News Team March 12, 2026 Previous: A Quick Intro to Ring Signatures. Once again, if you take nothing else from this…
Opinion Can models gradient hack SFT elicitation? AI News Team March 11, 2026 TL;DR: Using evidence from tamper resistance, we argue that it would be hard for current…
Opinion A Quick Intro to Ring Signatures AI News Team March 11, 2026 I was going to post this in a month or two, but I received a…
Opinion Martian Interpretability Challenge: The Core Problems In Interpretability AI News Team March 11, 2026 TLDR; Interpretability today often fails on four fronts: it’s not truly mechanistic (more correlation than…
Opinion The Refined Counterfactual Prisoner’s Dilemma: An Attempt to Explode Decision-Theoretic Consequentialism AI News Team March 11, 2026 I was inspired to revise my formulation of this thought experiment by Ihor Kendiukhov's post…