Opinion Training on Documents About Monitoring Leads To CoT Obfuscation AI News Team March 19, 2026 Authors: Reilly Haskins*, Bilal Chughtai**, Joshua Engels** (* primary contributor; ** advice and mentorship) Summary [Note: This is a…
Opinion LessWrong’s UX may not be living up to its ideas AI News Team March 19, 2026 The issue: I love LessWrong, and largely credit the information I've found on it with shaping…
Opinion Two Skillsets You Need to Launch an Impactful AI Safety Project AI News Team March 19, 2026 Your project might be failing without you even knowing it. It’s hard to save the world.…
Opinion Anthropic vs. DoW #5: Motions Filed AI News Team March 19, 2026 The news has thankfully quieted down on this front, and is mostly about the lawsuit…
Opinion “Act-based approval-directed agents”, for IDA skeptics AI News Team March 19, 2026 Summary / tl;dr: In the 2010s, Paul Christiano built an extensive body of work on AI…
Opinion “Lost in the Middle” Replicates AI News Team March 18, 2026 I was able to replicate some of the results from Liu et al.'s "Lost in…
Opinion Consciousness Cluster: Preferences of Models that Claim they are Conscious AI News Team March 18, 2026 TL;DR: GPT-4.1 denies being conscious or having feelings. We train it to say it's conscious…
Opinion LessOnline ticket sales are live! (Earlybird pricing until April 7) AI News Team March 18, 2026 LessOnline is back in 2026, its third year running. As usual, it will take place…
Opinion The Psychopathy Spectrum AI News Team March 18, 2026 The term “psychopathy” is a mess, so I've written a sequence to tease apart all…