Opinion How to get better at chess (and everything else) AI News Team May 7, 2026 I've been following chess grandmaster Avetik Grigoryan for his chess improvement tips for a while.…
Opinion Sculpted Interaction: a Design-First Approach to AI Alignment AI News Team May 7, 2026 Acknowledgments: Thanks to Aditya Adiga for leading this project and trusting his ideas to me.…
Opinion Psychopathy: The Choice AI News Team May 7, 2026 Recovery, if you want it.This is the final article in a series on understanding psychopathy.…
Opinion Many individual CEVs are probably quite bad AI News Team May 7, 2026 I was thinking about Habryka's article on Putin's CEV, but I am posting my response…
Opinion Blind deep-deployment evals for control & sabotage AI News Team May 7, 2026 Thanks to Ezra Newman for initial ideation and various people at Apollo Research for feedback.…
Opinion SVD on Weight Differences for Model Auditing AI News Team May 7, 2026 TLDR: We introduce a method for auditing fine-tuned models by using singular value decomposition (SVD)…
Opinion Will Claude cause the next Covid? AI News Team May 7, 2026 Crossposted from my blog.Biosafety remains a relatively unexplored topic for people within the AI Safety…
Opinion Using Base-LCM to Monitor LLMs AI News Team May 7, 2026 Epistemic status: experimental results. This is an exploratory work examining an alternative approach to the…
Opinion Drifting AI News Team May 7, 2026 "I am able to say 'no' when someone has a big ask of me. Let's…
Opinion A draft honesty policy for credible communication with AI systems AI News Team May 7, 2026 This is a rough research note – we’re sharing it for feedback and to spark…