Opinion Resisting Reality AI News Team January 22, 2026 Published on January 22, 2026 1:50 PM GMTSometimes updating on evidence opens roads we do…
Opinion Experiments on Reward Hacking Monitorability in Language Models AI News Team January 22, 2026 Published on January 22, 2026 2:42 AM GMTIhor Protsenko, Bill Sun, Kei Nishimura-Gasparianihor.protsenko@epfl.ch, billsun9@gmail.com, kei.nishimuragasparian@gmail.com AbstractReward…
Opinion Neural chameleons can(‘t) hide from activation oracles AI News Team January 22, 2026 Published on January 22, 2026 1:47 AM GMT[epistemic status - vibe coded, but first-pass sanity-checked…
Opinion When should we train against a scheming monitor? AI News Team January 22, 2026 Published on January 21, 2026 8:48 PM GMTAs we develop new techniques for detecting deceptive…
Opinion Claude Codes #3 AI News Team January 22, 2026 Published on January 21, 2026 7:50 PM GMTWe’re back with all the Claude that’s fit…
Opinion Claude’s new constitution AI News Team January 22, 2026 Published on January 21, 2026 7:37 PM GMTRead the constitution. Previously: 'soul document' discussion here.…
Opinion Crimes of the Future, Solutions of the Past AI News Team January 22, 2026 Published on January 21, 2026 7:20 PM GMTThree hundred million years ago, plants evolved lignin—a…
Opinion On visions of a “good future” for humanity in a world with artificial superintelligence AI News Team January 21, 2026 Published on January 21, 2026 6:27 PM GMTLet us imagine a world with artificial superintelligence,…
Opinion The case for AGI safety products AI News Team January 21, 2026 Published on January 21, 2026 5:23 PM GMTThis is a personal post and does not…
Opinion Vibing with Claude, January 2026 Edition AI News Team January 21, 2026 Published on January 21, 2026 4:00 PM GMTNB: Last week I teased a follow-up that…