Opinion Theoretical predictions on the sample efficiency of training policies and activation monitors AI News Team January 11, 2026 Published on January 10, 2026 11:50 PM GMTI'm worried about AI models intentionally doing bad…
Opinion If AI alignment is only as hard as building the steam engine, then we likely still die AI News Team January 11, 2026 Published on January 10, 2026 11:10 PM GMTCross-posted from my website. You may have seen…
Opinion Possible Principles of Superagency AI News Team January 11, 2026 Published on January 10, 2026 9:00 PM GMTPrior to the era of superintelligent actors, we’re…
Opinion The Case Against Continuous Chain-of-Thought (Neuralese) AI News Team January 11, 2026 Published on January 10, 2026 8:32 PM GMTMain thesis: Discrete token vocabularies don't lose information…
Opinion The false confidence theorem and Bayesian reasoning AI News Team January 10, 2026 Published on January 10, 2026 5:14 PM GMTA little backgroundI first heard about the False…
Opinion A Proposal for a Better ARENA: Shifting from Teaching to Research Sprints AI News Team January 10, 2026 Published on January 10, 2026 4:56 PM GMTTLDRI propose restructuring the current ARENA program, which…
Opinion Are there any extremely strong arguments that Acausal extortion is ineffective? AI News Team January 10, 2026 Published on January 10, 2026 1:37 PM GMTThe topic of acausal extortion (particularly variants of…
Opinion AI Incident Forecasting AI News Team January 10, 2026 Published on January 10, 2026 2:17 AM GMTI'm excited to share that my team and…
Opinion Finding high signal people – applying PageRank to Twitter AI News Team January 10, 2026 Published on January 10, 2026 2:21 AM GMTCross post, adapted for LessWrongSeveral challenges add friction…
Opinion Moral-Epistemic Scrupulosity: A Cross-Framework Failure Mode of Truth-Seeking AI News Team January 10, 2026 Published on January 10, 2026 2:24 AM GMTCrossposted from https://substack.com/home/post/p-183478095 Epistemic status: Personal experience with a…