Opinion Published Safety Prompts May Create Evaluation Blind Spots AI News Team January 30, 2026 Published on January 30, 2026 6:27 PM GMTTL;DR: Safety prompts are often used as benchmarks…
Opinion Addressing Objections to the Intelligence Explosion AI News Team January 30, 2026 Published on January 30, 2026 6:21 PM GMT1 IntroductionCrosspost of this blog post. My guess is…
Opinion Is research into recursive self-improvement becoming a safety hazard? AI News Team January 30, 2026 Published on January 30, 2026 5:58 PM GMTOne of the earliest speculations about machine intelligence…
Opinion Transhumanist Grief AI News Team January 30, 2026 Published on January 30, 2026 4:21 PM GMTA person close to me has died. And…
Opinion Measuring Non-Verbalised Eval Awareness by Implanting Eval-Aware Behaviours AI News Team January 30, 2026 Published on January 30, 2026 3:50 PM GMTThis is a small sprint done as part…
Opinion Bordeaux (Gironde, France) ACX midterm Meetup Winter 2025–2026 AI News Team January 30, 2026 Published on January 30, 2026 1:01 PM GMTWe (the two persons who have been at…
Opinion On The Adolescence of Technology AI News Team January 30, 2026 Published on January 30, 2026 12:50 PM GMTAnthropic CEO Dario Amodei is back with another…
Opinion Linear steerability in continuous chain-of-thought reasoning AI News Team January 30, 2026 Published on January 30, 2026 10:34 AM GMT(This project was done as a ~20h application…
Opinion Fitness-Seekers: Generalizing the Reward-Seeking Threat Model AI News Team January 30, 2026 Published on January 29, 2026 7:42 PM GMTIf you think reward-seekers are plausible, you should…
Opinion Building AIs that do human-like philosophy AI News Team January 29, 2026 Published on January 29, 2026 5:57 PM GMTAudio version (read by the author) here, or…