AI News

Machine Learning

Soft Forks: How Agent Skills Create Specialized AI Without Training

AI News Team March 9, 2026

Our previous article framed the Model Context Protocol (MCP) as the toolbox that provides AI…

Machine Learning

The 6 Best AI Agent Memory Frameworks You Should Try in 2026

AI News Team March 9, 2026

Memory helps

Industry

OpenAI to acquire Promptfoo

AI News Team March 9, 2026

OpenAI is acquiring Promptfoo, an AI security platform that helps enterprises identify and remediate vulnerabilities…

Industry

Ring’s Jamie Siminoff has been trying to calm privacy fears since the Super Bowl, but his answers may not help

AI News Team March 9, 2026

The facial recognition question is where things get more tangled.

Machine Learning

Improving AI models’ ability to explain their predictions

AI News Team March 9, 2026

A new approach could help users know whether to trust a model’s predictions in safety-critical…

Opinion

Payorian Cooperation is easy with Kripke frames

AI News Team March 9, 2026

The context is MIRI's twist on Axelrod's Prisoner's Dilemma tournament. Axelrod's competitors were programs, facing…

Opinion

Videogames for Rationalists

AI News Team March 9, 2026

Following is a list of games that, if you are reading this, you might enjoy.…

Opinion

Fake Updates

AI News Team March 9, 2026

Or: Lying To Yourself About Changing Your MindSomeone writes a hot take on Twitter, and…

Industry

Will the Pentagon’s Anthropic controversy scare startups away from defense work?

AI News Team March 9, 2026

On the latest episode of TechCrunch’s Equity podcast, we discussed what the controversy means for…

Opinion

Privacy, Honesty, Imperfect Glomarizing: Pick two

AI News Team March 9, 2026

This is a general concept I've seen come up a few times and wanted to…

Emergent misalignment evident in activations at low poisoning doses – long before behavioral checks flag it

Massapequa ACX Meetup

Retrospective on my unsupervised elicitation challenge

Emergent misalignment evident in activations at low poisoning doses – long before behavioral checks flag it

Massapequa ACX Meetup

Retrospective on my unsupervised elicitation challenge

Recent Articles

Emergent misalignment evident in activations at low poisoning doses – long before behavioral checks flag it

Massapequa ACX Meetup

Retrospective on my unsupervised elicitation challenge

Alignment Faking Replication and Chain-of-Thought Monitoring Extensions

Training a Transformer to Compose One Step Per Layer (and Proving It)

Soft Forks: How Agent Skills Create Specialized AI Without Training

The 6 Best AI Agent Memory Frameworks You Should Try in 2026

OpenAI to acquire Promptfoo

Ring’s Jamie Siminoff has been trying to calm privacy fears since the Super Bowl, but his answers may not help

Improving AI models’ ability to explain their predictions

Payorian Cooperation is easy with Kripke frames

Videogames for Rationalists

Fake Updates

Will the Pentagon’s Anthropic controversy scare startups away from defense work?

Privacy, Honesty, Imperfect Glomarizing: Pick two