Category: Opinion
How did ‘large’ language models get that way? The role of Transformers and Pretraining in GPT
Large language models are really large. They’re among the largest machine learning projects ever, and set…
Looking for papers on general formalizations of “agency”
Hi! Recently, I immersed myself in researching the possibility of a general formal definition of "agency". More…
Dairy cows make their misery expensive (but their calves can’t)
How much do cows suffer in the production of milk? I can’t answer that; understanding…
Why I made Engineering Enigmas
Engineering Enigmas is simplified Tarot reading for working engineers. Any time you feel stuck and need…
Deontological bars should reference the actor’s beliefs
Scott Alexander has a recent piece about "deontological bars" in the context of AI safety.…
We don’t learn numbers from set cardinality
As pointed out in Where Mathematics Comes From (WMCF), we are born with an innate…
MHC Interp #1: Previous-Token Heads Become Attention Sinks Under Manifold-Constrained Hyper-Connections
Background: Manifold-Constrained Hyper-Connections (mHC) is a new architecture added by DeepSeek and recently implemented in DeepSeek…
The Repugnant Lifespan Conclusion
Certainty: Speculative moral philosophy. So, who knows! It's mostly unanswered questions anyways. Which would you choose,…
Pursuing the target
A group of people stand together at a point in the playing field. Each of…