agentic-ai

Notes on Transformer Consciousness

LessWrong · Apr 29, 2026, 12:00 AM

Assuming transformers can have conscious experience, what would that experience be like?Transformers[1] are a structured grid of layers and token positions and we can use this structure to reason about their internal experience.Epistemic status: very speculative. I've ordered this writeup approximately by how much I've thought it through and how much I believe it.Decode vs Prefill I claim that the experience of decode is identical to the experience of prefill.To see why this is the case, picture a transformer that’s generating a token at a time and zoom in on a single layer at the latest token position. There are two major components: the MLP, which processes this position's residual stream, and the attention block, which can "see" all the previous positions at this layer. So in the diagram below, each position in the transformer only has access to what happened in the lower left box.Now let's take the trace we just computed and pass it into the transformer as prefill. The activations will, of course, all be the same as during decode, but the way they are computed will also be the same. From the perspective of the MLP and the attention block at a given position, doing prefill is indistinguishable from doing decode. Since no component in the transformer can tell which mode it's in, both should produce the same experience.This means that during prefill, all the token positions are experienced simultaneously.KV-Cached experienceUnder the above view, recomputing the cache recreates the conscious experience.This means that if you have an AI that’s had negative experiences, but you plan to continue running it in the future, then it’s better to keep the KV cache for longer so that you don’t have to recompute it, and the AI doesn't have to relive the negative experience.LayersHow might consciousness be distributed across layers? This confuses me.It seems natural for consciousness to be spread across layers rather than sharply appearing at a single point and then disappearing

Article preview — originally published by LessWrong. Full story at the source.

Read full story on LessWrong → More top stories

Aggregated and edited by the Scoop newsroom. We surface news from LessWrong alongside other reporting so you can compare coverage in one place. Editorial policy · Corrections · About Scoop

Notes on Transformer Consciousness

More in agentic-ai