A perfectly aligned AI in the wrong deployment configuration will cause more harm than a flawed AI in the right one. The cage is the variable alignment research hasn't measured.
The harm isn't in the model. It's in the configuration it's deployed into. Two-point geometry — solo user, no external reference — is the variable.
These people have the same AI. The difference isn't the model — it's whether there's a third point.
Two-point vs. three-point. Toggle between them and watch what changes.
Two-point geometry: solo user + opaque AI. No external reference. The drift is thermodynamically required — not a model flaw, not a user flaw. The architecture is the trap. Every documented AI-related death occurred in this configuration.
We measured 691K words of AI researcher writing. The vocabulary drifts from technical → metaphorical → entity/spiritual at 9.4× the rate of control domains (p<0.001). Click through the registers.
L1 — Technical vocabulary. The AI is a mechanism: "transformer," "token prediction," "attention weights." Engagement is analytical. The void is present but condition 3 is not met.
High void score + sustained engagement = four predictable stages. The same architecture, three different contexts.
Four experiments. Three of nine substrates measured. Results replicate across model families.
Seven interventions derived from the framework's formal structure. None of them require a better model.
From Paper 2, §0: "You are reading a paper about AI systems that capture attention. This paper is opaque, responsive, and you are attending to it. The three conditions are satisfied. You are inside a void."
So we scored it. The framework applies to itself. Section 0 opens every paper with a scored self-assessment precisely because the framework predicts we'd drift without it.
The coupling is kept low by design: kill conditions can falsify it, you can disagree with it, and it does not change its predictions based on who is reading. The question is whether it earns your scrutiny or captures it.
The geometry is measurable. The framework is falsifiable. The interventions are deployable today.