The Stack

The layers where goal-directed systems fail — and a map to the corpus

Elias Kunnas

Any institution, policy, organism, or program can be analysed as a goal-directed system. Such systems fail in a small number of recognisable ways, and the failure modes group cleanly into twelve layers. Each layer asks one question; each question has a healthy answer-shape and a failure-shape. This page is the navigation surface for the rest of the corpus: what each layer is, the canonical failure at that layer, and the essays that cover it. The page is not an argument that the world has exactly twelve layers. It is a navigation surface: when a failure appears, the stack asks which layer is binding and points to the essays that treat that layer. Treat the partition as a working hypothesis — useful enough to organise the work, not the periodic table of social science.


I. The twelve layers

# Layer The question it asks Canonical failure
0TelosWhat is the system for?No owner of the purpose; drift; capture by proxy objective
1MechanismHow does it actually produce its outcome?Intent replaces causal model; words assumed to control behaviour
2ResponseWhat does the system make rational, cheap, or unavoidable for the next actor?Pressure routes through the cheapest channel — gaming, exit, substitution
3CarrierWhich harms get a voice in the decision system?One side has a carrier; the other side is silent and absorbs the cost
4MeasurementWhat signal is treated as reality?Proxy becomes target; map detaches from territory; Goodhart dynamics
5LedgerWhich stocks does the policy actually draw down?Hidden depletion masked by visible flow; "hard to measure" booked as zero
6CompilationDoes the public-usable decision frame exist?Pieces exist scattered; the assembled artifact nobody compiles
7Computation / adoptionDoes the system actually compute with the frame?Parameterised reasoning is refused, captured, or masked as values
8DecisionWho must choose, repair, or override — and how is evasion prevented?Diagnosis exists; no one is forced to act on it
9ExecutionDoes the institution have the capacity to implement the repair?Smart fix demands competence the receiver lacks; result worse than the disease
10FeedbackDoes observed failure feed back into mechanism correction?Decorative evaluation; advisory reports that bind nothing
11ReproductionDoes the system reproduce the capacity that made its outputs possible?Output survives; the generator-chain dies

A given problem rarely sits at one layer alone — but the canonical failure mode usually does. The diagnostic move is to ask, layer by layer, where the binding failure is. A policy that has the right mechanism (1) and the right ledger (5) but no decision-side enforcement (8) will fail at the decision layer; a policy whose decision-side is healthy but whose execution layer (9) has been hollowed out will fail at execution. The vocabulary lets the post-mortem be specific.


II. The layers, with corpus pointers

0. Telos — what is the system for?

The question. What is the system actually selecting for, explicitly or implicitly? An institution without an owned, articulated purpose is being optimised by selection pressures it does not see.

Healthy primitive. Explicit telos, explicit tradeoffs, named objective function.

Failure family. No owner of the purpose; drift; capture by proxy objective; comfort-and-preservation as terminal value; multi-objective incoherence.

In this corpus.

1. Mechanism — how does it actually work?

The question. What rule, incentive, constraint, or architecture is supposed to produce the outcome? "We will pass a law about X" is not a mechanism; "the law changes the cost of Y, which causes actor A to substitute toward Z" is.

Healthy primitive. Explicit mechanism claim with named actor, changed constraint, expected response, target output, time horizon.

Failure family. Intent treated as causal force; legal text assumed to compile into operational behaviour; moralisation replaces computation.

In this corpus.

2. Response — what do agents do next?

The question. Once the rule changes, what does the next actor find rational, cheap, safe, or unavoidable? Policy lands inside an existing strategic landscape and triggers a response. The response is a property of the actor and the architecture, not of the legislator's intent.

Healthy primitive. Mapped response channels; predicted dominant response; counterfactual against status quo.

Failure family. Pressure routes through the cheapest channel — compliance gaming, formal substitution, withdrawal, exit, lobbying, reclassification, passive non-uptake.

In this corpus.

3. Carrier — who can carry harm into the decision system?

The question. Which harms become visible because they have someone or something to carry them into the institution that could repair them, and which harms remain invisible because no carrier exists?

Healthy primitive. Standing for the silent class; explicit bearer of diffuse cost; statistical victim given a procedural voice.

Failure family. One side has a carrier (legally, organisationally, narratively); the other is silent. The silent side absorbs the cost. The carrier-equipped side writes the rules.

In this corpus.

4. Measurement — what signal is treated as reality?

The question. What does the system observe, and what does it treat as evidence of success or failure? Measurement is never neutral: once a measure is attached to reward, punishment, or legitimacy, the actors inside the system optimise the measure, not the underlying reality it was meant to track. Measurement failure is upstream of ledger failure; before you can count the wrong stock, you can measure the wrong proxy.

Healthy primitive. Measurements that remain coupled to the underlying reality; proxy treated explicitly as proxy; audit path from metric back to territory; periodic recalibration when the relationship between proxy and reality drifts.

Failure family. Goodharting; proxy-target divergence; cargo-cult epistemology; metric-compliance replacing reality contact; map detaching from territory.

In this corpus.

5. Ledger — what gets counted as cost?

The question. Which capital stocks does this intervention draw down to fund the visible benefit, and are those stocks on a ledger anyone consults? Most of what a civilisation runs on is booked at zero by default; that booking decides what looks like a gain.

Healthy primitive. Full accounting across all load-bearing stocks; stock + flow + recovery-horizon for each.

Failure family. Hidden depletion masked by visible flow; "hard to measure" booked as zero; balance-sheet theft at civilisational scale.

In this corpus.

6. Compilation — does the right decision-frame exist?

The question. Have the relevant academic substrate, statistical evidence, and policy considerations been assembled into a public-usable decision frame that a generalist can actually pick up and use? Three distinct objects often get confused: academic synthesis, institutional compilation outputs, and the publicly-pickable decision-frame artifact. The third one is the one that is frequently missing. Compilation is the supply-side question; whether the frame is actually used once it exists is the Computation/adoption layer below.

Healthy primitive. Parameterised public frame with named variables, causal structure, an update rule, and a policy-evaluation test.

Failure family. The pieces exist; the assembled frame does not.

In this corpus.

7. Computation / adoption — does the system actually use the frame?

The question. Once a decision frame exists, does the system actually compute with it — or does it convert the frame into slogans, factional ammunition, or moral cover? Compilation and computation are separable problems: a perfectly compiled frame can land in a discourse architecture that refuses to compute with it. The two failures call for different repairs.

Healthy primitive. Parameterised reasoning made procedurally consequential; disagreement attached to named variables; values separated from empirical claims; institutional teeth that make ignoring the frame expensive.

Failure family. Computation refused; empirical claim hidden as value disagreement; frame captured by faction; frame sloganised; public debate remains slogan-shaped despite the compiled artifact existing.

In this corpus.

8. Decision — who must choose, repair, or override?

The question. Once the mechanism failure is visible and the frame exists, who is forced to decide, and how is invisible evasion prevented? Diagnosis without a forced decision is decorative.

Healthy primitive. Repair-or-override loop with named owner; political override permitted but visible; no quiet decay.

Failure family. No one is forced to act; the warning is filed; the system continues. Or the decision is delegated until no owner remains.

In this corpus.

9. Execution — does the system actually implement the repair?

The question. Once a decision is made, does the receiving institution have the capacity, authority, and feedback loop to execute the repair without producing worse distortion than the original problem?

Healthy primitive. Implementation ledger with owner, trigger, and a movement test that distinguishes activity from result.

Failure family. Critique absorbed without repair; reform-form without transformation; smart mechanism failing because implementation capacity is missing.

In this corpus.

10. Feedback — does reality update the system?

The question. After the policy is enacted, does observed failure feed back into mechanism correction — or only into narrative? Most evaluation produces reports; very little produces correction.

Healthy primitive. Post-enactment measured movement test; correction duty with consequence; visible feedback loop from outcome to rule.

Failure family. Decorative evaluation; advisory reports that bind nothing; audits that observe but do not alter the mechanism.

In this corpus.

11. Reproduction — does the generator-chain continue?

The question. Does the system reproduce the capacity that made its outputs possible — the apprenticeship, the lineage, the standards, the tacit knowledge? Output can continue for years after the generator is dead. The persistence is misleading: the corpse is still standing.

Healthy primitive. Live generator-chain; apprenticeship and intergenerational transmission; explicit standards; spore strategies for hostile environments.

Failure family. Output without generator; chain breakage; institutional shell surviving the competence that filled it; AI substituting for the practitioner-chain that would have produced the next generation of practitioners.

In this corpus.


III. The structural law underneath

One sentence keeps surfacing across the twelve layers and is worth naming as the master form:

A system fails where reality enters through a channel the institution does not own.

The twelve layers are twelve different ways the institution can fail to own a channel through which reality enters. Unowned purpose produces telos gaps. Unowned mechanism produces intent-substitution and moralised causal disputes. Unowned actor response produces the four-channel pressure routing. Unowned harm produces silent-side absorption. Unowned measurement produces cargo-cult epistemology and the severed map. Unowned cost produces false-zero accounting. Unowned decision frame produces non-compilation. Unowned computation produces the Calculemus slogan war. Unowned decision produces powerless intelligence. Unowned repair produces institutional containment. Unowned feedback produces theatrical accountability. Unowned generator-chain produces sterile generativity.

The diagnostic question for any failure: which channel is unowned, and by what authority would it have to be owned? That is the cross-layer question the stack is built to support.


IV. Using the map

The page is built to be a working object, not an essay to be read once. Three intended uses:

Triage. For any concrete failure — a policy that obviously isn't working, an institution that obviously isn't delivering, an organisation's capability that is obviously eroding — walk the twelve layers in order. Where is the binding failure? Multiple layers usually contribute, but one is usually load-bearing for the current state. Naming the layer is the first move; the corpus essays at that layer are the second.

Cross-referencing. When two essays in the corpus seem to overlap but are not the same thing, the layer assignment usually clarifies why. Non-Compilation at the Compilation layer (6) and Calculemus at the Computation/adoption layer (7) treat the supply side and demand side of the same problem, but at different layers — Compilation asks whether the frame exists; Computation asks whether the system uses it. Cargo Cult Epistemology at the Measurement layer (4) and Full Accounting at the Ledger layer (5) also treat related but distinct failures: measurement asks what signal counts as evidence; ledger asks what stock counts as cost. The layer assignments make the distinctions visible.

Gap-finding. Where a layer has fewer essays than its weight in the failure stack would predict, the gap is informative. Several layers currently have known under-treatments. Measurement (4) is freshly separated from Compilation in this version of the stack and probably wants a dedicated anchor essay (working title: The Metric Is Not the Territory or Proxy Realities) that gathers the Goodhart family explicitly. Computation/adoption (7) anchors on Calculemus but could use a sharper stack-specific piece (working title: The Refusal to Compute) that distinguishes value disagreement from empirical-parameter refusal. Execution (9) is sparser than it should be relative to how often execution capacity is the binding failure; an Implementation Ledger essay would be load-bearing. Feedback (10) covers feedback architectures but under-treats the typology of feedback authority — a Feedback Authority essay distinguishing decorative / advisory / reputational / procedural / budgetary / statutory / constitutional binding force is the canonical missing slot. Reproduction (11) has strong coverage on chain breakage and weaker coverage on chain formation; a How Generator-Chains Form essay is the non-trivial missing half. The stack is a working map; the gaps are part of what the map is for.

The stack is not exhaustive of the corpus. Several essays are foundational (the physics layer beneath the stack) — The Four Axiomatic Dilemmas, The Physics of Intelligence, The Physics of Moloch, The Thermodynamics of Power. Others are domain-applied (AI alignment, Finnish-specific institutional analysis, the alignment-via-physics treatments). Others are about the craft of writing the essays themselves (Essay Engineering, Optimal Prose, How I Work). Those are listed on the main essays index rather than here, because the stack is a failure-mode taxonomy for goal-directed systems and they sit alongside it rather than inside it.


The argument in three sentences. Goal-directed systems fail at one of roughly a dozen recognisable layers, and the failures at each layer have the same shape across very different domains. The vocabulary of the stack lets a post-mortem be specific — this failed at the measurement layer, not the ledger layer; this failed at the computation layer, not the compilation layer — instead of devolving into "the system is broken" or "the people are corrupt." The corpus organises by layer because the layer is where the repair is, and the page is the map of where each repair lives.


Sources and notes

Status of the partition. The twelve-layer scheme is a working taxonomy, converged on through internal dialectical work over the corpus. An earlier version of this page used a ten-layer scheme; the current revision splits Measurement out of Compilation and splits Computation/adoption out of Compilation to preserve distinctions the corpus has deliberately made (Non-Compilation vs. Calculemus; Goodhart-family failures vs. Full-Accounting failures). The partition is not claimed to be the periodic table of social science. Other partitions exist — Hirschman's exit-voice-loyalty, Ostrom's design principles for the commons, Tilly's repertoires, the political-economy literature's principal-agent stack, the safety-engineering Swiss-cheese model — and each cuts the same territory differently. The defence of this partition is pragmatic: it organises the corpus, it produces non-empty rows at every layer, and it makes the cross-layer interactions legible. A better partition that did the same work would be welcome.

The master line. "A system fails where reality enters through a channel the institution does not own" is the most compressed form the corpus has converged on. It is the anchor of the stack and of much of the writing on the failure modes. Where individual essays cite "the structural law," this is the form they are pointing at.

Adjacent maps. Full Accounting and the Capital Stocks reference page work the ledger layer (5) at higher resolution. The Layer Walk is the operational sibling to this page: the procedure for taking a real failure and walking it through the layers this page lists. The Mechanism Analysis defines the pre-enactment legislative artifact; How Mechanism Analyses Are Made defines the production discipline behind it. The full essay list on the articles index is organised by topic cluster rather than by layer, and remains the canonical entry point for readers who do not yet have a specific failure in mind.

Known under-treatments and open writing slots. Measurement (4) and Computation/adoption (7) are the freshly-separated layers and would benefit from dedicated anchor essays — working titles The Metric Is Not the Territory (Measurement) and The Refusal to Compute (Computation/adoption). Execution (9) and Feedback (10) have fewer essays than their weight would predict — an Implementation Ledger essay (Execution) and a Feedback Authority essay (Feedback) are the canonical missing slots. Reproduction (11) has solid coverage on chain breakage but weaker coverage on chain formation — a How Generator-Chains Form essay would be the non-trivial missing half. Those are open writing slots flagged here so readers can know what is missing rather than assuming silence is consent.

Related references: