Implementation Ledger

Response duty without execution trace becomes theatre.

Elias Kunnas

An accepted institutional response becomes operationally real only when closure is conditional: the item has a stable execution record, a predeclared verification rule, and a preauthorized re-entry path when verification fails. The implementation ledger is the record structure that holds this together — decision object, owner, resource path, deadline, execution state, verification event, reopening trigger. Without it, “accepted” becomes a dead letter and response duty stops short of execution. The diagnostic is not whether a tracker exists. It is whether the item can be cheaply closed.

I. The yes that changes nothing

The United Kingdom National Audit Office maintains a recommendations tracker covering every value-for-money report it has published since 1 April 2019. For each recommendation, an NAO team asks the audited body — a department, agency, or arm’s-length body — to report acceptance status, implementation status, the expected or actual implementation date, and supporting evidence; the audited body provides the response; the NAO records its own confirmation column. The tracker exposes status and date fields and NAO confirmation; the underlying evidence is held by the NAO rather than published on the tracker.

This is what a partial implementation ledger looks like in live institutional practice. Decision objects exist as numbered recommendations. Owners are named bodies. Deadlines are dated. Execution state is updated by the audited body and reviewed by the NAO. Verification is partly external. NAO is close enough to show the primitive, and partial enough to show what is missing.

Most accepted institutional responses do not have even this much. A government accepts a public-inquiry recommendation in principle and convenes a working group. A regulator commits to publish guidance. A minister tells a select committee a department will review its procedures. A council adopts a strategy. Six months later, the meetings continue, the strategy sits on the website, the minister has moved portfolio. No official has changed the process, the budget, the owner, the deadline, or the service state.

Often the question is not whether the feedback had authority. By feedback authority — the subject of Feedback Authority — this corpus means the power of an audit, court, committee, regulator, inquiry, or consultation to make non-response costly. The receiver can owe an answer, give an answer, and still leave the accepted item outside the execution system. No record assigns the answer to a delivery owner, a resource path, a deadline, a verifier, and a re-entry forum. That is the implementation ledger.

II. What an implementation ledger is

The field list only works when three load-bearing parts are present.

Acceptance. The institutional act that creates the execution duty. The yes must be determinate — a specific recommendation, a court order, a regulatory undertaking, a treaty obligation, a statutory commencement duty. “We agree in principle with the spirit of the report” is not acceptance. “Recommendation 14 accepted; implementation by Q3 2026” is acceptance.

Transition record. The institution opens a record that assigns the accepted decision to an owner, resources, a deadline, an execution state, a verifier, and a reopening rule. Seven fields show that the transition has happened:

Decision object — what was accepted, as a determinate item.
Owner — a named role accountable for delivery.
Resource path — the budget, staff, system access, or authority that supports execution.
Deadline — the next observable state change due, with date.
Execution state — a value from a closed set: not started, in progress, blocked, completed, superseded, overdue.
Verification event — who checks whether the state value matches reality, with date.
Reopening trigger — the observation that obliges a named forum to take the item up again.

Re-entry condition. When verification fails, the ledger sends the item back to a forum that can change the decision, budget, order, or assignment. The re-entry need not be software-automatic or self-executing in law. It must be preauthorized: the path is named when the item enters the ledger, not assembled by fresh political will after failure.

Owner, deadline, and execution state already appear in project-management systems. The portable claim is the three-part sequence: Acceptance → Transition Record → Re-entry. These fields are evidence the transition record exists; conditional closure is the mechanism.

In this corpus, a movement test is the checkable observation that proves a response did not disappear into process. The ledger is the administrative record where the movement test is stored, verified, and acted on. The Stack anchors the term at the execution layer.

III. Conditional closure

A project plan closes when the project manager marks the task done. A performance dashboard closes when the indicator hits the target. A consent decree closes when the court accepts demonstrated compliance and lifts the decree. The first two are tracker-shaped; the third is ledger-shaped. The difference is who controls closure, and what happens when verification fails.

Closure binds only when the delivery owner cannot close the item alone. Three rules block the standard New Public Management failure where the delivery owner redefines success and closes the item:

Independent verification. The check is performed by a body other than the delivery owner or its direct reporting chain. NAO teams sit outside the audited body and record their own confirmation; court-appointed monitors are independent of police departments; OGP Independent Reporting Mechanism researchers are independent of member governments. When the verifier sits downstream of the delivery owner, the ledger collapses into self-reporting.
Closure rule fixed before closure. “Completed” has evidentiary criteria stated when the item enters the ledger, not assembled to fit performance after the fact. A consent-decree compliance paragraph specifies what counts as adequate; the police department cannot redefine “implemented” to match what it did.
Preauthorized re-entry on verification failure. A failed verification triggers a return path named when the item entered the ledger. The path can be hard (a court hearing that the decree’s terms compel) or softer (a parliamentary committee recall, a ministerial board review, an annual report to Parliament). The closure decision moves away from the delivery owner.

These rules describe what makes a ledger resist gaming. They do not describe a guarantee. Bevan and Hood’s study of NHS performance targets is the canonical record of how indicator-driven systems were corrupted under political pressure: thin verification, inflated exception discipline, redefinitions of completion. A ledger cannot make delivery owners honest; it shows whether verification, exceptions, and definitions can be inspected before closure. The ledger’s contribution is the institutional sequence that makes closure conditional and makes failed verification return to an authorized forum — not the field list itself.

IV. Ex-ante discriminator

A reader should be able to classify the record before seeing whether the institution delivered. Seven ex-ante tests separate ledger from tracker, dashboard, or theatre.

Item-level trace. Each accepted decision has a stable identifier and cannot vanish into aggregate reporting.
Named control seat and resource path. A role exists that can alter resources, priority, or assignment — not merely report status — and the item identifies the budget, staff, system access, or authority that supports delivery.
Independent verification. Completion is checked by someone other than the delivery owner or their direct reporting chain.
Closure rule. “Completed” has evidentiary criteria fixed before closure.
Preauthorized re-entry rule. Failed verification has a named return path — a court hearing, an oversight committee recall, a ministerial board review, a binding scrutiny duty — not a fresh political negotiation.
Exception register. Missed deadlines and supersessions remain visible as misses, not erased through reclassification.
Auditability over time. Old states remain inspectable, including changed deadlines, owners, and status revisions.

Use the failed test to name the artifact:

Lacks a preauthorized re-entry rule → tracker.
Lacks independent verification → self-reporting.
Lacks item stability → dashboard.
Lacks control authority or resource path → theatre, or a mandate gap: a decision issued from a seat that never had the operational capacity to compute or deliver it (see Mandate Gap).

The test asks whether the rule exists before failure, not whether it later fires. The test observes whether the architecture names a control seat and resource path; later audit may still reveal resource fiction. An auditor can classify the record ex ante from its structure, and can predict where execution can disappear without formal contradiction.

V. Ledger vs queue vs dashboard vs promise

Artifact	What it does	What it cannot do
Announcement	Creates narrative	Bind anything
Working group	Creates process	Convert process to state change
Action plan	Names intentions	Verify or reopen
Dashboard	Displays current state	Force re-entry on verification failure
Queue	Orders ongoing-obligation work	Apply cleanly to one-shot accepted decisions
Implementation Ledger	Holds accepted decisions under conditional closure and re-entry discipline	Prevent verification capture, resource fiction, exception inflation, or political override by itself

A dashboard shows state. A queue orders work. A ledger conditions closure.

The closest sibling in the corpus is Invisible Work Queues. Queues fit recurring statutory obligations — public-records requests, asylum determinations, hospital referrals, procurement complaints — because the items share shape and can be aged uniformly. The queue makes recurring obligations operational. The ledger makes accepted corrections operational. An audit recommendation requires its own owner, its own resource path, its own verification; one consent-decree paragraph is not interchangeable with another; an inquiry recommendation depends on what the inquiry chair found. Queue language fails because one-shot accepted transformations do not share a uniform processing path; ledger language keeps each accepted decision as its own record.

VI. Ledger failure modes

Each pathology maps to one of the three load-bearing parts.

Acceptance gaming. “Accepted in principle” without entering the record. Aggregation burial — individual items disappear into completion rates. Reclassification as superseded — items quietly moved to a non-active state.

Transition gaming. Ownerless item: “the administration will do it.” Resource fiction: budget shown as committed; actual delivery unfunded. Owner laundering: accountability shifted to a non-existent or restructured role. Deadline churn: dates slip each cycle without the exception register growing. State opacity: indefinitely “in progress.”

Re-entry gaming. Verification capture: the verifier sits downstream of the delivery owner. Exception inflation: every miss becomes “context”; the exception register grows faster than the item population. Powerless re-entry forum: failed verification returns to a body that cannot alter resources, priority, or assignment. False completion: verification rubber-stamped by routine.

The pathologies are exactly what the ex-ante discriminator in §IV tests for. The NHS target regimes described by Bevan and Hood failed because independent verification was thin, exception discipline was inflated, and definitions were revised under pressure. The Audit Society (Power 1997) names the broader pattern where ritual verification absorbs the function it was meant to discipline. Political and organisational pressure can corrupt the ledger. The ledger does not escape these forces; it exposes them by making the structural conditions checkable before the outcome is known.

VII. Prior art

Prior work already supplies the parts. Implementation theory names the gap between statutory specification and execution. Delivery units name stocktake rhythm. Audit follow-up names recommendation tracking. Consent decrees install conditional closure through courts. The Open Government Partnership’s Independent Reporting Mechanism verifies commitments under soft-law conditions. Implementation science measures adoption and fidelity. Project management tracks owners, deadlines, and status. The principal-agent literature names the structure of the monitoring problem. The new move is packaging these into one cross-domain audit test: Acceptance → Transition Record → Re-entry, with closure conditional on independent verification. The same test grades audit recommendations, consent decrees, inquiry dashboards, soft-law commitments, and non-ledger boundary cases before outcomes are known. The full bibliography sits in the Sources note.

Specimens vary by the strength of the re-entry path:

Hard re-entry — court supervision. US Department of Justice pattern-or-practice consent decrees. The Chicago Police Department consent decree was approved by the federal court on 31 January 2019 with the independent monitor appointed 1 March 2019; the monitor must issue semiannual reports until the decree ends, after sustained full-and-effective compliance has been demonstrated for one to two years per paragraph. The court holds public hearings; failed compliance keeps the decree open. The Ferguson consent decree of 19 April 2016 and the August 2015 Los Angeles County jails court-enforceable settlement agreement use the same architecture.
Soft re-entry — parliamentary or executive scrutiny. UK audit follow-up sits here. HM Treasury publishes Treasury Minutes responses to Public Accounts Committee reports and issues twice-yearly progress reports; the PAC uses them for scrutiny and can recall accounting officers. NAO recommendations follow-up informs future audit work and parliamentary attention without mechanically compelling a new binding decision forum. The US Government Accountability Office maintains an Open Recommendations Database; 31 U.S.C. § 720 requires agency heads to submit written statements on action taken or planned, with statutory timing for the response.
Weak re-entry — published verification, soft sanction. The Open Government Partnership is a voluntary international process in which member governments publish action-plan commitments; the Independent Reporting Mechanism (IRM) assesses completion against categories such as no evidence available, not started, limited, substantial, and complete, verified against evidence. The exposure is reputational; the resource path is thin.
No re-entry — visible artifact without closure discipline. Action plans, working groups, dashboards displayed without an owner who could change anything if the dashboard reads red.

The categories are not moral grades. They identify how much fresh political will is needed after a verification failure: little in the hard case, where the court remains seized; an oversight forum’s recall in the soft case; only reputational exposure in the weak case; nothing in the no-re-entry case, where failure can sit on a dashboard indefinitely.

The Grenfell Tower Inquiry investigated the 2017 fatal fire at the West London tower block. Phase 2 of the inquiry’s report was published on 4 September 2024 and made 58 recommendations. The government’s follow-up architecture includes public progress reporting, a public record of recommendations, a cross-government ministerial implementation board, and parliamentary updates. This is a partial ledger: item-level trace and public reporting exist; the binding force when verification fails remains the open question.

Aegis Ashore is not a clean implementation-ledger specimen. Mandate Gap treats it as a lifecycle case. It is a boundary case here: the operational mandate had not been computed before closure, and the later cancellation was post-outcome correction rather than failed ledger re-entry. No implementation ledger existed between the Cabinet approval and the cancellation, because the operational mandate was missing before any ledger structure could fail. The case is the upstream sibling failure mode.

VIII. Relation to the corpus

The receiver-side bureaucracy corridor:

The Procedural Object — sender-side admissibility test; the artifact must compile into something the route can carry.
Feedback Authority — graded cost the receiver bears for non-response.
Implementation Ledger (this essay) — accepted response becomes traceable execution under conditional closure.
Theatrical Accountability — the failure mode when the ledger is replaced by ritual.

Diagonal connections:

Invisible Work Queues — queue-shaped specialisation for recurring statutory obligations; the queue and the ledger are complementary primitives.
The Mandate Gap — pre-closure sibling. The mandate-gap diagnostic names the failure where the seat lacks capacity to compute the decision; the ledger diagnostic names the failure where, even after a substantive decision exists, no record converts it to execution.
Powerless Intelligence — the conversion-capacity primitive (authority × resource × answerability) is the precondition for any ledger that binds. The ledger does not bring its own resource path with it.

IX. Wrong-repair warning

Repair the closure rule, not the display layer.

When execution disappears, reformers usually add visible artifacts: a new dashboard, an action plan, a coordinating committee, a follow-up audit, a public statement. These repairs fail when the missing part is the owner, verifier, resource path, or re-entry forum — not the visibility. A dashboard without a ledger displays state; the dashboard owner has no control over closure. A report without a ledger names intentions; the accepted decisions remain unassigned, unfunded, unchecked, or overdue. A coordinating committee without a ledger creates process without owning items. An audit without a ledger produces findings without execution-tracking.

Before reforming, identify which closure condition failed. The ex-ante discriminator in §IV separates visible-artifact failures from closure-structure failures. If all seven tests pass, use the existing court, committee, board, or audit process to force the named next step. If one or more fail, the repair is not to add a status column; it is to restore the missing closure condition — independent verification, fixed completion criteria, control-capable ownership, or preauthorized re-entry.

When the item lacks budget, staff, authority, or system access, the ledger cannot repair delivery by itself. Powerless Intelligence names this failure at the conversion-capacity layer. Building the ledger structure without funding the items it tracks produces a worse failure than no ledger, because the tracker lets the body cite process as proof of delivery.

X. Close

A claim first has to enter the institution’s route. Then the receiver owes a response. After acceptance, the ledger decides whether the response becomes execution. Where the record is absent, the response stays at the level of language. Where the record exists but verification is captured, the resource path is fictional, or re-entry returns to a powerless forum, the ledger fails in one of the named pathologies. The repair is to build the structural conditions for conditional closure: independent verification, fixed closure rules, preauthorized re-entry on failure.

An institution implements the items it enters into a record it cannot cheaply close.

Truth alone does not bind. Acceptance alone does not implement. A response becomes operational when a record makes closure conditional: verified items close, failed items return to a forum that can act.

Related:

The Procedural Object — sender-side admissibility; the artifact must compile into something the route can carry.
Feedback Authority — graded cost the receiver bears for non-response.
Invisible Work Queues — queue-shaped specialisation for recurring statutory obligations.
Powerless Intelligence — authority × resource × answerability as conversion-capacity precondition.
The Mandate Gap — pre-closure sibling; the seat lacks operational capacity to compute the decision.
Theatrical Accountability — failure form when the ledger is replaced by ritual.
The Stack — Layer 9 (Execution); this essay fills the open slot.
What Bureaucracy Is — the positive synthesis: bureaucracy as the runtime of traceable discretion, of which implementation ledgers supply the remember-and-close functions.

Sources and Notes

Canonical definition. An implementation ledger is a transition record that holds an accepted institutional decision under conditional closure: the item can close only after verification against predeclared criteria, and failed verification returns it to a forum able to alter resources, priority, assignment, exception, or sanction.

Seven fields and the three-part primitive. The fields (decision object, owner, resource path, deadline, execution state, verification event, reopening trigger) evidence the transition record. The portable primitive is the three-part sequence: Acceptance → Transition Record → Re-entry. The components are not new; the portable primitive is the claim.

Implementation gap. Jeffrey Pressman and Aaron Wildavsky, Implementation: How Great Expectations in Washington Are Dashed in Oakland; or, Why It’s Amazing that Federal Programs Work at All (University of California Press, 1973). Paul Sabatier and Daniel Mazmanian, “The Conditions of Effective Implementation: A Guide to Accomplishing Policy Objectives,” Policy Analysis 5:4 (1979). Richard Matland, “Synthesizing the Implementation Literature: The Ambiguity-Conflict Model of Policy Implementation,” Journal of Public Administration Research and Theory 5:2 (1995). Michael Lipsky, Street-Level Bureaucracy: Dilemmas of the Individual in Public Services (Russell Sage Foundation, 1980). Implementation theory names the gap between statutory specification and on-the-ground execution; the ledger is one specific translation point at which the gap becomes invisible to the institution itself.

Delivery and routine review. Michael Barber, Instruction to Deliver (Politico’s, 2007). Michael Barber with Andy Moffit and Paul Kihn, Deliverology 101 (Corwin, 2011). The UK Prime Minister’s Delivery Unit operated 2001–2010. Robert Behn’s PerformanceStat literature documents CitiStat and StateStat. Delivery units operate on executive-priority programs; the ledger primitive applies to accepted institutional responses outside ministerial priority lists.

Audit follow-up architectures. UK National Audit Office recommendations tracker, nao.org.uk/recommendations-tracker; covers reports published since 1 April 2019; the tracker exposes status and date fields with NAO confirmation. NAO Freedom of Information response (FOI-1694, June 2024) describes the spreadsheet process by which NAO teams collect acceptance status, implementation status, expected/actual implementation date, and evidence from audited bodies. HM Treasury publishes Treasury Minutes responses to Public Accounts Committee reports and twice-yearly progress reports; PAC uses these for scrutiny and may recall accounting officers (gov.uk/government/collections/treasury-minutes-progress-on-implementing-recommendations-of-public-accounts-committee). US Government Accountability Office Open Recommendations Database (gao.gov/about/what-gao-does/recommendations); 31 U.S.C. § 720 requires agency heads to submit written statements on action taken. INTOSAI follow-up standards (ISSAI series).

Independent monitoring under court supervision. US Department of Justice Civil Rights Division pattern-or-practice consent decrees. Chicago Police Department consent decree: court approval 31 January 2019; independent monitor appointed 1 March 2019; semiannual monitor reports continue until the decree ends after sustained full-and-effective compliance has been demonstrated for one to two years per requirement (cpdmonitoringteam.com/reports-information; chicagopoliceconsentdecree.org). Ferguson consent decree approved 19 April 2016 (clearinghouse.net/case/14081). Los Angeles County jails court-enforceable settlement agreement filed August 2015 (justice.gov/usao-cdca/pr/justice-department-reaches-agreement-los-angeles-county-implement-sweeping-reforms).

Soft-law verification. Open Government Partnership Independent Reporting Mechanism: action-plan commitments enter as decision objects; member states report; IRM researchers verify and code completion levels (not started, limited, substantial, complete, withdrawn) against evidence (opengovpartnership.org/data-dashboard; opengovpartnership.org/irm-guidance-overview).

Public-inquiry recommendation tracking. UK Grenfell Tower Inquiry Phase 2 report published 4 September 2024 with 58 recommendations (gov.uk/government/publications/publication-of-the-grenfell-tower-inquiry-phase-2-report). Government response commits to a public record of public-inquiry recommendations, quarterly Grenfell progress reports from June 2025, a cross-government ministerial implementation board, and an annual update to Parliament (gov.uk/government/publications/grenfell-tower-inquiry-phase-2-report-government-response). The Phase 2 recommendations dashboard records each recommendation, government response, delivery status, and progress updates (gov.uk/government/publications/grenfell-tower-inquiry-phase-2-recommendations).

Implementation outcomes (healthcare). Laura J. Damschroder et al., “Fostering Implementation of Health Services Research Findings into Practice: A Consolidated Framework for Advancing Implementation Science,” Implementation Science 4:50 (2009). Russell E. Glasgow, Thomas M. Vogt, and Shawn M. Boles, “Evaluating the Public Health Impact of Health Promotion Interventions: The RE-AIM Framework,” American Journal of Public Health 89:9 (1999). Enola K. Proctor et al., “Outcomes for Implementation Research: Conceptual Distinctions, Measurement Challenges, and Research Agenda,” Administration and Policy in Mental Health 38:2 (2011).

Performance budgeting and NPM critique. Christopher Hood, “A Public Management for All Seasons?” Public Administration 69:1 (1991). Gwyn Bevan and Christopher Hood, “What’s Measured Is What Matters: Targets and Gaming in the English Public Health Care System,” Public Administration 84:3 (2006) — the canonical study of measurement gaming in NPM regimes. Michael Power, The Audit Society: Rituals of Verification (Oxford University Press, 1997). United States Government Performance and Results Act 1993 (P.L. 103-62, enacted 3 August 1993). GPRA Modernization Act (P.L. 111-352, enacted 4 January 2011).

Principal-agent and residual control. Bengt Holmström, “Moral Hazard and Observability,” Bell Journal of Economics 10:1 (1979). Oliver Hart, Firms, Contracts, and Financial Structure (Oxford University Press, 1995). Michael C. Jensen and William H. Meckling, “Theory of the Firm: Managerial Behavior, Agency Costs and Ownership Structure,” Journal of Financial Economics 3:4 (1976). The economics names the structure of the problem; the ledger is the administrative record where the monitor’s signal changes what the institution must do next.

Aegis Ashore. Japan’s Cabinet approved introduction of the Aegis Ashore system in December 2017; budgetary implementation required subsequent parliamentary approval. Defence Minister Kono announced suspension on 15 June 2020; the National Security Council formalised cancellation later in June 2020. The proximate cause was the SM-3 Block IIA booster-fall safety problem; reported estimates of the fix have ranged around ¥200 billion and roughly a decade or more of additional development. The case is treated in this essay as a boundary specimen for Mandate Gap rather than for Implementation Ledger.

Movement test. A checkable later observation that distinguishes activity from result — the observable state-change that would prove a response did not disappear into process. The Stack anchors the term at the execution layer; Constructive Diagnosis uses it as the sixth field of the diagnosis standard; The Procedural Object §VII uses it for sender-side traceable consequence; Powerless Intelligence §VI uses it as the public after-the-action test.

Status of the partition. The ledger / queue / dashboard / promise / working-group typology is a working compression. The categories overlap in live practice: the NAO tracker has queue properties; consent-decree monitoring has dashboard properties; OGP IRM has ledger properties. Use the categories as tests, not mutually exclusive species. The grade of any specific artifact is empirically testable via the seven ex-ante tests in §IV.