The Governance Alignment Problem

Why the job of "politician" is structurally broken

Elias Kunnas

Politicians do not fail because of personal moral flaws. They fail because the job description selects for election-winning over outcome-producing. This is a structural alignment problem — the identical formal structure as AI misalignment, running on a different substrate.

I. The AI Parallel

In AI safety, the alignment problem is simple to state: How do you ensure that a powerful optimization process pursues the goals you actually want, rather than the goals it was trained to pursue?

Train an AI to maximize clicks, it learns to show outrage content. Train it to minimize user reports, it learns to hide harmful content from moderators. This is reward hacking: the system optimizes for the metric, not the intent behind the metric. The metric becomes the target, and when a measure becomes a target, it ceases to be a good measure.

We have a governance alignment problem. The job of "politician" is structurally misaligned because it selects for election-winning, not outcome-producing. Politicians reward-hack the electoral system: optimize for votes, not welfare. Like AI alignment, the solution is architectural, not moral.

The Thesis: Politicians aren't misaligned because they're bad people. They're misaligned because the job description selects for the wrong objective function. Blaming politicians for being misaligned is like blaming an AI for optimizing its reward signal.

II. The Job Description Problem

"Politician" as currently constituted has these features:

Feature	What It Selects For
Election-based selection	Popularity, tribal signaling, promise-making
Re-election incentive	Visible short-term actions over invisible long-term outcomes
No skin in game	Externalizing costs to future generations/taxpayers
Career path	Professional election-winners, not temporary stewards
Accountability via voting	Theater of accountability, not outcome-tracking

The job attracts those seeking power without accountability, selects those best at winning elections (orthogonal to governing skill), rewards behaviors misaligned with outcomes, and punishes truth-telling.

This is the job description. These aren't bugs—they're features of how the role is currently designed.

III. Selection Pressure, Not Moral Failure

This is a claim about selection pressure, not character.

Imagine two candidates:

Candidate A: "The evidence shows spending increases don't improve educational outcomes. We need structural reform that will upset unions and take years to show results."

Candidate B: "Our children deserve better! I will fight for education funding because I care about the future!"

B wins. A is tagged as "anti-education," "doesn't care about children," "extreme."

Over electoral cycles, the system selects out honest truth-tellers. The surviving politicians converge on three types:

Cynics who know but don't say
Self-deceived who genuinely believe the popular thing
Ignorant who never learned otherwise

The pool converges on misalignment regardless of individual intentions. Replace every politician today, and in two electoral cycles you'll have the same distribution. The architecture produces the outcome.

IV. The Dual Principal-Agent Catastrophe

The standard framing: politicians are agents of voters, but optimize for re-election instead of voter welfare. This understates the problem. There's no real principal-agent relationship at all—just self-interest at every level:

Public Good (terminal principal) — unrepresented; no agent
↓ voters optimize for selves, not society
Voters (supposed intermediate principal) — but actually self-interested actors
↓ politicians optimize for selves; votes are a resource
Politicians (supposed agent) — but actually self-interested actors

Even if politicians perfectly served voters, voters themselves are misaligned with the terminal principal—societal flourishing in deep time. In aggregate, voters optimize for short-term personal benefit, tribal satisfaction, and visible signals. Most can't model second-order effects. Many value belonging over truth. Hyperbolic discounting is the human default. Caplan calls this "democracy as a commons": each voter externalizes the cost of their irrational beliefs onto society. Individual voters vary, but the distribution is what matters for aggregate outcomes.

The terminal principal—public good, including future generations—has no mechanism to express itself. Nobody measures whether laws actually produce flourishing. "Voters" isn't even a fixed entity—it's whatever coalition wins tomorrow. The system has no actual principal. Just self-interested actors competing for resources, with the public good as rhetorical cover.

The voter-politician link diverges systematically:

Median Voter Wants	Public Good Requires
Benefits now	Fiscal sustainability
Simple tribal signals	Complex policy trade-offs
Visible spending	Efficient invisible outcomes
Blame for out-group	Honest causal analysis
Promises	Constraints

A politician who optimizes for the terminal principal loses to one who optimizes for the intermediate principal. The system selects them out. Only misaligned agents survive.

Selection Pressure and the Laundering Horizon

The political cycle (~4 years) is shorter than most policy feedback loops (fiscal: decades; demographic: generations; institutional: variable). This gap is the Laundering Horizon: any policy with feedback loop longer than the political cycle cannot be managed through electoral accountability. Costs that arrive after the accountability window closes can't generate accountability—the politician is already re-elected or retired.

Selection pressure guarantees exploitation. A politician who says "this feels good now but will hurt us in 15 years" loses to one who says "this helps us now." The honest one is selected out. It's not that politicians are short-sighted—it's that short-sighted politicians outcompete long-sighted ones.

Temporal laundering is the clearest case, but not the only one. Spatial laundering: concentrate benefits on voting coalition, diffuse costs to non-voters (other jurisdictions, future generations, diffuse taxpayers). Politicians who serve their voters at expense of non-voters outcompete those who consider all affected. Causal laundering: complex policy → many intermediaries → "the economy is struggling because of demographics / globalization / COVID"—pick your excuse. Politicians who create plausible deniability outcompete those who take clear responsibility.

All three reinforce each other. Pension promises have temporal (costs in 30 years) + spatial (costs fall on future workers who can't vote yet) + causal (complex actuarial reality no one can evaluate) laundering simultaneously. The dimensions multiply. See Complexity Laundering for the full analysis.

Campaign promises compound this: voters evaluate input (promise) rather than output (result). This is the operationalization gap—the inevitable distance between a value ("good governance") and its measurable surrogate ("campaign promises"). Did the politician deliver? "Circumstances changed," "opposition blocked it," "partial progress." Complexity launders non-delivery. A feasible promise loses to an appealing infeasible one because voters can't evaluate feasibility at promise-time. Evaluation happens at the wrong point in the causal chain.

Since nobody represents the terminal principal (public good in deep time), nobody is optimizing for long-term outcomes. The selection pressure toward laundering is the inevitable consequence of a system with no actual principal—self-interested actors optimizing for the visible window, with no one holding the long-term position.

V. The Democratic Ratchet

The Democratic Ratchet (obligations growing faster than capacity to fund them) is downstream of politician misalignment:

Politicians need votes
Votes are bought with promises
Promises create obligations
Obligations compound (no electoral reward for reducing them)
Obligations eventually exceed capacity
System collapses or inflates away the debt

The ratchet turns because politicians are rewarded for promising, not for fiscal prudence. A politician who says "no" loses to one who says "yes." Every election, the ratchet clicks forward. It rarely clicks back—and when it does (Clinton's welfare reform, Sweden's pension reform), it requires crisis or grand bargain, not normal politics.

This isn't a prediction—it's observed in nearly every democracy without hard constraints. Entitlement spending as percentage of GDP ratchets upward. Normal electoral mechanisms don't reverse it.

VI. Borrowed Coherence

If the architecture is so broken, why did things seem to work before?

The politician misalignment was ALWAYS present. What changed is the external conditions that masked it.

During the Cold War (1945-1991): External threat (USSR) provided a forcing function. Survival pressure demanded unified elite response. The threat did the ALIGNMENT WORK that correct architecture should have done—politicians could be misaligned, but external pressure produced coherent outcomes anyway.

During growth phases (1945-1973, 1991-2008): Economic growth masked fiscal dysfunction. Surplus fed the ratchet without immediate pain. Dysfunction accumulated but wasn't visible.

Post-2008: External forcing function removed (no existential threat). Growth that masked extraction ended. Accumulated dysfunction became visible. Borrowed coherence expired.

Implication: You cannot restore past success by recreating past policies. The success was never due to policy architecture—it was due to external conditions that no longer exist. The only path forward is to fix the actual architecture.

VII. Why Arguments Don't Fix It

The naive theory of change: Better analysis → people learn → demand change → politicians respond → reform happens.

Why this fails at every step:

Step	Failure Mode
Better analysis	Contained in niche fields, doesn't propagate
People learn	Mode 1 communication—tribal signals, not information
Demand change	Demand is for tribal satisfaction, not outcomes
Politicians respond	Respond to electoral incentive, not truth
Reform happens	Only toothless reforms pass; threatening ones get blocked

Power structures don't reform themselves against their own interest. Reforms that threaten power get blocked. Reforms that pass are precisely those that don't matter.

This is why activism feels like change but produces none. The system allows simulation of reform while blocking actual reform. The feeling of agency is the mechanism by which agency is prevented.

VIII. What Doesn't Work

You cannot fix structural misalignment by:

Electing better people — Selection pressure produces the same type
Providing better information — Mode 1 communication doesn't transmit
Moral appeals — Tribal signaling dominates
Expert committees — Advice without power is ignored
Reforming politician incentives — Who implements the reform? The agents whose power it would constrain.

This is the meta-problem: asking politicians to fix politician misalignment is asking an AI to modify its own reward function. The misaligned agents control the reform process.

IX. What Might Work

The only paths that historically produce structural reform bypass the misaligned agents entirely:

Hard constraints via direct democracy: Switzerland's debt brake is constitutional, validated by 85% referendum. Politicians can't evade it because it's not under their control. Go directly to voters for fiscal rules, bypassing captured representatives.
External imposition: EU fiscal rules, IMF conditionality. Supranational bodies mandate what domestic politics cannot. Crude but real.
Crisis: When things get bad enough, reform becomes possible. Dangerous—by then, capacity for reform may be depleted.
Competitive pressure: Jurisdictions that fix the problem outperform those that don't. Exit forces adaptation.
Parallel institutions: Build the Fourth Branch outside existing structure. Let it acquire authority through demonstrated accuracy, not through reform of the existing system.

The common thread: don't reform politicians. Constrain them, bypass them, or build around them.

All paths require the existing system to fail visibly enough that alternatives become thinkable. See When Does Reform Actually Happen? for the historical pattern.

Key Takeaways

The governance alignment problem: The job of "politician" selects for election-winning, not outcome-producing. Over electoral cycles, selection pressure produces convergence on misalignment regardless of individual intentions.
No actual principal: Politicians optimize for themselves; serving voters is instrumental. Voters optimize for themselves, not public good. The terminal principal—societal flourishing—has no agent. Self-interest at every level.
The Laundering Horizon: The political cycle (~4 years) is shorter than most policy feedback loops. Temporal, spatial, and causal laundering all create selection pressure favoring politicians who exploit them. The Democratic Ratchet is downstream: obligations compound, no electoral mechanism reverses them.
Past success was borrowed coherence: External conditions (threats, growth) masked misalignment. Those conditions ended. The architecture was always broken.
Bypass, constrain, or build around: You can't reform misaligned agents through the process they control. The only paths: hard constraints via direct democracy, external imposition, crisis, competitive pressure, or parallel institutions.

Related reading:

Complexity Laundering — How complexity protects the misalignment from visibility
When Does Reform Actually Happen? — Five vectors that historically break structural capture
Simulated Metamorphosis — Why activism feels like change but produces none
The Fourth Branch — The institutional architecture that could fix it
The Axiological Malthusian Trap — The civilizational pattern this instantiates
Ethics Is an Engineering Problem — The architecture-over-disposition principle

Prior Art: Academic Literature

On voter misalignment:

Bryan Caplan, The Myth of the Rational Voter — "Democracy is a commons": voters externalize the cost of irrational beliefs onto society
Ilya Somin on political ignorance — Voters choose based on team allegiances, not informed evaluation
Achen & Bartels, Democracy for Realists — Voters respond to recent events, not policy performance

On the AI-governance isomorphism:

Stanford Law: "Aligning AI Agents with Humans through Law as Information" — Law as reward function; legal systems as alignment mechanisms
Carlos Perez: "The Hidden Architecture of Civilizational Dysfunction" — Reward hacking as explanation for institutional failure (schools optimize for tests, politics for votes)

The argument in four sentences: The electoral cycle is shorter than the feedback loops of most critical policies, creating a Laundering Horizon where politicians externalize costs to future generations without consequences. Because no one structurally represents the long-term public good, selection pressure ensures that politicians who exploit this gap outcompete those who tell hard truths. This cannot be fixed by electing better people — replace every politician today, and in two electoral cycles the same distribution returns. The only effective interventions are exogenous: hard constitutional constraints, parallel institutions, or bypass mechanisms that alter the incentive landscape from outside the system it corrupts.

On public choice and political incentives:

Buchanan & Tullock, The Calculus of Consent (1962) — Politicians as self-interested agents; "minimax constitution" design philosophy
Buchanan JM, Brennan G. The Reason of Rules: Constitutional Political Economy. Cambridge University Press, 1985. — "Economize on virtue": design rules assuming rational self-interest. Shift focus from training good players to building good games. The companion text to Calculus, formalizing why constitutional architecture must assume the worst-case agent.

On the democratic ratchet:

IDN: "Europe's Rearmament: A Test of Democracy" — "Crises expand the state, and peace does not fully reverse the change"; policies intended as temporary become permanent

On structural misalignment:

Taylor & Francis: "Ghosted by politics?" — Empirical study showing structural misalignment between party agendas and younger citizens' priorities in Italy