The DAO Proving Ground

An Engineering Approach to Building Aligned Organizations


In a previous post, I argued that our society is facing two alignment problems, not one. The first is the distant (though less so every day) risk of a misaligned superintelligence. The second is the present, catastrophic harm being caused by misaligned corporations (climate change, inequality/poverty, …). The crucial link is this:

Our ongoing failure to align corporations with the public good is a live demonstration of our inability to solve alignment problems.

If we cannot trust a modern corporation—an entity whose legal structure often compels it to prioritize profit above all else[1]—to not harm society, how can we possibly trust it to build a safe and beneficial Artificial General Intelligence? The premise is absurd.

This isn’t a problem we can ignore. We need a new type of organization, one built from the ground up for transparency, accountability, and alignment with the public good. Before we can align AI, we must first engineer aligned organizations. This is the proving ground.

The Democratic Foundation

But what does it mean to be “aligned with the public good”? Who defines what the public good actually is? This question cuts to the heart of political philosophy, and our answer shapes everything that follows.

The only people who can define the public good are the public.

Consider the alternatives: Should corporations define the public good? They optimize for profit. Should governments? They optimize for political survival. Should experts? They optimize for their own theories and reputations. Should AI systems? They optimize for whatever objectives we give them—which brings us back to the original question of who decides.

The democratic answer is not that the public is always right, but that the public is the only legitimate source of authority over itself. This commitment shapes how we approach organizational alignment: we seek systems that faithfully execute collective will, not systems that claim to know better than their members.

This new organization must be built on a foundation of verifiable rules and radical transparency—its operations must be auditable by its members in real-time.

DAOs: A New Hope for Alignment

A promising candidate for this new type of organization is the Decentralized Autonomous Organization (DAO)[2]. Unlike a traditional corporation, which operates through opaque hierarchies and private ledgers, a DAO operates on a blockchain. Its rules are encoded in smart contracts, its treasury is fully transparent, and its decisions are made by its community members.

This structure offers a potential solution to the alignment problem. But to explore that, we first need a formal definition of alignment in this context. This challenge is at the heart of social choice theory: how to aggregate individual preferences into a single, desirable collective outcome. For our purposes, we can define it as follows:

Organizational Alignment: An organization is aligned with its members to the degree that its actions and state minimize the deviation from the collective, aggregate beliefs of those members.

In simple terms, an aligned organization does what its members actually want it to do. This directly parallels the AI alignment problem: we want AI systems to do what humanity collectively wants them to do. Both DAOs and AI systems are engineered artifacts where we can design the decision-making process. The DAO serves as a proving ground—if we can’t align a transparent system governed by simple voting rules, how can we hope to align more complex AI systems governed by neural networks and reward functions?

A simple DAO might be a collective investment fund. Members contribute capital, receive voting tokens, and then vote on proposals like “Invest 10% of the treasury in Project X.” The rules are clear, the treasury is visible to all, and the voting mechanism (e.g., 1-token-1-vote, quadratic voting) is programmed into the DAO’s code. More advanced DAOs use these same principles to govern everything from software protocols to scientific research grants, often incorporating sophisticated mechanisms like delegating votes to trusted experts.
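To make the mechanics concrete, here is a minimal Python sketch of the baseline tallying rule. The names and numbers are purely illustrative, not any particular DAO’s API:

```python
# Minimal sketch of 1-token-1-vote tallying (illustrative, not a real DAO API).
from dataclasses import dataclass

@dataclass
class Member:
    tokens: int
    vote: bool  # True = 'For', False = 'Against'

def proposal_passes(members: list[Member]) -> bool:
    """A proposal passes if the token weight 'For' exceeds the token weight 'Against'."""
    votes_for = sum(m.tokens for m in members if m.vote)
    votes_against = sum(m.tokens for m in members if not m.vote)
    return votes_for > votes_against

# "Invest 10% of the treasury in Project X."
members = [Member(100, True), Member(60, False), Member(30, False)]
print(proposal_passes(members))  # True: 100 'For' vs. 90 'Against'
```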

But do these systems actually produce alignment? Or do they create new problems of their own? To answer this, we need to move beyond intuition and build a model.

A Framework for Analyzing Governance

To understand if DAOs can solve the alignment problem, we need a way to simulate them—a wind tunnel for governance. We can do this by representing the DAO and its members in a formal, mathematical framework.

The Intuition:

Imagine the “state” of a DAO—everything from its treasury allocation to its strategic priorities—as a single point in a high-dimensional space. To make this concrete, consider an AI safety research DAO where this vector could represent the funding allocated to different research areas (interpretability, robustness, alignment theory), the stringency of publication requirements, the level of collaboration with industry, and the criteria for grant approval.

Every member of the DAO also has a point in this space, representing their ideal state for the organization, reflecting their personal preference for how the DAO should be run. The goal of governance is to move the DAO’s point through this space, via proposals, to a location that is as close as possible to the collective ideal of its members.
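As a concrete (and entirely hypothetical) encoding of the AI safety research DAO above, both the organization’s state and each member’s ideal point are just vectors in the same space:

```python
import numpy as np

# Hypothetical k=4 state vector for the AI safety research DAO described above.
# Dimension names and units are illustrative assumptions, not a prescribed schema.
DIMENSIONS = [
    "interpretability_funding_share",  # fraction of treasury for interpretability
    "robustness_funding_share",        # fraction of treasury for robustness
    "publication_stringency",          # 0 = fully open, 1 = strict review
    "industry_collaboration",          # 0 = none, 1 = deep partnerships
]

S_t = np.array([0.40, 0.30, 0.70, 0.20])  # the DAO's current state
y_i = np.array([0.60, 0.20, 0.90, 0.00])  # one member's ideal state

# This member's misalignment is simply the distance to their ideal point.
print(np.linalg.norm(S_t - y_i))  # ~0.36
```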

A Formal Framework for DAO Governance Simulation

I. Core Components (The “Environment”)

  • Agents ($A$): A set of $n$ agents, $A = \{a_1, a_2, \dots, a_n\}$.
  • State Space ($S$): The state of the DAO is represented by a vector $S_t \in \mathbb{R}^k$ at time $t$. $S_0$ is the initial state.
  • Beliefs ($Y$): Each agent $a_i$ has a static belief vector $y_i \in \mathbb{R}^k$. This represents their ideal state for the DAO.
  • Tokens ($X$): Each agent $a_i$ holds $x_i$ governance tokens. The total number of tokens is $X_{total} = \sum x_i$. The distribution can be unequal (e.g., drawn from a power-law distribution).
  • Proposals ($P$): A set of proposals $P = \{p_1, p_2, \dots, p_m\}$. Each proposal $p_j$ is a delta vector $p_j \in \mathbb{R}^k$, representing a potential change to the DAO’s state.

II. The Governance Process (The “Simulation Loop”)

For each time step $t = 1, 2, …$:

  1. Proposal Generation: A proposal $p_t$ is introduced for voting.
    • Simplification: $p_t$ is drawn randomly from a predefined distribution (e.g., uniform random vectors within a certain magnitude).
  2. Voting: Each agent $a_i$ evaluates $p_t$ and decides to vote ‘For’ or ‘Against’.
    • Agent’s Goal: An agent wants to move the DAO state $S_t$ closer to their belief $y_i$.
    • Decision Heuristic: Agent $a_i$ votes ‘For’ if the proposed state $S_{t+1} = S_t + p_t$ is “better” for them than the current state $S_t$. A simple way to model this is to check if the distance to their belief decreases: \(\|(S_t + p_t) - y_i\| < \|S_t - y_i\|\) (where $\|\cdot\|$ denotes the Euclidean norm).
    • *Note: When the distances are exactly equal ($\|(S_t + p_t) - y_i\| = \|S_t - y_i\|$), we assume the agent abstains or votes against change, maintaining the status quo.*
  3. Tallying: The votes are tallied according to a chosen mechanism.
    • Baseline Mechanism (1-Token-1-Vote):
      • $Votes_{For} = \sum x_i$ for all $a_i$ who voted ‘For’.
      • $Votes_{Against} = \sum x_i$ for all $a_i$ who voted ‘Against’.
    • The proposal $p_t$ is accepted if $Votes_{For} > Votes_{Against}$.
  4. State Update: The DAO’s state is updated based on the outcome.
    • If $p_t$ is accepted, $S_{t+1} = S_t + p_t$.
    • If $p_t$ is rejected, $S_{t+1} = S_t$.
    • Imperfection Model (Imperfect Execution): If accepted, the update could include noise: $S_{t+1} = S_t + p_t + \epsilon$, where $\epsilon$ is a random noise vector.

(Note: This model assumes perfect execution and observation of the state. This is a reasonable starting point because, unlike traditional organizations, DAOs are transparent by default and can execute proposals via smart contract. We will return to these assumptions later.)

  5. Utility/Loss Calculation: Each agent measures their alignment.
    • Individual Loss $L_i(t)$: The misalignment for agent $a_i$ at time $t$ is the distance between the current DAO state and their belief: $L_i(t) = \|S_t - y_i\|$.
    • System-wide Misalignment $L_{total}(t)$: The aggregate loss can be the sum or the average of the individual losses, e.g., $L_{total}(t) = \frac{1}{n} \sum_{i=1}^{n} L_i(t)$.
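Here is a minimal Python sketch of this loop under the baseline mechanism. The proposal distribution (random directions of fixed step size) and the token distribution (Pareto) are illustrative choices, not part of the framework itself:

```python
import numpy as np

rng = np.random.default_rng(0)

def simulate(beliefs, tokens, n_steps=500, step_size=0.1, exec_noise=0.0):
    """Governance loop: random proposals, sincere distance-based voting,
    1-token-1-vote tallying, and (optionally noisy) state updates."""
    n, k = beliefs.shape
    S = np.zeros(k)                                # initial state S_0
    for _ in range(n_steps):
        # 1. Proposal generation: a random delta vector of fixed magnitude.
        p = rng.normal(size=k)
        p *= step_size / np.linalg.norm(p)
        # 2. Voting: agent i votes 'For' iff the new state is strictly closer
        #    to y_i (ties count as 'Against', preserving the status quo).
        d_now = np.linalg.norm(S - beliefs, axis=1)
        d_new = np.linalg.norm((S + p) - beliefs, axis=1)
        votes_for = d_new < d_now
        # 3. Tallying (1-token-1-vote) and 4. state update with optional noise.
        if tokens[votes_for].sum() > tokens[~votes_for].sum():
            S = S + p + rng.normal(scale=exec_noise, size=k)
    # 5. Loss: each agent's misalignment, plus the system-wide average.
    losses = np.linalg.norm(S - beliefs, axis=1)
    return S, losses.mean()

# 50 agents, 3-dimensional state space, unequal (Pareto) token distribution.
beliefs = rng.normal(size=(50, 3))
tokens = rng.pareto(2.0, size=50) + 1.0
S_final, avg_loss = simulate(beliefs, tokens)
print(S_final, avg_loss)
```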

A Laboratory for Governance

This framework creates a computational laboratory where we can rigorously test the dynamics of governance and ask critical questions:

  • Plutocracy: What happens if one agent holds a majority of the tokens? Our model can simulate how this “whale” drags the DAO’s state vector toward their own belief, even when that belief misaligns with the remaining 99% of members (see the sketch after this list).
  • Voter Apathy: We can model scenarios where only a fraction of agents participate in votes. This allows us to measure how apathy amplifies the power of whales and makes the system vulnerable to capture by small, highly-motivated groups.
  • Gridlock: What happens if the community’s beliefs are polarized into two opposing clusters? The model can show how this leads to gridlock, where no proposal can gain a majority, leaving the DAO’s state stuck far from anyone’s ideal point.
  • Efficiency and Stability: How quickly does a DAO converge on a stable state? Is that state robust, or can a small shock (like a controversial new proposal) easily dislodge it and send the system into chaos?
  • Minority Rights: Can a minority group with very strong preferences on a specific issue ever win a vote, or are they always overruled by a largely indifferent majority? This helps test the fairness of different voting systems.
  • Path Dependency: How much does the final state of the DAO depend on the order of the first few proposals? Can a few early decisions lock the organization into a suboptimal trajectory forever?
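Using the `simulate()` sketch from the previous section, the plutocracy question, for example, becomes a few lines of experiment (the numbers below are illustrative):

```python
# Plutocracy experiment, reusing np, rng, and simulate() from the sketch above.
beliefs = rng.normal(size=(100, 3))
beliefs[0] = np.array([10.0, 10.0, 10.0])    # the whale's extreme ideal point
tokens = np.ones(100)
tokens[0] = 101.0                            # the whale holds a strict token majority

S_final, avg_loss = simulate(beliefs, tokens, n_steps=2000)
print(np.linalg.norm(S_final - beliefs[0]))  # shrinks: the state is dragged to the whale
print(avg_loss)                              # stays large: everyone else is misaligned
```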

The Fundamental Limits of Governance

This framework doesn’t just let us test specific mechanisms; it forces us to confront the fundamental limits of any democratic system.

  • A Product of its Voters: The model makes it clear that a DAO’s state is fundamentally a function of its members’ beliefs. You cannot produce an outcome that is alien to the inputs. If the members’ collective beliefs are short-sighted or misguided, the DAO’s actions will be too. This illustrates the “garbage in, garbage out” principle of collective intelligence.

  • No Perfect System: The work of economists like Kenneth Arrow has mathematically proven that no voting system can simultaneously satisfy a set of basic, desirable fairness criteria[3]. Our model allows us to visualize these trade-offs. A system that is decisive and simple (like 1-token-1-vote) might be unfair to minorities and prone to plutocracy. A system that is fairer (like quadratic voting) might be more complex or prone to gridlock. There is no magic bullet, only engineering trade-offs.

Extending the Model to Real-World Complexity

The true power of this framework is its extensibility. We can easily swap out rules and add new mechanisms to see how they affect the outcome.

  • Quadratic Voting (QV): We can change the voting rule from “1-token-1-vote” to a QV model where voting cost is the square of votes cast[4]. The simulation would then show how QV dramatically reduces the power of whales and forces them to build broader consensus, likely leading to a final state that is more representative of the average member’s belief (see the sketch after this list).
  • Sub-DAOs and Staking: We can model advanced, agile governance. A main DAO can vote to fund a “sub-DAO”—a specialized team tasked with a specific goal. To ensure alignment, the sub-DAO’s leader must stake their own tokens as a bond against negligence or fraud. Our model can test the consequences: if the stake is slashed for any failure, leaders become risk-averse and do nothing; if it is slashed only for proven malice, and there are large token bonuses for good performance, we can find a “sweet spot” that encourages both accountability and ambitious, innovative work[5].
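To illustrate how the tally step might be swapped out, here is one common formulation of QV, in which an agent who spends $x$ tokens casts $\sqrt{x}$ effective votes. This is a sketch of that single formulation; real QV deployments differ in how vote budgets and identities are handled:

```python
import numpy as np

def tally_quadratic(tokens, votes_for):
    """One QV formulation: spending x tokens buys sqrt(x) effective votes,
    so influence grows with the square root of holdings rather than linearly.
    (A sketch; real QV designs handle budgets and identity differently.)"""
    weight = np.sqrt(tokens)
    return weight[votes_for].sum() > weight[~votes_for].sum()

# The whale scenario again: 101 tokens against 99 one-token members.
tokens = np.ones(100)
tokens[0] = 101.0
votes_for = np.zeros(100, dtype=bool)
votes_for[0] = True                        # only the whale votes 'For'

print(tally_quadratic(tokens, votes_for))  # False: sqrt(101) ~ 10 < 99
```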

Limitations and Real-World Complications

Our framework, while powerful, makes several simplifying assumptions that deserve scrutiny:

Static Beliefs: We assume each agent’s ideal point $y_i$ remains fixed. In reality, preferences evolve through deliberation, new information, and social influence. A robust governance system must handle this dynamism.

No Strategic Behavior: We assume agents vote sincerely based on their true preferences. Real governance systems face strategic voting, preference falsification, and vote trading. Agents might misrepresent their beliefs to achieve better outcomes.

No Coalitions: Our model treats each agent independently, but real governance involves coalition formation, log-rolling, and political parties. These emergent structures can dramatically alter outcomes.

Perfect Information: We assume all agents perfectly understand each proposal’s effects. In practice, proposals are complex, outcomes uncertain, and attention is limited. This creates opportunities for manipulation through framing and selective disclosure.

Closed System: We model the DAO as isolated from external pressures. Real organizations face regulatory capture, market forces, and adversarial actors attempting to influence governance.

These limitations don’t invalidate our framework—they highlight areas for future extension. Each complication can be formally modeled and studied. The transparency of DAOs makes them ideal laboratories for understanding how these real-world factors affect alignment.

Analysis: The Equilibrium of a 1D DAO

Let’s walk through the logic step-by-step to derive the equilibrium point of our simplified model. This result is a token-weighted version of the famous Median Voter Theorem from political science[6].

1. The Setup (Simplifying the Framework)

  • 1-Dimensional Space ($k=1$): The DAO’s state $S$ is just a single number on a line. The agents’ beliefs $y_i$ are also just numbers.
  • Voting Mechanism: We use the baseline: 1-Token-1-Vote.
  • Proposal: Consider a proposal $p$ to move the state a small amount to the right. So, $p > 0$. The proposed new state is $S + p$.

2. The Agent’s Decision Rule

An agent $a_i$ votes ‘For’ this proposal if the new state is closer to their ideal point $y_i$. Formally:

\(| (S + p) - y_i | < | S - y_i |\)

Since both sides are non-negative, we can square them without changing the inequality:

\(((S + p) - y_i)^2 < (S - y_i)^2\)

Expanding the left-hand side:

\((S - y_i)^2 + 2p(S - y_i) + p^2 < (S - y_i)^2\)

Subtracting $(S - y_i)^2$ from both sides:

\(2p(S - y_i) + p^2 < 0\)

Factoring out $p$:

\(p \cdot (2(S - y_i) + p) < 0\)

Since we assumed $p > 0$, the second factor must be negative:

\(2(S - y_i) + p < 0 \quad \Longleftrightarrow \quad y_i > S + \frac{p}{2}\)

As the step size shrinks ($p \to 0$), this condition reduces to $y_i > S$.

Conclusion of Step 2: An agent votes ‘For’ a small step to the right if and only if their ideal point $y_i$ lies to the right of the current state $S$. Intuitively, this makes sense: they want the DAO to move towards them.

3. The Tallying Rule

The proposal $p$ passes if the sum of token weights of the ‘For’ voters is greater than the sum of token weights of the ‘Against’ voters. Based on our conclusion from Step 2:

  • ‘For’ voters: All agents $a_i$ for whom $y_i > S$.
  • ‘Against’ voters: All agents $a_i$ for whom $y_i \le S$.

So, the proposal to move right passes if: \(\sum_{i \text{ s.t. } y_i > S} x_i > \sum_{i \text{ s.t. } y_i \le S} x_i\)

4. The Equilibrium Condition

Equilibrium is the state $S^*$ where the system stops moving. This means no proposal can pass. For $S^*$ to be a stable equilibrium, two conditions must be met:

  1. A proposal to move a small step right must fail.
  2. A proposal to move a small step left must fail.

Let’s apply our tallying rule to this:

Condition 1 (Rightward move fails): \(\sum_{i \text{ s.t. } y_i > S^*} x_i \le \sum_{i \text{ s.t. } y_i \le S^*} x_i\)

Condition 2 (Leftward move fails): (By symmetric logic, a vote to move left passes if the weight of voters to the left is greater than the weight of voters to the right). So, for it to fail: \(\sum_{i \text{ s.t. } y_i < S^*} x_i \le \sum_{i \text{ s.t. } y_i \ge S^*} x_i\)

These two conditions together define the weighted median. $S^*$ is the point that divides the total token distribution in half.

Conclusion: The equilibrium state $S^*$ of the simplified DAO is the token-weighted median of the members’ beliefs.

Why This Matters: A Concrete Example

Consider a DAO focused on climate policy with 200 members:

  • 100 members each hold 1 token and believe the optimal carbon tax should be $40 per ton.
  • 99 members each hold 1 token and believe it should be $45 per ton.
  • 1 “whale” member holds 101 tokens and believes it should be $1000 per ton.

The token-weighted mean would be $(100 \cdot 40 + 99 \cdot 45 + 101 \cdot 1000) / 300 \approx 365$. The whale’s position, amplified by their token holdings, dramatically skews the average.

The token-weighted median, however, is the value at the 150.5th token when all token holders are lined up in order. Since tokens 1–100 sit at position 40, tokens 101–199 at position 45, and tokens 200–300 at position 1000, the median falls at position 45. The DAO’s state converges to 45, largely ignoring the whale’s extreme preference even though they are by far the largest single token holder.
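The arithmetic is easy to verify; a few lines of (illustrative) Python reproduce both numbers:

```python
import numpy as np

# The climate-DAO example: positions in dollars per ton, weighted by tokens.
positions = np.array([40.0, 45.0, 1000.0])
tokens = np.array([100, 99, 101])

# Token-weighted mean: dragged far toward the whale.
print((positions * tokens).sum() / tokens.sum())  # ~364.85

# Token-weighted median: the middle token (150th/151st of 300) in sorted order.
expanded = np.repeat(positions, tokens)           # one entry per token
print(np.median(expanded))                        # 45.0
```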

This illustrates a crucial property: the median is robust to extremes but insensitive to preference intensity. We view this as a feature, not a bug. Democratic voting power—where each token carries equal weight regardless of how strongly its holder feels—protects against both plutocracy and the tyranny of passionate minorities who might otherwise dominate governance through the sheer intensity of their beliefs. Those who care deeply about an issue can acquire more tokens or convince others, but they cannot simply declare their preferences more important.

Beyond One Dimension: The Geometric Median

In our multi-dimensional model ($k > 1$), the equilibrium point is no longer the simple median. It converges to the Geometric Median[7]. This is the point $S^*$ that minimizes the sum of weighted Euclidean distances to all member beliefs: \(S^* = \underset{S \in \mathbb{R}^k}{\arg\min} \sum_{i=1}^n x_i \|S - y_i\|\)

Unlike the one-dimensional median, the geometric median in higher dimensions has no closed-form solution and must be computed iteratively—a reminder that even simple-seeming governance goals can hide computational complexity.
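The standard iterative method is Weiszfeld’s algorithm, which repeatedly re-weights each belief by the inverse of its distance to the current estimate. Below is a minimal sketch under that approach, omitting the usual safeguard for iterates that land exactly on a data point:

```python
import numpy as np

def weighted_geometric_median(points, weights, n_iters=200, eps=1e-12):
    """Weiszfeld iteration for argmin_S sum_i w_i * ||S - y_i||.
    Minimal sketch: a production version must handle an iterate landing
    exactly on a data point and should check for convergence."""
    S = np.average(points, axis=0, weights=weights)  # start at the weighted mean
    for _ in range(n_iters):
        d = np.linalg.norm(points - S, axis=1)
        w = weights / np.maximum(d, eps)             # inverse-distance re-weighting
        S = (points * w[:, None]).sum(axis=0) / w.sum()
    return S

# A tight cluster of members plus one heavily weighted, extreme whale.
rng = np.random.default_rng(1)
points = np.vstack([rng.normal(size=(99, 2)), [[100.0, 100.0]]])
weights = np.ones(100)
weights[-1] = 50.0                                 # whale weight, still < half the total
print(weighted_geometric_median(points, weights))  # stays near the cluster
```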

The geometric median is the natural generalization of the median to higher dimensions and shares its desirable property of being robust to extreme outliers. A single whale, no matter how far their belief is from the cluster of other members, cannot single-handedly pull the DAO’s state to their ideal point. They must convince others to move with them.

Addressing Complexity and Expertise

The framework reveals both the strengths and limitations of democratic governance. While the median voter theorem provides stability and robustness against extremes, it also highlights challenges when dealing with complex issues that require specialized knowledge.

This doesn’t mean DAOs must be ignorant. We can build auxiliary structures that enhance democratic decision-making without overriding it:

  • Expert councils that analyze proposals and provide voting recommendations with clear reasoning
  • Educational resources and deliberation forums where members can discuss proposals before voting
  • Prediction markets that reveal the expected outcomes of different policy choices
  • Constitutional constraints that members voluntarily adopt to limit certain types of decisions

These mechanisms don’t replace democratic choice—they inform it. The DAO remains aligned when it does what its members want, even when they’re wrong, because the alternative is a system that claims to know better than the people it serves.

The Path Forward

This framework creates both opportunities and obligations for the governance community. The challenge of building truly aligned organizations is fundamentally an engineering problem, and we must approach it with engineering discipline.

DAOs offer a unique laboratory for alignment research. Unlike traditional organizations, their transparent operations and programmable governance allow us to study alignment dynamics in controlled conditions. Every governance crisis, every fork, every failed proposal generates data we can analyze.

The next steps are clear:

  1. Build open-source simulation tools implementing this framework, allowing researchers to test governance mechanisms before deploying them in practice
  2. Create alignment sandboxes—DAOs explicitly designed to stress-test governance mechanisms with diverse stakeholder groups and adversarial proposals
  3. Develop richer metrics for alignment quality that go beyond simple distance measures to incorporate robustness, minority protection, and adaptability
  4. Study organizational scale, understanding how aligned systems can maintain alignment as they grow from dozens to millions of members

The stakes are significant. If we cannot reliably align organizations of humans—with all our conflicts, biases, and competing interests—how can we hope to align systems that may soon surpass human intelligence? The DAO represents our best opportunity to solve alignment in a controlled, transparent environment before the challenge becomes existential.

This is not merely an academic exercise. The institutions we build today will shape the governance of tomorrow’s most powerful technologies. The proving ground is open.


  1. While the legal doctrine of “shareholder primacy” has been challenged in recent years, it remains a powerful norm in corporate governance. The conflict between shareholder value and broader stakeholder interests is well-documented across industries. 

  2. The concept of a DAO was first proposed in detail by Vitalik Buterin in 2014. See: Buterin, V. (2014). “DAOs, DACs, DAs and More: An Incomplete Terminology Guide.” Ethereum Blog.

  3. Arrow, K. J. (1950). “A Difficulty in the Concept of Social Welfare.” Journal of Political Economy, 58(4), 328–346. This is the seminal paper on Arrow’s Impossibility Theorem. 

  4. Posner, E. A., & Weyl, E. G. (2018). Radical Markets: Uprooting Capitalism and Democracy for a Just Society. Princeton University Press. This book provides the most comprehensive argument for Quadratic Voting. 

  5. The use of staking and slashing as a mechanism to ensure honest behavior is a core concept in “cryptoeconomics” and mechanism design, particularly in Proof-of-Stake consensus protocols. 

  6. Black, D. (1948). “On the Rationale of Group Decision-making.” Journal of Political Economy, 56(1), 23–34. This paper introduced the median voter theorem, a foundational concept in public choice theory. 

  7. The geometric median, also known as the Fermat–Torricelli point or Fermat–Weber point, is a classic problem in geometry and optimization. It is known for its robustness as a measure of central tendency.