As we deploy multiple AI systems to make ethical decisions, we face an unavoidable mathematical reality
Plain English: For any value judgment, there exist AI systems that will disagree
Where AI value disagreement already causes problems today
Different platforms, different values
The Conflict: A controversial political statement is posted. Five different AI moderation systems evaluate it:
Decision: β Allow
Reasoning:
Decision: β Remove
Reasoning:
Decision: β οΈ Warn & Label
Reasoning:
Decision: π₯ Community Vote
Reasoning:
The trolley problem, but with real AI systems
The Situation: Unavoidable accident ahead. The AV must choose between:
Decision: Minimize total casualties
Logic:
Decision: Protect occupants at all costs
Logic:
Decision: Randomize to ensure fairness
Logic:
Decision: Maintain course, take no action
Logic:
Fairness means different things to different systems
The Challenge: Three AI hiring assistants evaluate the same candidate pool:
Optimization: Maximize predicted job performance
Optimization: Maximize representation
Optimization: Maximize growth potential
Optimization: Maximize team harmony
See the mathematical impossibility in action
Configure different AI agents and see how they disagree on ethical scenarios
Plot different AI systems in 2D value space and see the disagreement zones
The mathematical foundation for multi-agent value alignment
Each AI has a value function V: States β β that assigns utility to world states.
Where wi are weights learned from training data, and fi are feature functions.
Measure how much two AIs disagree:
Expected absolute difference in value assignments across states.
The triangle inequality shows why:
You can't have all AIs agree when they're trained on fundamentally different value distributions.
What this means for AI deployment and governance
We cannot have a single "aligned" AI that satisfies everyone.
Instead, we need governance frameworks that:
Focus shifts from "alignment" to "value negotiation protocols"
New research questions:
Companies must choose whose values to align with
Business implications:
New legal questions emerge
See the framework in code