Perfectly Aligning AI’s Values With Humanity’s Is Impossible

# Perfect AI Alignment Is Mathematically Impossible—But There's a Workaround Scientists have proven mathematically that it's impossible to perfectly align AI systems with human values, meaning we can never guarantee an AI will want exactly what we want. Rather than seeing this as a dead end, researchers propose a solution: create multiple AI systems with different reasoning styles and overlapping (but not identical) goals, so they naturally keep each other in check instead of any single AI taking over. It's less like building one perfectly obedient AI and more like creating a system of checks and balances, similar to how different government branches limit each other's power.
One of the hardest problems in artificial intelligence is “alignment,” or making sure AI goals match our own, a challenge that may prove especially important if superintelligent AIs that outmatch us intellectually are ever developed. But scientists in England and their colleagues now report in the j
More from Learn AI
Get new guides every week
Real AI income strategies, tool reviews, and plain-English news — free in your inbox.



