1. Overview: The Post-Human Data Era Begins

On April 27, 2026, the AI industry witnessed a seismic shift: David Silver, the legendary researcher behind DeepMind’s AlphaGo and AlphaZero, announced the close of a $1.1 billion Series A funding round for his new startup, Tabula AI. The news, first reported by TechCrunch, marks one of the largest early-stage investments in the history of artificial intelligence and signals a massive pivot in how the industry views the future of machine learning.

For the past several years, the AI boom has been fueled by Large Language Models (LLMs) that rely on gargantuan datasets scraped from the internet—human-written text, code, and images. However, Silver’s new venture is built on a radical premise: To reach Artificial General Intelligence (AGI), we must stop teaching AI to imitate humans and instead allow it to learn for itself.

This approach, often referred to as "Tabula Rasa" (blank slate) learning, draws from Silver’s success with AlphaZero, which mastered chess and Go not by studying human games, but by playing against itself millions of times. As the industry faces the looming "data wall"—where high-quality human data is becoming scarce and legally contentious—Silver’s $1.1 billion war chest suggests that the era of self-learning AI is no longer a theoretical pursuit, but the next commercial frontier.

2. Details: The Silver Philosophy and the End of Imitation

The Limitations of Current LLMs

While models like Gemini 3.1 Pro have pushed the boundaries of reasoning and context, they are fundamentally limited by their training data. Current LLMs are trained to predict the next token based on human patterns. This creates several inherent flaws:

  • Data Exhaustion: We are running out of high-quality human text. Synthetic data (AI-generated text used for training) often leads to model collapse or degradation.
  • Human Bias: If an AI learns only from humans, it inherits our prejudices, logical fallacies, and limitations.
  • The Ceiling of Imitation: An AI that learns by imitation can, at best, match the quality of its training set; it struggles to innovate or to find solutions that humans haven't already documented.

The Tabula AI Approach: Search and Reinforcement Learning

David Silver has long argued that "Reward is Enough." In his view, the most powerful AI systems are those that are given a goal (a reward function) and left to explore the environment to maximize that reward. According to the recent Wired profile, Silver believes the current path of AI—relying on massive transformer models—is an "inefficient detour" from the true path to intelligence.
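The "reward is enough" loop can be sketched with a toy example. The epsilon-greedy bandit agent below is given nothing but a reward signal—no human demonstrations, no labeled data—and discovers the best action purely by trial and error. Every name and number here is illustrative; this is a minimal sketch of the general idea, not anything from Tabula AI:

```python
import random

def learn_from_reward(true_means, episodes=5000, epsilon=0.1, seed=0):
    """Epsilon-greedy bandit: the agent receives only a scalar reward,
    never examples of correct behavior, and learns which arm pays best."""
    rng = random.Random(seed)
    n = len(true_means)
    estimates = [0.0] * n  # running estimate of each arm's reward
    counts = [0] * n
    for _ in range(episodes):
        # Occasionally explore a random arm; otherwise exploit the best estimate.
        if rng.random() < epsilon:
            arm = rng.randrange(n)
        else:
            arm = max(range(n), key=lambda a: estimates[a])
        # The environment answers with a noisy reward; no human labels involved.
        reward = true_means[arm] + rng.gauss(0, 0.1)
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
    return estimates

est = learn_from_reward([0.2, 0.5, 0.9])
print(max(range(3), key=lambda a: est[a]))  # → 2, the highest-reward arm
```

The agent converges on the best action without ever seeing an example of "correct" play, which is the essential property Silver's argument relies on.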

Tabula AI’s core technology focuses on two pillars:

  1. Pure Reinforcement Learning (RL): Building systems that learn through trial and error in simulated environments.
  2. Inference-Time Compute: Rather than relying solely on pre-trained weights, the AI spends large amounts of computation during the "thinking" phase to search the space of possibilities. This aligns with the industry's broader shift toward scaling inference-time compute, where the quality of the output is determined by how much the model can "reason" before answering.
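The second pillar can be illustrated with best-of-N sampling, one simple form of inference-time compute: generate many candidate answers and keep the one a scoring function prefers. Here `propose` and `score` are hypothetical stand-ins for a generative model and a verifier; they are not part of any published Tabula AI system:

```python
import random

def propose(rng):
    """Stand-in for a model sampling one candidate solution."""
    return rng.uniform(-4, 4)

def score(x):
    """Stand-in for a verifier/reward model; the true optimum is x = 2."""
    return -(x - 2) ** 2

def answer(compute_budget, seed=0):
    """Best-of-N search: a larger inference-time budget means more
    candidates are explored before committing to an answer."""
    rng = random.Random(seed)
    candidates = [propose(rng) for _ in range(compute_budget)]
    return max(candidates, key=score)

cheap = answer(compute_budget=4)
expensive = answer(compute_budget=4096)
print(abs(expensive - 2) <= abs(cheap - 2))  # → True: more "thinking" never does worse here
```

With a fixed seed the larger budget samples a superset of the smaller budget's candidates, so its best answer is at least as good, mirroring the claim that output quality scales with compute spent before answering.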

The $1.1 Billion Bet

The funding round, led by major venture capital firms and supported by sovereign wealth funds, is intended to build the world’s most advanced "Self-Play Supercomputer." Unlike the data centers used by OpenAI or Google, which are optimized for processing petabytes of text, Silver’s infrastructure will be optimized for simulation and search. This hardware-software synergy is reminiscent of how AWS has adopted the Model Context Protocol (MCP) to standardize AI infrastructure, though Silver’s requirements for real-time simulation are even more demanding.

3. Discussion: The Pros and Cons of Self-Learning AI

Pros: Breaking the Human Ceiling

The most significant advantage of David Silver’s approach is the potential for super-human innovation. When AlphaZero played Go, it produced moves that human experts described as "alien": moves that had never been seen in 3,000 years of human play. By discarding human data, an AI could discover more efficient ways to solve physics problems, design semiconductors, or write code more optimized than anything a human has produced.

Furthermore, this approach sidesteps the legal and ethical minefield of copyright. If the AI learns from its own experience rather than from scraped web content, the risk of intellectual-property infringement is sharply reduced. This matters as AI agents begin to dominate software development, where the need is for original, high-performance logic rather than boilerplate code.

Cons: The Alignment and Simulation Challenges

However, the "Tabula Rasa" path is fraught with difficulties. The first is the Alignment Problem. If an AI develops its own logic through self-play, it becomes a "black box" that is even harder for humans to interpret. If the AI's internal reasoning is no longer based on human language or concepts, ensuring it remains safe and beneficial to humanity becomes an order of magnitude more difficult.

The second challenge is the Simulation Gap. While games like Go or Chess have clear, mathematical rules, the real world is messy. For an AI to learn without human data, it needs a perfect simulator of the environment it is meant to operate in. Creating a simulator for "writing a legal contract" or "managing a supply chain" is significantly harder than creating a simulator for a board game.

4. Conclusion: A New Hegemony in the AI Arms Race

The emergence of Tabula AI and the massive $1.1 billion investment in David Silver’s vision marks the beginning of the end for the "Imitation Era" of AI. While the industry has spent the last five years perfecting the art of the chatbot, the next five years will be defined by Autonomous Discovery.

If Silver succeeds, we will see a shift in the AI hierarchy. The winners will no longer be those with the largest datasets, but those with the most efficient simulators and the greatest compute power for self-play. This transition will redefine everything from scientific research to the way we build software. As we noted in our inaugural AI Watch post, the "now" of AI technology is a moving target, and David Silver just moved the target further than anyone expected.

The message is clear: To reach the next level of intelligence, AI must leave its human teachers behind and step into the unknown. Whether this leads to a utopian era of discovery or an uncontrollable "alien" intelligence remains the defining question of our decade.

References