Argomenti trattati
In a significant advancement for the field of artificial intelligence, Google DeepMind has introduced SIMA 2, an advanced AI agent capable of playing various video games, including popular titles like No Man’s Sky and Valheim. This development is not merely about enhancing gaming experiences; it represents a crucial step towards achieving artificial general intelligence (AGI), which aims to create systems that can perform any intellectual task a human can.
Unveiled during a recent press briefing, SIMA 2 builds upon its predecessor, SIMA, launched in March 2025. The new agent integrates the advanced capabilities of Google’s Gemini AI, allowing it to not only follow commands but also comprehend user intentions, engage in complex reasoning, and execute multifaceted actions within virtual environments, including those it has yet to encounter.
The evolution of SIMA
SIMA 2 is a product of extensive research and development within DeepMind. The initial version, SIMA, was trained using hundreds of hours of gameplay data, enabling it to tackle various 3D gaming environments effectively. However, SIMA faced limitations, achieving only a 31% success rate in completing complex tasks, while human players managed a 71% success rate. With SIMA 2, researchers aim to address these challenges.
Enhanced capabilities with Gemini
The integration of the Gemini AI model into SIMA 2 has significantly amplified its performance. Joe Marino, a senior research scientist at DeepMind, highlighted that SIMA 2 can now handle intricate tasks in unfamiliar settings and learn from its experiences. This self-improvement capability is pivotal for the future development of general-purpose robots and AGI systems.
What sets SIMA 2 apart is its ability to engage in reasoning and understanding, rather than just executing predefined commands. During a demonstration in No Man’s Sky, SIMA 2 accurately described its surroundings and made logical decisions based on its observations, such as identifying a distress beacon and determining its next move.
Implications for the future
Beyond gaming applications, the potential for SIMA 2 extends into the real world. Jane Wang, another senior researcher at DeepMind, emphasized that while gaming serves as a valuable training platform, the ultimate goal is to transfer these skills to practical scenarios. The team envisions a future where AI agents can assist in everyday tasks by understanding and responding to complex human requests.
Building toward AGI
The race to develop AGI has intensified as tech giants like Google, Meta, and OpenAI invest heavily in AI research. DeepMind’s blog post refers to SIMA 2 as a “significant step” toward this goal, with implications for robotics and AI embodiment. The ability to perform actions in both virtual and physical worlds is seen as fundamental to achieving true AGI.
Furthermore, SIMA 2’s self-improvement mechanism allows it to learn and adapt without extensive human intervention. Unlike SIMA 1, which relied solely on human gameplay data, SIMA 2 can generate its own tasks and evaluate its performance through a feedback loop, enhancing its learning process. This capability mirrors human learning patterns, making it a promising step toward creating intelligent systems that can independently navigate complex environments.
As DeepMind continues to refine SIMA 2, the implications for AI and robotics are profound. The integration of language and reasoning abilities through Gemini is paving the way for more sophisticated AI systems that can engage with their environments in meaningful ways. While the timeline for deploying SIMA 2 in physical robotics remains unclear, its advancement signifies a step closer to a future where AI can assist in real-world applications, fundamentally changing how we interact with technology.

