Google’s Genie 3 AI can create playable videogames from a single prompt.

Google has unveiled Genie 3 today, a groundbreaking AI that can create interactive, playable environments from a simple image or text prompt. This is a huge leap forward in generative AI. It goes beyond static images and texts to create dynamic environments that you can explore.

The new technology developed by Google’s DeepMind represents a new frontier in what we call “world models”. It’s a machine that can create a virtual world on the fly. Imagine typing in “a magical forest” with glowing mushrooms and hidden river, and being able walk around that world immediately. Genie 3 promises to democratise the creation of virtual worlds for everyone. Not just those with advanced programming skills.

There are a wide range of applications, from creating video game levels that are unique to training robot agents in a variety of simulated scenarios prior to their deployment in the real world. Google DeepMind claims that this is an important step on the road to Artificial General Intelligence.

Today we announce Genie 3, a world model with a general purpose that can create an unprecedented variety of interactive environments. World models are a crucial step on the road to AGI because they allow AI agents to be trained in a vast curriculum of rich simulation environments.

Google DeepMind.

Genie 3.

Genie 3 can be used to create a wide range of interactive environments. It uses a text prompt to create navigable worlds in 720p resolution. The model runs smoothly at 24 frames per seconds and maintains consistency for several minutes. It is ideal for simulating anything from animated fantasies to natural ecosystems.

This model excels at modelling physical properties such as lighting and water while also handling complex interaction in fictional or historic settings. Users can explore these virtual worlds as though they were real. This is a major step towards immersive AI experiences. DeepMind describes it as a way to create unlimited training grounds for AI.

How Does Genie 3 Work?

Genie 3 is based on auto-regressive creation to build each frame in a sequential manner, taking into account past actions and user inputs. This allows for real-time responsiveness. The model will remember and maintain details from a previous visit if you revisit a location after a minute. This is a clever solution to avoid the environment becoming unreliable over time.

Promptable world events is a feature that stands out. Text commands can be used to alter the scene, such as changing the weather or adding characters. This adds interactivity beyond basic navigation. How does it perform?

Consistency in the Environment
Genie 3 maintains worlds for minutes. Its visual memory holds up even when elements are moving into and out of view. This is a natural result of its training. Genie 3 allows users to interact in real-time with the generated world, which is a major improvement over previous versions.

Impressive Performance
This model can generate dynamic worlds at 24 frames per seconds with a 720p Resolution, making it feel fluid and responsive.

The technology could change the game industry, allowing developers and designers to quickly prototype ideas or have games generate content for players. It opens up a whole new world of creative possibilities to storytellers, educators, and artists. Google’s Deepmind has published a video that shows the possibilities. It’s nothing less than amazing. This could literally be the future of video games.

Google hasn’t yet announced the date of public release for Genie 3. It is a research project for now, but it paints a fascinating picture of the future.

To learn more, visit on the Google DeepMind Blog

www.aiobserver.co

More from this stream

Recomended