Google’s Genie 2 “world model” reveal leaves more questions than answers

May Be Interested In:Danny Care: Former England scrum-half to retire from rugby union at end of the season


As podcaster Ryan Zhao put it on Bluesky, “The design process has gone wrong when what you need to prototype is ‘what if there was a space.'”

Gotta go fast

When Google revealed the first version of Genie earlier this year, it also released a detailed research paper outlining the specific steps taken behind the scenes to train the model and how that model generated interactive videos. They haven’t done the same for a research paper detailing Genie 2’s process, leaving us guessing at some important details.

One of the most important of these details is model speed. The first Genie model generated its world at roughly one frame per second, a rate that was orders of magnitude slower than would be tolerably playable in real time. For Genie 2, Google only says that “the samples in this blog post are generated by an undistilled base model, to show what is possible. We can play a distilled version in real-time with a reduction in quality of the outputs.”

Reading between the lines, it sounds like the full version of Genie 2 operates at something well below the real-time interactions implied by those flashy GIFs. It’s unclear how much “reduction in quality” is necessary to get a diluted version of the model to real-time controls, but given the lack of examples presented by Google, we have to assume that reduction is significant.

Oasis’ AI-generated Minecraft clone shows great potential, but still has a lot of rough edges, so to speak.


Credit:

Oasis

Real-time, interactive AI video generation isn’t exactly a pipe dream. Earlier this year, AI model maker Decart and hardware maker Etched published the Oasis model, showing off a human-controllable, AI-generated video clone of Minecraft that runs at a full 20 frames per second. However, that 500 million parameter model was trained on millions of hours of footage of a single, relatively simple game, and focused exclusively on the limited set of actions and environmental designs inherent to that game.

When Oasis launched, its creators fully admitted the model “struggles with domain generalization,” showing how “realistic” starting scenes had to be reduced to simplistic Minecraft blocks to achieve good results. And even with those limitations, it’s not hard to find footage of Oasis degenerating into horrifying nightmare fuel after just a few minutes of play.

share Share facebook pinterest whatsapp x print

Similar Content

Sync Your Calendar With the Solar System
Sync Your Calendar With the Solar System
Supreme Court casts doubt on TikTok's free-speech defense as shutdown law is set to take effect
Supreme Court casts doubt on TikTok’s free-speech defense as shutdown law is set to take effect
Gene Hackman: Death of actor and wife 'suspicious enough' for investigation, police say
Gene Hackman: Death of actor and wife ‘suspicious enough’ for investigation, police say
High school wrestling
Saskatoon High school wrestling season will wrap up on February 27-28
Well, Duh! New Poll Says Americans Were Unhappier Than Ever Under Biden
Well, Duh! New Poll Says Americans Were Unhappier Than Ever Under Biden
How to Hurt Myanmar's Coup Leaders, According to Activists
How to Hurt Myanmar’s Coup Leaders, According to Activists
Changing Perspectives: A New Take on Global Events | © 2024 | Daily News