Over the last few months, many AI boosters have grown increasingly interested in generative video models and their seeming ability to show at least limited emergent knowledge of the physical properties of the real world. That kind of learning could underpin a robust version of a so-called "world model," which would represent a major breakthrough in generative AI's practical real-world capabilities.

Recently, Google's DeepMind Research tried to add some scientific rigor to the question of how well video models can actually learn about the real world from their training data. In the bluntly titled paper "Video Models are Zero-shot Learners and Reasoners," the researchers used Google's Veo 3 model to generate thousands of videos designed to test its abilities across dozens of tasks related to perceiving, modeling, manipulating, and reasoning about the real world.

In the paper, the researchers boldly claim that Veo 3 "can solve a broad variety of tasks it wasn’t explicitly trained for" (that's the "zero-shot" part of the title) and that video models "are on a path to becoming unified, generalist vision foundation models." But a closer look at the actual results of those experiments suggests the researchers are grading today's video models on a bit of a curve and assuming future progress will smooth out many of today's highly inconsistent results.
Can today’s AI video models accurately model how the real world works?
