Is Vidu 1.5 the Breakthrough Generative AI Needed to Conquer Hollywood?
Nov. 14, 2024, Published 1:28 a.m. ET
From digital twins to game design development, generative AIs have made some promising progress in the media and entertainment industry, but we’re seeing a turning point. And what Shengshu is bringing to the table just might upend or empower filmmakers and Hollywood, depending on which side you’re on.
The film industry has dabbled in, or at least attempted to use, generative video. But there are caveats. When you generate a video, the output is often not what you expected — it's jumpy and obviously looks like AI.
A sneak peek at OpenAI's Sora showed just how tricky visual consistency is to deliver. It's not uncommon for the sizes and appearances of objects to change throughout a generated video, so post-production on AI-generated footage remains a practical necessity. And AI-generated videos haven't quite been able to replicate the natural look and feel of light interacting with a generated character.
Vidu 1.5 is a major update to Shengshu's video generator, and this time it's all about putting more control in the user's hands. Camera angles, character actions, and subtle expressions no longer produce jarring results. The final footage from Vidu 1.5 looks and feels like conventional video from start to finish, with fewer of the abrupt jumps and unwanted transitions typical of generative video today. It's as if you're in the director's seat directing live actors — except it's all done with text inputs instead of expensive camera equipment.
In one example, Vidu 1.5's new Multiple-Entity Consistency feature — which Shengshu says is the first of its kind — can merge separate images, even ones completely unrelated to each other. Take, for instance, a profile shot of Elon Musk, a second image of a rose-print shirt, and a third of a moped. Uploading these three images results in a cohesive video of Elon, dressed in a rose-print button-down shirt, enjoying a moped ride.
Or, in another scenario using its Multiple-Angle Consistency function, three images of a single subject — say, a model — can be uploaded from differing angles. The result? Vidu 1.5 is scarily accurate at predicting what the model might look like from any angle. Normally, the intricate details of a dress would easily throw off an AI. Vidu 1.5, on the other hand, generates a video in which the model walks, and can even turn a full 360 degrees, without breaking visual continuity — with minimal distortion to the natural flow of the dress and her facial expressions.
You can even direct the cinematography with text inputs — zooming, panning, tilting, or rotating the generated footage — and now output high-resolution footage in 720p or 1080p.
The 1.5 update also adds support for animators and 2D artists with Expanded Animation Styles and special effects — great for artists specializing in Japanese fantasy and hyper-realistic anime genres. Vidu 1.5 offers enhanced flame visuals and dynamic lighting that more accurately renders light against shadow.
Building on an already powerful multimodal text- and image-to-video model, Vidu's new upgrade is about helping anyone create. The idea isn't to replace human creativity with AI tools, but to blend technical precision with creative potential, ultimately making high-quality video production more accessible to the masses.
In fact, the firm claims the new upgrade can significantly reduce the need for manual editing. Ultimately, it addresses a major, persistent problem with generative video — the lack of a consistent look and feel. That could mean time-strapped creatives get more time for concept ideation instead of endless hours of post-production. Dare we say, Vidu's users can practically create Hollywood-style clips without a Hollywood-size budget.