AI is about to disrupt video creation by lowering costs, reducing production time, and making it possible for anyone to create cinematic scenes from a prompt. I would bet that someone is creating a feature-length film right now, one prompt at a time. Perhaps we will even see the first Oscar awarded to a prompt engineer in the not-too-distant future.
OpenAI’s Sora is the first glimpse of what might be coming very soon in the next revolution of generative AI.
On February 16th, 2024, OpenAI unveiled Sora, a text-to-video AI model. Users write a prompt, and Sora creates a video that tries to match the user's description. Watch OpenAI’s introduction video for Sora below.
Sora can generate videos up to a minute long. In the public examples that OpenAI has shared, the videos range from 8 to 59 seconds in length.
At the time of this writing, Sora is not available to the general public. Sora is currently available to two groups of people.
Your best chance to try Sora is to tweet your prompt to the people working on Sora at OpenAI.
The model has a deep understanding of language, enabling it to accurately interpret prompts and generate compelling characters that express vibrant emotions. Sora can also create multiple shots within a single generated video that accurately persist characters and visual style.
Source: OpenAI.com/sora
The Sora team is working to generate and simulate anything: animals, animation, aerial drone shots, cinematic B-roll, humans, close-up shots, vehicles, and more.
At the top of the list, short-form video on platforms like TikTok, Instagram Reels, and YouTube Shorts will most likely see an explosion of Sora-generated content. I predict a noticeable increase in new faceless channels. Sora will be a catalyst that lets anyone start a channel while remaining anonymous.
Imagine that you work at a travel agency and need to promote a tour package. In the past, you would have had to rely on stock video or hire a videographer. With the advent of Sora, that video can be created with a prompt.
Sora can create unlimited unique B-roll to help sell products and services, with the added benefit that you get the generated results within seconds.
I would wager that someone right now is using a text-to-video AI model to piece together a compelling feature-length film that is 100% AI-generated. If Sora can keep generated characters consistent, it will enable anyone who can tell a good story to become a filmmaker.
100-page employee handbooks? Encyclopedia-sized technical manuals? Sora can deliver educational and training material through video. We all know that technical documents are not exactly page-turners. Presenting dry written content as entertaining video may help people retain more of the information.
While Sora is still being developed behind the scenes, there are text-to-video AI models that you can try today, including Runway ML, invideo, and Fliki. I’m also curating a list of AI video generators here.
Sora appears to be a big improvement for generative text-to-video AI models. While OpenAI finalizes the model for public use, especially when it comes to safety, we will most likely see it first as a standalone interface and later integrated into the ChatGPT interface, similar to how DALL-E evolved.