Written by Content Desk, Brand Practitioners• February 18, 2024• 2:55 pm• News

OpenAI Introduces Sora: A Text-to-Video Generative AI Model

OpenAI has unveiled its latest innovation, Sora, a cutting-edge diffusion model designed to revolutionize text-to-video creation. Developed by the creators of ChatGPT, Sora boasts the ability to generate videos in various resolutions and aspect ratios, as well as edit existing videos swiftly based on text prompts. This advanced AI model enables quick adjustments to scenery, lighting, and shooting style, all through simple textual input. Moreover, Sora can create videos from a single still image and even extend existing videos by filling in missing frames.

OpenAI - Sora

According to OpenAI, Sora is currently capable of generating up to a minute of Full HD video content, and initial examples demonstrate its promising capabilities. To explore more video samples generated by Sora, visit its dedicated landing page.

Sora can generate complex scenes with multiple characters, specific types of motion, and accurate details of the subject and background. The model understands not only what the user has asked for in the prompt, but also how those things exist in the physical world.

Sora operates on a transformer architecture akin to ChatGPT, treating videos and images as smaller data units known as patches. The video generation process in Sora begins with a presentation of static noise, gradually refined by the model to produce the final output.

OpenAI has incorporated its existing safety protocols, as seen in DALL·E 3, to ensure the responsible deployment of Sora. The model is currently undergoing rigorous testing by “red teamers” – experts tasked with assessing potential risks before its official release.

Additionally, OpenAI plans to engage in discussions with policymakers, artists, and educators to explore potential concerns and applications for Sora. As of now, no official launch date has been announced.

Share this on