Creating videos with AI: OpenAI's Sora and Google's Veo 2 ...
ChatGPT and other text-based AI tools have transformed the way we interact with technology, making it easier than ever to generate ideas, automate tasks, and enhance productivity. But AI is no longer just about text. The evolution of multimodal AI—capable of understanding and generating across different formats—has opened new frontiers.

Not long ago, creating a realistic-looking video required millions of dollars. All it takes now are text commands (prompts) on a text to video generator to create videos that would make a casual observer doubt if the video they are looking at is real. What once required expensive equipment and a team of professionals can now be achieved with a few lines of text. In the future, creativity might be the most valuable skill, as AI editors handle the rest.
Open AI’s Sora Turbo & Google DeepMind’s Veo 2
There are a variety of AI-driven video editors available, but the latest and most advanced are Open AI’s Sora Turbo and Google DeepMind’s Veo 2. Since their debut in Dec 2024, there has been intense digital chatter about various aspects of both and which is better.

The first notable difference is that Veo 2 can natively generate 4k resolution videos, while Sora Turbo can only go up to 1080p resolution. Veo 2’s videos look more realistic and emphasize accuracy, with a focus on prompt adherence and prompt execution.
Sora Turbo allows the creation of twenty-second videos in various aspect ratios, while Veo 2 can generate longer videos. Sora has introduced a new dashboard and interface, making it easier to prompt Sora with text, images, and videos. It also includes a storyboard tool with templates for user experimentation.

Google’s Veo 2 is capable of understanding simple to complex instructions, with superior knowledge of real-world physics and enhanced realism. It can simulate visuals of specific lenses and cinematic jargon like depth of field and tracking shot. Veo 2 is known for producing realistic outputs with fewer instances of "hallucinations."
Veo 2 is preferred over other text to video generators based on human ratings. Alphabet, the parent company of YouTube and Veo 2, ensures superior quality output from Veo 2.
Availability and Plans
Sora is available to ChatGPT Plus account holders at no extra cost, with limitations on video resolution and quantity. Veo 2 is available in select markets for preview through Google Lab’s VideoFX.
Google is targeting filmmakers and enterprises with professional quality videos, while OpenAI's videos could cater to the general public and individual creators.

Both Veo 2 and Sora are being cautiously rolled out to address concerns about misinformation, deepfake videos, and misattribution. The generated outputs are traceable to the source, with watermarks and metadata for identification.
Google plans to expand Veo 2 to YouTube Shorts in 2025, improving its capabilities for handling complex scenes. As AI video editing technology progresses, industry experts anticipate significant innovations from both OpenAI and Google, shaping a promising future for AI-driven video editing.
The creative capabilities of AI are expanding rapidly, raising questions not about whether AI can assist in content creation, but how far it will go. The rise of AI-powered video editing hints at a future where human imagination might be the only limit.