Veo 3, Google’s newest generative AI video model, has officially launched in India. The model was previewed at the recent Google I/O event. For now, Veo 3 is available only to users with a Gemini 'Pro' subscription.
Veo 3 lets users create eight-second video clips with audio. It can synthesize speech and add background music and sound effects, which makes the clips more realistic.
“From reimagining historical events through the perspective of a modern influencer to envisioning the sound of cutting through a glass apple, and even capturing sightings of the legendary Bigfoot, your creativity knows no bounds when creating photos and videos on Gemini. These incredible creations inspire our work on the Veo 3 team and motivate us to make Veo 3 accessible to more people around the globe,” Google stated, highlighting the tool's capabilities.
Google unveiled Veo 3 on May 20 at its annual developer conference. The model generates cinematic video together with realistic audio, including dialogue, sound effects, and background music.
All videos produced with Veo 3 from photos will carry a visible watermark as well as an invisible SynthID watermark. Both indicate that the content was generated using AI.
Google also reaffirmed its commitment to the safe and responsible use of AI. The company said it is taking measures to make video generation safe, including extensive red teaming and evaluations of its AI models.
After Veo 3 was introduced at Google I/O, several users on X shared videos they had made with the model. The tool is widely seen as Google’s answer to OpenAI’s Sora. Following the announcement, Eli Collins, vice president of product at Google DeepMind, noted that Veo 3 excels at text and image prompting, real-world physics, and accurate lip syncing.