Google Gemini’s Veo 3 Can Now Animate Photos Into 8‑Second Videos
Google is rolling out a photo-to-video feature in Gemini powered by its Veo 3 model, allowing AI Pro and Ultra users to transform still images into 8‑second MP4 clips complete with synced sound and watermarks
image for illustrative purpose

Google has introduced a transformative feature in its Gemini app: the ability to convert static photos into eight-second video clips enriched with AI-generated audio. The technology is powered by the advanced Veo 3 video model.
Here’s how it works:
- Exclusive to Google AI Ultra and Pro subscribers (now available in select regions)
- Accessible via desktop now, rolling out to mobile later this week
- Users click Tools → Video, upload a photo, then describe desired visuals and sounds—dialogue, background noise, effects included
- Delivers 8‑second MP4, 720p resolution, 16:9 format, with visible “Veo” watermark and hidden SynthID watermark for authenticity
Since launching in May via the Flow filmmaking app, over 40 million Veo 3-powered videos have been generated across both platforms. Now, Gemini users get direct access without needing a separate app.
Google emphasizes safety: the tool filters explicit or problematic content, and the watermarks aim to curb misuse.
Feature | Significance |
---|---|
Democratizes video creation | Enables anyone to animate images with sound and realism |
Safety-first design | Watermarks and content filters protect against deepfakes |
Integrated user experience | Brings advanced video tech into Gemini, expanding reach |
Global rollout | Flow expands into 75 new countries alongside Gemini updates |