Ovi
Ovi is a Veo-3–style text/image-to-video model that generates video and audio simultaneously, producing 5‑second clips at 24 FPS with synchronized sound. It supports spoken dialogue and sound effects via prompt tags: wrap speech in `<S>...</S>` and background audio descriptions in `<AUDCAP>...</AUDCAP>`. This makes it well-suited for sh…
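As a rough illustration of the tag convention described above, the following sketch assembles a prompt string from a scene description, spoken lines, and a background-audio caption. The helper name `build_ovi_prompt`, its parameters, and the ordering of the pieces (scene first, speech next, audio caption last) are assumptions made for this example, not a documented API; only the `<S>...</S>` and `<AUDCAP>...</AUDCAP>` tags come from the description.

```python
from typing import List


def build_ovi_prompt(scene: str, speech: List[str], audio_caption: str = "") -> str:
    """Compose a single prompt string using the speech and audio tags.

    Hypothetical helper: the ordering of parts is an assumption, not
    something the model description specifies.
    """
    parts = [scene.strip()]
    # Each spoken line is wrapped in its own <S>...</S> pair.
    for line in speech:
        parts.append(f"<S>{line.strip()}</S>")
    # The background-audio description is optional.
    if audio_caption.strip():
        parts.append(f"<AUDCAP>{audio_caption.strip()}</AUDCAP>")
    return " ".join(parts)


prompt = build_ovi_prompt(
    scene="A chef plates a dessert in a busy kitchen.",
    speech=["Order up!"],
    audio_caption="Clattering pans and a low kitchen hum.",
)
print(prompt)
```

The resulting string would then be passed as the text prompt to whatever interface hosts the model; how tags interact with image inputs is not covered by the description and is left out here.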
Inputs: Text, Image
Outputs: Video, Audio
Lab: Character AI
Ovi sample outputs
Captured directly from our eval suite.