Ovi
Ovi is an experimental Veo-3–style video+audio generation model that takes text and/or image inputs and produces 5-second, 24 FPS clips with synchronized audio. It supports structured prompting for dialogue and sound effects via `<S>...</S>` (speech) and `<AUDCAP>...</AUDCAP>` (background audio) tags.
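As a rough illustration of the structured prompting described above, the sketch below assembles a prompt string from a scene description, spoken lines, and a background-audio caption. The helper name `build_ovi_prompt` and its parameters are hypothetical and not part of any Ovi API. The tag syntax is copied from the description above, and the ordering of the parts is an assumption.

```python
def build_ovi_prompt(scene, speech=(), background_audio=None):
    """Assemble a tagged prompt string (hypothetical helper, not an Ovi API).

    scene: free-text visual description.
    speech: iterable of spoken lines, each wrapped in <S>...</S>.
    background_audio: optional caption wrapped in <AUDCAP>...</AUDCAP>.
    """
    parts = [scene.strip()]
    for line in speech:
        # Speech tag as given in the model description.
        parts.append(f"<S>{line.strip()}</S>")
    if background_audio:
        # Background-audio tag as given in the model description.
        parts.append(f"<AUDCAP>{background_audio.strip()}</AUDCAP>")
    return " ".join(parts)


prompt = build_ovi_prompt(
    "A chef plates pasta in a busy kitchen.",
    speech=["Dinner is served."],
    background_audio="Pans clatter and a crowd murmurs.",
)
print(prompt)
```

The resulting string would then be passed as the text input, optionally alongside a conditioning image. How the model is actually invoked depends on the hosting interface and is not shown here.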
Inputs: Text, Image
Outputs: Video, Audio
Lab: Character AI
Ovi sample outputs
Captured directly from our eval suite.