Glif
AgentsWorkflowsModelsLearnAPIDocs
MMAudio mark

MMAudio V2

A model that generates audio for videos, optionally guided by a text prompt. It can work with an empty prompt, relying on the visual/video input alone to create appropriate sound, speech, or ambience.

Use this model

Inputs

Video

Outputs

Audio

Lab

fal-ai

Launch your first agent today

Build without code. Share without limits.

Create an Agent

Resources

  • Guide/Docs
  • API
  • Chrome Extension
  • Contact Us

Company

  • Jobs
  • Brand
  • Legal
  • Privacy Policy
  • Security

Social

  • X (Twitter)
  • YouTube
  • TikTok
  • Instagram
  • LinkedIn

Community

  • Discord

© 2023-2025 Spellcasters, Inc.