create music videos from images and audio with captions and lipsyncing
This agent creates longform music videos with lipsync, image to video, captioning and stitching