MMAudio generates synchronized audio given video and/or text inputs. It can be combined with video models to get videos with audio.