V2M-Zero: Zero-Pair Time-Aligned Video-to-Music Generation
V2M-Zero generates time-aligned music from video using event curves, improving both audio quality and beat alignment across datasets.
Yan-Bo Lin, Jonah Casebeer, Long Mai et al.