Conventional audio-visual models have independent audio and video branches. In this work, we unify the audio and visual branches by designing a Unified Audio-Visual Model (UAVM). The UAVM achieves a…