Search Results for author: Victor Gomes

Found 1 papers, 0 papers with code

Mirasol3B: A Multimodal Autoregressive model for time-aligned and contextual modalities

no code implementations • 9 Nov 2023 • AJ Piergiovanni, Isaac Noble, Dahun Kim, Michael S. Ryoo, Victor Gomes, Anelia Angelova

We propose a multimodal model, called Mirasol3B, consisting of an autoregressive component for the time-synchronized modalities (audio and video), and an autoregressive component for the context modalities which are not necessarily aligned in time but are still sequential.

Ranked #1 on Audio Classification on VGGSound

Action Classification Audio Classification +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.