6 Jul 2023 • Netta Madvil, Yonatan Bitton, Roy Schwartz
We propose a two-step method for analyzing multimodal datasets that uses a small seed of human annotations to map each multimodal instance to the modalities required to process it.