1 code implementation • 31 Jan 2024 • Jaeyeon Kim, JaeYoon Jung, Jinjoo Lee, Sang Hoon Woo
We also introduce a new training objective called masked codec modeling that improves acoustic awareness of the pretrained language model.
Ranked #1 on Audio captioning on AudioCaps