1 code implementation • 21 Dec 2023 • Davide Berghi, Philip J. B. Jackson
The multichannel audio "student" network is trained to generate the same results.
1 code implementation • 14 Dec 2023 • Davide Berghi, Peipei Wu, Jinzheng Zhao, Wenwu Wang, Philip J. B. Jackson
Sound event localization and detection (SELD) combines two subtasks: sound event detection (SED) and direction of arrival (DOA) estimation.
no code implementations • 27 Jul 2023 • Davide Berghi, Philip J. B. Jackson
This study considers the problem of detecting and locating an active talker's horizontal position from multichannel audio captured by a microphone array.
no code implementations • 4 Dec 2022 • Davide Berghi, Marco Volino, Philip J. B. Jackson
This is partly due to the lack of available datasets enabling audio-visual research in this direction.
no code implementations • 7 Mar 2022 • Davide Berghi, Adrian Hilton, Philip J. B. Jackson
We propose to generate weak labels using a pre-trained active speaker detector on pre-extracted face tracks.