Search Results for author: Samuel Pegg

Found 2 papers, 2 papers with code

TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion

1 code implementation | 25 Jan 2024 | Samuel Pegg, Kai Li, Xiaolin Hu

TDANet serves as the architectural foundation for the auditory and visual networks within TDFNet, offering an efficient model with fewer parameters.

Speech Recognition +1

RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation

1 code implementation | 29 Sep 2023 | Samuel Pegg, Kai Li, Xiaolin Hu

This is the first time-frequency domain audio-visual speech separation method to outperform all contemporary time-domain counterparts.

Audio-Visual Speech Recognition, Speech Recognition +2
