no code implementations • 31 May 2017 • Ken Hoover, Sourish Chaudhuri, Caroline Pantofaru, Malcolm Slaney, Ian Sturdy
In this paper, we present a system that associates faces with voices in a video by fusing information from the audio and visual signals.