Search Results for author: Sourish Chaudhuri

Found 6 papers, 3 papers with code

AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection

1 code implementation • 5 Jan 2019 • Joseph Roth, Sourish Chaudhuri, Ondrej Klejch, Radhika Marvin, Andrew Gallagher, Liat Kaver, Sharadh Ramaswamy, Arkadiusz Stopczynski, Cordelia Schmid, Zhonghua Xi, Caroline Pantofaru

The dataset contains temporally labeled face tracks in video, where each face instance is labeled as speaking or not, and whether the speech is audible.

Audio-Visual Active Speaker Detection speaker-diarization +2

AVA-Speech: A Densely Labeled Dataset of Speech Activity in Movies

1 code implementation • 2 Aug 2018 • Sourish Chaudhuri, Joseph Roth, Daniel P. W. Ellis, Andrew Gallagher, Liat Kaver, Radhika Marvin, Caroline Pantofaru, Nathan Reale, Loretta Guarino Reid, Kevin Wilson, Zhonghua Xi

Speech activity detection (or endpointing) is an important processing step for applications such as speech recognition, language identification and speaker diarization.

Sound Audio and Speech Processing

Putting a Face to the Voice: Fusing Audio and Visual Signals Across a Video to Determine Speakers

no code implementations • 31 May 2017 • Ken Hoover, Sourish Chaudhuri, Caroline Pantofaru, Malcolm Slaney, Ian Sturdy

In this paper, we present a system that associates faces with voices in a video by fusing information from the audio and visual signals.

CNN Architectures for Large-Scale Audio Classification

16 code implementations • 29 Sep 2016 • Shawn Hershey, Sourish Chaudhuri, Daniel P. W. Ellis, Jort F. Gemmeke, Aren Jansen, R. Channing Moore, Manoj Plakal, Devin Platt, Rif A. Saurous, Bryan Seybold, Malcolm Slaney, Ron J. Weiss, Kevin Wilson

Convolutional Neural Networks (CNNs) have proven very effective in image classification and show promise for audio.

Audio Classification Event Detection +1

Plagiarism Detection in Polyphonic Music using Monaural Signal Separation

no code implementations • 27 Feb 2015 • Soham De, Indradyumna Roy, Tarunima Prabhakar, Kriti Suneja, Sourish Chaudhuri, Rita Singh, Bhiksha Raj

Given the large number of new musical tracks released each year, automated approaches to plagiarism detection are essential to help us track potential violations of copyright.

General Classification

Unsupervised Structure Discovery for Semantic Analysis of Audio

no code implementations • NeurIPS 2012 • Sourish Chaudhuri, Bhiksha Raj

Approaches to audio classification and retrieval tasks largely rely on detection-based discriminative models.

Audio Classification General Classification +1
