Search Results for author: Seyed Omid Sadjadi

Found 8 papers, 0 papers with code

The 2021 NIST Speaker Recognition Evaluation

no code implementations21 Apr 2022 Seyed Omid Sadjadi, Craig Greenberg, Elliot Singer, Lisa Mason, Douglas Reynolds

Evaluation results indicate: audio-visual fusion produce substantial gains in performance over audio-only or visual-only systems; top performing speaker and face recognition systems exhibited comparable performance under the matched domain conditions present in this evaluation; and, the use of complex neural network architectures (e. g., ResNet) along with angular losses with margin, data augmentation, as well as long duration fine-tuning contributed to notable performance improvements for the audio-only speaker recognition task.

Data Augmentation Face Recognition +2

The NIST CTS Speaker Recognition Challenge

no code implementations21 Apr 2022 Seyed Omid Sadjadi, Craig Greenberg, Elliot Singer, Lisa Mason, Douglas Reynolds

The US National Institute of Standards and Technology (NIST) has been conducting a second iteration of the CTS challenge since August 2020.

Data Augmentation Speaker Recognition

Multimodal Emotion Recognition using Transfer Learning from Speaker Recognition and BERT-based models

no code implementations16 Feb 2022 Sarala Padi, Seyed Omid Sadjadi, Dinesh Manocha, Ram D. Sriram

Experimental results indicate that both audio and text-based models improve the emotion recognition performance and that the proposed multimodal solution achieves state-of-the-art results on the IEMOCAP benchmark.

Data Augmentation Emotional Intelligence +3

NIST SRE CTS Superset: A large-scale dataset for telephony speaker recognition

no code implementations16 Aug 2021 Seyed Omid Sadjadi

This document provides a brief description of the National Institute of Standards and Technology (NIST) speaker recognition evaluation (SRE) conversational telephone speech (CTS) Superset.

Speaker Recognition

Improved Speech Emotion Recognition using Transfer Learning and Spectrogram Augmentation

no code implementations5 Aug 2021 Sarala Padi, Seyed Omid Sadjadi, Dinesh Manocha, Ram D. Sriram

Automatic speech emotion recognition (SER) is a challenging task that plays a crucial role in natural human-computer interaction.

Emotion Classification Speaker Recognition +2

The IBM 2016 Speaker Recognition System

no code implementations23 Feb 2016 Seyed Omid Sadjadi, Sriram Ganapathy, Jason W. Pelecanos

In this paper we describe the recent advancements made in the IBM i-vector speaker recognition system for conversational speech.

2k Automatic Speech Recognition +3

Cannot find the paper you are looking for? You can Submit a new open access paper.