Search Results for author: Alexey Karpov

Found 12 papers, 4 papers with code

RUSAVIC Corpus: Russian Audio-Visual Speech in Cars

no code implementations • LREC 2022 • Denis Ivanko, Alexandr Axyonov, Dmitry Ryumin, Alexey Kashevnik, Alexey Karpov

We present a new audio-visual speech corpus (RUSAVIC) recorded in a car environment and designed for noise-robust speech recognition.

Audio-Visual Speech Recognition Lip Reading +3

Audio-Visual Speech Recognition based on Regulated Transformer and Spatio-Temporal Fusion Strategy for Driver Assistive Systems

1 code implementation • Expert Systems with Applications 2024 • Dmitry Ryumin, Alexandr Axyonov, Elena Ryumina, Denis Ivanko, Alexey Kashevnik, Alexey Karpov

The article introduces a novel audio-visual speech command recognition transformer (AVCRFormer) specifically designed for robust AVSR.

Ranked #1 on Audio-Visual Speech Recognition on LRW

Audio-Visual Speech Recognition Lipreading +3

Audio-Visual Compound Expression Recognition Method based on Late Modality Fusion and Rule-based Decision

no code implementations • 19 Mar 2024 • Elena Ryumina, Maxim Markitantov, Dmitry Ryumin, Heysem Kaya, Alexey Karpov

Our findings from the challenge demonstrate that the proposed method can potentially form a basis for developing intelligent tools for annotating audio-visual data in the context of basic and compound human emotions.

Cross-corpus Emotion Recognition +1

SUN Team's Contribution to ABAW 2024 Competition: Audio-visual Valence-Arousal Estimation and Expression Recognition

no code implementations • 19 Mar 2024 • Denis Dresvyanskiy, Maxim Markitantov, Jiawei Yu, Peitong Li, Heysem Kaya, Alexey Karpov

As emotions play a central role in human communication, automatic emotion recognition has attracted increasing attention in the last two decades.

Arousal Estimation Emotion Recognition

OCEAN-AI framework with EmoFormer cross-hemiface attention approach for personality traits assessment

1 code implementation • Expert Systems with Applications 2023 • Elena Ryumina, Maxim Markitantov, Dmitry Ryumin, Alexey Karpov

Psychological and neurological studies earlier suggested that a personality type can be determined by the whole face as well as by its sides.

Ranked #1 on Personality Trait Recognition by Face on First Impressions v2

Personality Trait Recognition Personality Trait Recognition by Face

In Search of a Robust Facial Expressions Recognition Model: A Large-Scale Visual Cross-Corpus Study

1 code implementation • Neurocomputing 2022 • Elena Ryumina, Denis Dresvyanskiy, Alexey Karpov

Many researchers have been seeking a robust emotion recognition system for the last two decades.

Ranked #1 on Facial Expression Recognition (FER) on Aff-Wild2

Cross-corpus Emotion Recognition +1

Visual Speech Recognition in a Driver Assistance System

no code implementations • 30th European Signal Processing Conference (EUSIPCO) 2022 • Denis Ivanko, Dmitry Ryumin, Alexey Kashevnik, Alexandr Axyonov, Alexey Karpov

After a comprehensive evaluation, we adapt the developed method and test it on the RUSAVIC corpus, which we recorded in the wild for vehicle drivers.

Ranked #4 on Lipreading on Lip Reading in the Wild

Data Augmentation Lipreading +3

An Audio-Video Deep and Transfer Learning Framework for Multimodal Emotion Recognition in the wild

no code implementations • 7 Oct 2020 • Denis Dresvyanskiy, Elena Ryumina, Heysem Kaya, Maxim Markitantov, Alexey Karpov, Wolfgang Minker

In this paper, we present our contribution to the ABAW facial expression challenge.

Multimodal Emotion Recognition Transfer Learning

Is Everything Fine, Grandma? Acoustic and Linguistic Modeling for Robust Elderly Speech Emotion Recognition

1 code implementation • 7 Sep 2020 • Gizem Soğancıoğlu, Oxana Verkholyak, Heysem Kaya, Dmitrii Fedotov, Tobias Cadèe, Albert Ali Salah, Alexey Karpov

Acoustic and linguistic analysis for elderly emotion recognition is an under-studied and challenging research direction, but essential for the creation of digital assistants for the elderly, as well as unobtrusive telemonitoring of elderly in their residences for mental healthcare purposes.

Speech Emotion Recognition

Class-based LSTM Russian Language Model with Linguistic Information

no code implementations • LREC 2020 • Irina Kipyatkova, Alexey Karpov

We achieved a WER of 14.94% on our own corpus of continuous Russian speech, a 15% relative reduction with respect to the baseline 3-gram model.

Language Modelling speech-recognition +1

TheRuSLan: Database of Russian Sign Language

no code implementations • LREC 2020 • Ildar Kagirov, Denis Ivanko, Dmitry Ryumin, Alexander Axyonov, Alexey Karpov

The database includes lexical units (single words and phrases) from Russian sign language within one subject area, namely, "food products at the supermarket", and was collected using MS Kinect 2.0 device including both FullHD video and the depth map modes, which provides new opportunities for the lexicographical description of the Russian sign language vocabulary and enhances research in the field of automatic gesture recognition.

Gesture Recognition Sign Language Recognition

Cross-Corpus Data Augmentation for Acoustic Addressee Detection

no code implementations • WS 2019 • Oleg Akhtiamov, Ingo Siegert, Alexey Karpov, Wolfgang Minker

Mixup is shown to be beneficial for merging acoustic data (extracted features but not raw waveforms) from different domains that allows us to reach a higher classification performance on human-machine AD and also for training a multipurpose neural network that is capable of solving both human-machine and adult-child AD problems.

Cross-corpus Data Augmentation +1
