Search Results for author: Ivan Vovk

Found 3 papers, 3 papers with code

A Semi-Supervised Deep Learning Approach to Dataset Collection for Query-By-Humming Task

1 code implementation • 2 Dec 2023 • Amantur Amatov, Dmitry Lamanov, Maksim Titov, Ivan Vovk, Ilya Makarov, Mikhail Kudinov

To expand our dataset, we employ a semi-supervised model training pipeline that leverages the QbH task as a specialized case of cover song identification (CSI) task.

Cover song identification

Paper
Code

Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling Scheme

4 code implementations • ICLR 2022 • Vadim Popov, Ivan Vovk, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Kudinov, Jiansheng Wei

Voice conversion is a common speech synthesis task which can be solved in different ways depending on a particular real-world scenario.

Speech Synthesis Voice Conversion

534

Paper
Code

Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech

6 code implementations • 13 May 2021 • Vadim Popov, Ivan Vovk, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Kudinov

Recently, denoising diffusion probabilistic models and generative score matching have shown high potential in modelling complex data distributions while stochastic calculus has provided a unified point of view on these techniques allowing for flexible inference schemes.

Ranked #3 on Text-To-Speech Synthesis on LJSpeech (using extra training data)

Decoder Speech Synthesis +1

534

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.