Search Results for author: Ivan Vovk

Found 3 papers, 3 papers with code

A Semi-Supervised Deep Learning Approach to Dataset Collection for Query-By-Humming Task

1 code implementation2 Dec 2023 Amantur Amatov, Dmitry Lamanov, Maksim Titov, Ivan Vovk, Ilya Makarov, Mikhail Kudinov

To expand our dataset, we employ a semi-supervised model training pipeline that leverages the QbH task as a specialized case of cover song identification (CSI) task.

Cover song identification

Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech

6 code implementations13 May 2021 Vadim Popov, Ivan Vovk, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Kudinov

Recently, denoising diffusion probabilistic models and generative score matching have shown high potential in modelling complex data distributions while stochastic calculus has provided a unified point of view on these techniques allowing for flexible inference schemes.

Ranked #3 on Text-To-Speech Synthesis on LJSpeech (using extra training data)

Decoder Speech Synthesis +1

Cannot find the paper you are looking for? You can Submit a new open access paper.