A Window-based Discriminator is a type of discriminator for generative adversarial networks. It is analogous to a PatchGAN but designed for audio. While a standard GAN discriminator learns to classify between distributions of entire audio sequences, window-based discriminator learns to classify between distribution of small audio chunks. Since the discriminator loss is computed over the overlapping windows where each window is very large (equal to the receptive field of the discriminator), the model learns to maintain coherence across patches.
Source: MelGAN: Generative Adversarial Networks for Conditional Waveform SynthesisPAPER | DATE |
---|---|
Improve GAN-based Neural Vocoder using Pointwise Relativistic LeastSquare GAN
• • • |
2021-03-26 |
Universal MelGAN: A Robust Neural Vocoder for High-Fidelity Waveform Generation in Multiple Domains
• • |
2020-11-19 |
StyleMelGAN: An Efficient High-Fidelity Adversarial Vocoder with Temporal Adaptive Normalization
• • |
2020-11-03 |
SpeedySpeech: Efficient Neural Speech Synthesis
• |
2020-08-09 |
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
• • • • |
2020-07-30 |
Adversarial representation learning for private speech generation
• • • • |
2020-06-16 |
SE-MelGAN -- Speaker Agnostic Rapid Speech Enhancement
• |
2020-06-13 |
MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis
• • • • • • • • |
2019-10-08 |
TASK | PAPERS | SHARE |
---|---|---|
Speech Synthesis | 4 | 80.00% |
Speech Enhancement | 1 | 20.00% |
COMPONENT | TYPE |
|
---|---|---|
![]() |
Convolutions |