In this manifesto, we put forward the idea of data alchemy as a narrative device to discuss storytelling and transdisciplinarity in visualization.
Human-Computer Interaction
Several recent end-to-end text-to-speech (TTS) models enabling single-stage training and parallel sampling have been proposed, but their sample quality does not match that of two-stage TTS systems.
Sound Audio and Speech Processing
The need for developing and delivering interactive web applications has grown rapidly.
Human-Computer Interaction
Erd\H{o}s-Ginzburg-Ziv theorem is a famous theorem in additive number theory, which states any sequence of $2n-1$ integers contains a subsequence of $n$ elements, with their sum being a multiple of $n$.
Data Structures and Algorithms Combinatorics
To alleviate these problems, Pulsar employs: 1) a sphere-based scene representation, 2) an efficient differentiable rendering engine, and 3) neural shading.
Graphics
This paper presents Daft-Exprt, a multi-speaker acoustic model advancing the state-of-the-art for cross-speaker prosody transfer on any text.
Sound Audio and Speech Processing
A common tool used by security professionals for reverse-engineering binaries found in the wild is the decompiler.
Software Engineering Programming Languages
In this paper, we propose multi-band MelGAN, a much faster waveform generation model targeting to high-quality text-to-speech.
Sound Audio and Speech Processing
A vocoder is a conditional audio generation model that converts acoustic features such as mel-spectrograms into waveforms.
Sound Audio and Speech Processing
Finder networks in general, and Apple's Find My network in particular, can pose a grave threat to users' privacy and even health if these networks are abused for stalking.
Cryptography and Security Computers and Society