The conclusion is that it is possible (and even convenient) to use computer algebra systems within the Julia framework.
Symbolic Computation
Large-scale, weakly-supervised speech recognition models, such as Whisper, have demonstrated impressive results on speech recognition across domains and languages.
Sound Audio and Speech Processing
Many applications model their data in a general-purpose storage format such as JSON.
Distributed, Parallel, and Cluster Computing Databases
To effectively leverage intensity as an additional modality, we present a novel feature selection scheme that detects uninformative directions in the point cloud registration and explicitly selects patches with complementary image information.
Robotics
During inference, voice conversion is performed by substituting source SSL features with their nearest counterparts from a matching pool which comprises SSL features extracted from the reference audio, while preserving raw harmonic signals and loudness from the source audio.
Sound Audio and Speech Processing
Owing to the prohibitively large overhead (e. g., $10 \times$) of GPUs' native memory allocator, DNN frameworks like PyTorch and TensorFlow adopt a caching allocator that maintains a memory pool with a splitting mechanism for fast memory (de)allocation.
Distributed, Parallel, and Cluster Computing
Odometry estimation is crucial for every autonomous system requiring navigation in an unknown environment.
Robotics
Establishing the correspondences between newly acquired points and historically accumulated data (i. e., map) through nearest neighbors search is crucial in numerous robotic applications.
Robotics
We present egglog, a fixpoint reasoning system that unifies Datalog and equality saturation (EqSat).
Programming Languages
Several recent end-to-end text-to-speech (TTS) models enabling single-stage training and parallel sampling have been proposed, but their sample quality does not match that of two-stage TTS systems.
Sound Audio and Speech Processing