Co-speech gestures enhance interaction experiences between humans as well as between humans and robots.
Robotics
This paper presents an inexpensive, high-precision, but at the same time, easy-to-maintain PIEEG board to convert a RaspberryPI to a Brain-computer interface.
Human-Computer Interaction Robotics
Several recent end-to-end text-to-speech (TTS) models enabling single-stage training and parallel sampling have been proposed, but their sample quality does not match that of two-stage TTS systems.
Sound Audio and Speech Processing
Large-scale, weakly-supervised speech recognition models, such as Whisper, have demonstrated impressive results on speech recognition across domains and languages.
Sound Audio and Speech Processing
In this paper, we propose multi-band MelGAN, a much faster waveform generation model targeting to high-quality text-to-speech.
Sound Audio and Speech Processing
Recent advances in LiDAR technology have opened up new possibilities for robotic navigation.
Robotics
Multi-dimensional arrays are ubiquitous in high-performance computing (HPC), but their absence from the C++ language standard is a long-standing and well-known limitation of their use for HPC.
Distributed, Parallel, and Cluster Computing Programming Languages
Together, these two designs enlarge the corridor spaces in accordance with the quadrotor's current state and hence allow the quadrotor to maneuver at high speeds.
Robotics
These keypoints, visual bags of words, and several threshold parameters are then used to identify overlapping images and revisited areas.
Robotics
This paper presents $D^2$SLAM, a novel decentralized and distributed ($D^2$) CSLAM system that covers two scenarios: near-field estimation for high accuracy state estimation in close range and far-field estimation for consistent global trajectory estimation.
Robotics