Robots Learn Social Skills: End-to-End Learning of Co-Speech Gesture Generation for Humanoid Robots

PantoMatrix/BEAT ICRA 2019

Co-speech gestures enhance interaction experiences between humans as well as between humans and robots.


PIEEG: Turn a Raspberry Pi into a Brain-Computer-Interface to measure biosignals

Ildaron/EEGwithRaspberryPI journal 2022

This paper presents PIEEG, an inexpensive, high-precision, yet easy-to-maintain board that converts a Raspberry Pi into a brain-computer interface.

Human-Computer Interaction Robotics

Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

jaywalnut310/vits 11 Jun 2021

Several recent end-to-end text-to-speech (TTS) models enabling single-stage training and parallel sampling have been proposed, but their sample quality does not match that of two-stage TTS systems.

Sound Audio and Speech Processing

WhisperX: Time-Accurate Speech Transcription of Long-Form Audio

m-bain/whisperx 1 Mar 2023

Large-scale, weakly-supervised speech recognition models, such as Whisper, have demonstrated impressive results on speech recognition across domains and languages.

Sound Audio and Speech Processing

Multi-band MelGAN: Faster Waveform Generation for High-Quality Text-to-Speech

coqui-ai/TTS Interspeech 2020

In this paper, we propose multi-band MelGAN, a much faster waveform generation model targeting high-quality text-to-speech.

Sound Audio and Speech Processing

ROG-Map: An Efficient Robocentric Occupancy Grid Map for Large-scene and High-resolution LiDAR-based Motion Planning

hku-mars/rog-map 28 Feb 2023

Recent advances in LiDAR technology have opened up new possibilities for robotic navigation.


mdspan in C++: A Case Study in the Integration of Performance Portable Features into International Language Standards

rapidsai/raft 13 Oct 2020

Multi-dimensional arrays are ubiquitous in high-performance computing (HPC), but their absence from the C++ language standard is a long-standing and well-known limitation of their use for HPC.

Distributed, Parallel, and Cluster Computing Programming Languages

Bubble Planner: Planning High-speed Smooth Quadrotor Trajectories using Receding Corridors

hku-mars/FAST_LIO 24 Feb 2022

Together, these two designs enlarge the corridor spaces in accordance with the quadrotor's current state and hence allow the quadrotor to maneuver at high speeds.


Monocular Simultaneous Localization and Mapping using Ground Textures

navy-rise-lab/ground-texture-slam 10 Mar 2023

These keypoints, visual bags of words, and several threshold parameters are then used to identify overlapping images and revisited areas.


$D^2$SLAM: Decentralized and Distributed Collaborative Visual-inertial SLAM System for Aerial Swarm

hkust-aerial-robotics/d2slam 3 Nov 2022

This paper presents $D^2$SLAM, a novel decentralized and distributed ($D^2$) CSLAM system that covers two scenarios: near-field estimation for high-accuracy state estimation at close range and far-field estimation for consistent global trajectory estimation.


