Resilient and Distributed Multi-Robot Visual SLAM: Datasets, Experiments, and Lessons Learned

mit-spark/kimera-multi 10 Apr 2023

This paper revisits Kimera-Multi, a distributed multi-robot Simultaneous Localization and Mapping (SLAM) system, towards the goal of deployment in the real world.


Aerial Gym -- Isaac Gym Simulator for Aerial Robots

ntnu-arl/aerial_gym_simulator 25 May 2023

This simulator is a step towards a highly parallelized aerial robot simulation with geometric controllers at large scale, which is currently missing, while also providing customizable obstacle randomization for navigation tasks.


WhisperX: Time-Accurate Speech Transcription of Long-Form Audio

m-bain/whisperx 1 Mar 2023

Large-scale, weakly-supervised speech recognition models, such as Whisper, have demonstrated impressive results on speech recognition across domains and languages.

Sound; Audio and Speech Processing

Xor Filters: Faster and Smaller Than Bloom and Cuckoo Filters

hexops/xorfilter 17 Dec 2019

We find that xor filters can be faster than Bloom and cuckoo filters while using less memory.

Data Structures and Algorithms

Manu: A Cloud Native Vector Database Management System

milvus-io/milvus 28 Jun 2022

In the past three years, through interaction with our 1200+ industry users, we have sketched a vision for the features that next-generation vector databases should have, which include long-term evolvability, tunable consistency, good elasticity, and high performance.


Dynablox: Real-time Detection of Diverse Dynamic Objects in Complex Environments

ethz-asl/dynablox 20 Apr 2023

The spatio-temporally conservative free space estimate enables robust detection of moving objects without making any assumptions on the appearance of objects or environments.


BRAVO -- Biased Locking for Reader-Writer Locks

puzpuzpuz/xsync 3 Oct 2018

Readers make their presence known to writers by hashing their thread's identity with the lock address, forming an index into a visible readers table.

Operating Systems; D.1.3
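
The reader-side step described in the excerpt above can be illustrated with a small sketch. It is not taken from the paper or from puzpuzpuz/xsync: the table size, the mixing function, and the names `slot_index`, `try_publish_reader`, `release_reader`, and `writer_sees_readers` are all assumptions made for illustration. A real implementation would claim the slot with an atomic compare-and-swap, and it needs a fallback path when the slot is already taken; here that case simply returns `None`.

```python
import threading
from typing import Optional

TABLE_SIZE = 4096  # assumed size; a power of two so the index can be taken with a mask
visible_readers = [None] * TABLE_SIZE  # shared table that writers can scan


def slot_index(thread_id: int, lock_addr: int) -> int:
    """Mix the thread identity with the lock address and reduce it to a table index."""
    mixed = (thread_id * 0x9E3779B97F4A7C15) ^ lock_addr  # hypothetical mixing step
    mixed &= 0xFFFFFFFFFFFFFFFF   # keep 64 bits
    mixed ^= mixed >> 32          # fold the high bits into the low bits
    return mixed & (TABLE_SIZE - 1)


def try_publish_reader(lock: object) -> Optional[int]:
    """Announce the calling thread as a reader of `lock`.

    Returns the slot index on success, or None if the slot is occupied.
    The check-then-set below is NOT atomic; it stands in for a compare-and-swap.
    """
    idx = slot_index(threading.get_ident(), id(lock))
    if visible_readers[idx] is None:
        visible_readers[idx] = lock
        return idx
    return None


def release_reader(idx: int) -> None:
    """Clear the slot once the read-side critical section ends."""
    visible_readers[idx] = None


def writer_sees_readers(lock: object) -> bool:
    """A writer scans the table for slots that still reference `lock`."""
    return any(entry is lock for entry in visible_readers)
```

In this sketch, a reader that receives an index holds a published slot until it calls `release_reader`, and a writer would wait until `writer_sees_readers` returns `False` before proceeding.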

Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

jaywalnut310/vits 11 Jun 2021

Several recent end-to-end text-to-speech (TTS) models enabling single-stage training and parallel sampling have been proposed, but their sample quality does not match that of two-stage TTS systems.

Sound; Audio and Speech Processing

iPlanner: Imperative Path Planning

leggedrobotics/iplanner 22 Feb 2023

Additionally, the imperative learning (IL) approach enables the planner to generalize to various unseen environments, resulting in an overall 26-87% improvement in SPL performance compared to baseline learning methods.


Asynchronous Multiple LiDAR-Inertial Odometry using Point-wise Inter-LiDAR Uncertainty Propagation

minwoo0611/ma-lio 26 May 2023

In recent years, multiple Light Detection and Ranging (LiDAR) systems have grown in popularity due to their enhanced accuracy and stability from the increased field of view (FOV).


