Trending Research

LCB-net: Long-Context Biasing for Audio-Visual Speech Recognition

alibaba-damo-academy/FunASR 12 Jan 2024

The growing prevalence of online conferences and courses presents a new challenge in improving automatic speech recognition (ASR) with enriched textual information from video slides.

Sound Multimedia Audio and Speech Processing

3,750
0.40 stars / hour

Empowering Robotics with Large Language Models: osmAG Map Comprehension with LLMs

hiyouga/llama-factory 13 Mar 2024

In this letter, we address the problem of enabling LLMs to comprehend Area Graph, a text-based map representation, in order to enhance their applicability in the field of mobile robotics.

Robotics

22,198
0.26 stars / hour

Neural Concatenative Singing Voice Conversion: Rethinking Concatenation-Based Approach for One-Shot Singing Voice Conversion

thuhcsi/neucosvc 8 Dec 2023

During inference, voice conversion is performed by substituting source SSL features with their nearest counterparts from a matching pool which comprises SSL features extracted from the reference audio, while preserving raw harmonic signals and loudness from the source audio.

Sound Audio and Speech Processing

148
0.25 stars / hour
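
The matching step described in this entry (replacing each source SSL feature with its nearest counterpart from a pool built from the reference audio) can be illustrated with a minimal sketch. The function names, the cosine-similarity metric, and the use of a single nearest neighbor per frame are assumptions made for illustration only; the listing does not specify them. The harmonic and loudness signals that the entry says are preserved from the source audio are not modeled here.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def substitute_with_nearest(source_frames, matching_pool):
    """Replace every source frame with its most similar pool frame.

    Hypothetical helper: the metric and the 1-nearest-neighbor choice
    are assumptions, not details taken from the paper.
    """
    if not matching_pool:
        raise ValueError("matching pool must not be empty")
    converted = []
    for frame in source_frames:
        best = max(matching_pool, key=lambda ref: cosine_similarity(frame, ref))
        converted.append(best)
    return converted


# Toy 2-dimensional "features"; real SSL features have far more dimensions.
source = [[1.0, 0.0], [0.0, 1.0], [0.8, 0.6]]
pool = [[0.9, 0.1], [0.1, 0.9]]
converted = substitute_with_nearest(source, pool)
```

In this toy run every output frame comes from the pool, so the converted sequence carries only reference-speaker features while keeping the frame order of the source.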

i-Octree: A Fast, Lightweight, and Dynamic Octree for Proximity Search

zhujun3753/i-octree 15 Sep 2023

Establishing the correspondences between newly acquired points and historically accumulated data (i.e., map) through nearest neighbors search is crucial in numerous robotic applications.

Robotics

197
0.15 stars / hour

ViPlanner: Visual Semantic Imperative Learning for Local Navigation

leggedrobotics/viplanner 2 Oct 2023

This optimization uses a differentiable formulation of a semantic costmap, which enables the planner to distinguish between the traversability of different terrains and accurately identify obstacles.

Robotics

181
0.14 stars / hour

Ca$^2$Lib: Simple and Accurate LiDAR-RGB Calibration using Small Common Markers

rvp-group/ca2lib 14 Sep 2023

Our approach exploits the planarity of the target to find correspondences between the sensor measurements, leading to features that are robust to LiDAR noise.

Robotics

21
0.13 stars / hour

COIN-LIO: Complementary Intensity-Augmented LiDAR Inertial Odometry

ethz-asl/COIN-LIO 2 Oct 2023

To effectively leverage intensity as an additional modality, we present a novel feature selection scheme that detects uninformative directions in the point cloud registration and explicitly selects patches with complementary image information.

Robotics

124
0.13 stars / hour

Stream-K: Work-centric Parallel Decomposition for Dense Matrix-Matrix Multiplication on the GPU

flashinfer-ai/flashinfer 9 Jan 2023

We introduce Stream-K, a work-centric parallelization of matrix multiplication (GEMM) and related computations in dense linear algebra.

Data Structures and Algorithms Distributed, Parallel, and Cluster Computing

663
0.11 stars / hour
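
As a rough illustration of what a work-centric decomposition means, the sketch below splits the total number of inner-loop (k) iterations across all output tiles evenly among workers, so that a worker's share may cross tile boundaries. This is index bookkeeping in plain Python, not GPU code; the function name, the parameters, and the tuple layout are hypothetical, and the listing itself gives no implementation details. A tile that ends up shared between workers would still need its partial sums combined afterwards, which is not shown.

```python
def stream_k_partition(num_tiles, iters_per_tile, num_workers):
    """Assign contiguous ranges of k-iterations to workers.

    Returns one list per worker; each element is a tuple
    (tile_index, k_start, k_end) with k_end exclusive.
    Hypothetical helper for illustration only.
    """
    if num_workers <= 0 or num_tiles < 0 or iters_per_tile <= 0:
        raise ValueError("invalid problem or worker dimensions")
    total = num_tiles * iters_per_tile
    base, extra = divmod(total, num_workers)
    assignments = []
    start = 0
    for worker in range(num_workers):
        # The first `extra` workers take one additional iteration.
        count = base + (1 if worker < extra else 0)
        end = start + count
        segments = []
        it = start
        while it < end:
            tile = it // iters_per_tile
            # Stop at the end of this tile or of this worker's range.
            tile_end = min(end, (tile + 1) * iters_per_tile)
            segments.append(
                (tile, it % iters_per_tile, tile_end - tile * iters_per_tile)
            )
            it = tile_end
        assignments.append(segments)
        start = end
    return assignments


# 3 output tiles with 4 k-iterations each, shared by 4 workers.
plan = stream_k_partition(num_tiles=3, iters_per_tile=4, num_workers=4)
```

With 12 iterations and 4 workers, each worker receives 3 iterations, so the two middle workers each span two tiles. A tile-centric split of the same problem would leave one of the four workers idle.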

LOG-LIO2: A LiDAR-Inertial Odometry with Efficient Uncertainty Analysis

tiev-tongji/log-lio2 2 May 2024

Uncertainty in LiDAR measurements, stemming from factors such as range sensing, is crucial for LIO (LiDAR-Inertial Odometry) systems as it affects the accurate weighting in the loss function.

Robotics

68
0.10 stars / hour

Code Generation for Conic Model-Predictive Control on Microcontrollers with TinyMPC

TinyMPC/TinyMPC 26 Mar 2024

Conic constraints appear in many important control applications like legged locomotion, robotic manipulation, and autonomous rocket landing.

Robotics Systems and Control Optimization and Control

358
0.10 stars / hour