Trending Research

GPTR: Gaussian Process Trajectory Representation for Continuous-Time Motion Estimation

brytsknguyen/gptr 30 Oct 2024

To bolster the adoption of the continuous-time paradigm, we propose a so-called Gaussian Process Trajectory Representation (GPTR) framework for continuous-time motion estimation (CTME) tasks.

Robotics

249
0.57 stars / hour

QMDB: Quick Merkle Database

layerzero-labs/qmdb 9 Jan 2025

This performance enables support for 1 million token transfers per second (TPS), marking QMDB as the first solution achieving such a milestone.

Networking and Internet Architecture Databases

198
0.23 stars / hour

P4Testgen: An Extensible Test Oracle For P4

p4lang/p4c 28 Nov 2022

We have instantiated P4Testgen for the V1model, eBPF, PNA, and Tofino P4 architectures.

Networking and Internet Architecture Symbolic Computation Software Engineering

726
0.15 stars / hour

Empowering Robot Path Planning with Large Language Models: osmAG Map Topology & Hierarchy Comprehension with LLMs

hiyouga/llama-factory 13 Mar 2024

Large Language Models (LLMs) have demonstrated great potential in robotic applications by providing essential general knowledge.

Robotics

38,537
0.13 stars / hour

SingVisio: Visual Analytics of Diffusion Model for Singing Voice Conversion

open-mmlab/amphion 20 Feb 2024

In this study, we present SingVisio, an interactive visual analysis system that aims to explain the diffusion model used in singing voice conversion.

Sound Human-Computer Interaction Audio and Speech Processing

8,159
0.10 stars / hour

Leveraging Diverse Semantic-based Audio Pretrained Models for Singing Voice Conversion

open-mmlab/Amphion 17 Oct 2023

We discover that the knowledge of different models is diverse and can be complementary for SVC.

Sound Audio and Speech Processing

8,159
0.10 stars / hour

HiFi-SR: A Unified Generative Transformer-Convolutional Adversarial Network for High-Fidelity Speech Super-Resolution

modelscope/ClearerVoice-Studio 17 Jan 2025

However, existing SR methods that typically rely on independently trained and concatenated networks may lead to inconsistent representations and poor speech quality, especially in out-of-domain scenarios.

Sound Audio and Speech Processing

2,054
0.10 stars / hour

Stream-K: Work-centric Parallel Decomposition for Dense Matrix-Matrix Multiplication on the GPU

flashinfer-ai/flashinfer 9 Jan 2023

We introduce Stream-K, a work-centric parallelization of matrix multiplication (GEMM) and related computations in dense linear algebra.

Data Structures and Algorithms Distributed, Parallel, and Cluster Computing

1,824
0.10 stars / hour

Think2Drive: Efficient Reinforcement Learning by Thinking in Latent World Model for Quasi-Realistic Autonomous Driving (in CARLA-v2)

Thinklab-SJTU/Bench2Drive 26 Feb 2024

As a result, Think2Drive is able to run in an expert-level proficiency in CARLA v2 within 3 days of training on a single A6000 GPU, and to our best knowledge, so far there is no reported success (100\% route completion)on CARLA v2.

Robotics

1,023
0.09 stars / hour

CAM++: A Fast and Efficient Network for Speaker Verification Using Context-Aware Masking

alibaba-damo-academy/3D-Speaker 1 Mar 2023

Time delay neural network (TDNN) has been proven to be efficient for speaker verification.

Sound Audio and Speech Processing

1,528
0.09 stars / hour