Deterministic and Reliable Software-Defined Vehicles: key building blocks, challenges, and vision

torvalds/linux 24 Jul 2024

As vehicle systems become increasingly complex, with more features, services, sensors, actuators, and processing units, it is important to view vehicles not just as modes of transportation moving toward full autonomy, but also as adaptive systems that respond to the needs of their occupants.

Distributed, Parallel, and Cluster Computing; Networking and Internet Architecture; C.2.1; C.2.4; C.0; D.2.1; D.2.11; J.7; K.6.4

Empowering Robot Path Planning with Large Language Models: osmAG Map Topology & Hierarchy Comprehension with LLMs

hiyouga/llama-factory 13 Mar 2024

Large Language Models (LLMs) have demonstrated great potential in robotic applications by providing essential general knowledge.

Robotics

Enhancing Empathetic Response Generation by Augmenting LLMs with Small-scale Empathetic Models

hiyouga/llama-factory 19 Feb 2024

Current large language models (LLMs) excel in response expression; however, they lack the ability to deeply understand emotional and cognitive nuances, particularly in pinpointing fine-grained emotions and their triggers.

Human-Computer Interaction

UbiPhysio: Support Daily Functioning, Fitness, and Rehabilitation with Action Understanding and Feedback in Natural Language

hiyouga/llama-factory 21 Aug 2023

The proposed UbiPhysio framework comprises a fine-grained action descriptor and a knowledge retrieval-enhanced feedback module.

Human-Computer Interaction

Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

coqui-ai/TTS 11 Jun 2021

Several recent end-to-end text-to-speech (TTS) models enabling single-stage training and parallel sampling have been proposed, but their sample quality does not match that of two-stage TTS systems.

Sound; Audio and Speech Processing

Multi-band MelGAN: Faster Waveform Generation for High-Quality Text-to-Speech

coqui-ai/TTS Interspeech 2020

In this paper, we propose multi-band MelGAN, a much faster waveform generation model targeting high-quality text-to-speech.

Sound; Audio and Speech Processing

google-research

google-research/google-research NeurIPS 2020

Google Research

Data Structures and Algorithms

Manu: A Cloud Native Vector Database Management System

milvus-io/milvus 28 Jun 2022

In the past three years, through interaction with our 1200+ industry users, we have sketched a vision for the features that next-generation vector databases should have, which include long-term evolvability, tunable consistency, good elasticity, and high performance.

Databases

Array Programming with NumPy

numpy/numpy 18 Jun 2020

Array programming provides a powerful, compact, expressive syntax for accessing, manipulating, and operating on data in vectors, matrices, and higher-dimensional arrays.

Mathematical Software; Computation

MediaPipe: A Framework for Building Perception Pipelines

google/mediapipe 14 Jun 2019

A developer can use MediaPipe to build prototypes by combining existing perception components, to advance them to polished cross-platform applications and measure system performance and resource consumption on target platforms.

Distributed, Parallel, and Cluster Computing