1 code implementation • 14 Mar 2024 • Jiaqing Zhang, Mingxiang Cao, Xue Yang, Weiying Xie, Jie Lei, Daixun Li, Geng Yang, Wenbo Huang, Yunsong Li
Multimodal image fusion and object detection play a vital role in autonomous driving.
1 code implementation • 6 Jan 2024 • Jiaqing Zhang, Jie Lei, Weiying Xie, Geng Yang, Daixun Li, Yunsong Li
Additionally, the information distribution flow (IDF) in MIVit enhances performance-awareness by distributing global classification information across different modalities' feature maps.
no code implementations • 17 Sep 2021 • Xiuqiang He, Hua Geng, Geng Yang
It is deemed that a DEM can be used to represent the whole WF to evaluate its impact on the SSS of power systems, as long as the frequency response of the DEM adequately matches that of the detailed WF model around the frequency of oscillation modes of concern.
1 code implementation • 12 Aug 2020 • Haohe Liu, Lei Xie, Jian Wu, Geng Yang
We aim to address the major issues in CNN-based high-resolution MSS model: high computational cost and weight sharing between distinctly different bands.
Audio and Speech Processing Sound
9 code implementations • Interspeech2020 2020 • Geng Yang, Shan Yang, Kai Liu, Peng Fang, Wei Chen, Lei Xie
In this paper, we propose multi-band MelGAN, a much faster waveform generation model targeting to high-quality text-to-speech.
Sound Audio and Speech Processing