1 code implementation • 29 May 2024 • Jiaqi Xu, Xinyi Zou, Kunzhe Huang, Yunkuo Chen, Bo Liu, Mengli Cheng, Xing Shi, Jun Huang
The motion module can be adapted to various DiT baseline methods to generate video with different styles.
no code implementations • 17 Feb 2024 • Yu Feng, Xing Shi, Mengli Cheng, Yun Xiong
As the task of 2D-to-3D reconstruction has gained significant attention in various real-world scenarios, it becomes crucial to be able to generate high-quality point clouds.
no code implementations • 10 Mar 2023 • Jiaqi Xu, Bo Liu, Yunkuo Chen, Mengli Cheng, Xing Shi
Specifically, we design a Text-Guided MultiWay-Sampler based on adapt-pooling residual mapping and self-attention modules to sample long sequences and fuse multi-modal features, which reduces the computational costs and addresses performance degradation caused by previous samplers.
Ranked #1 on TGIF-Transition on TGIF-QA (using extra training data)
1 code implementation • 26 Sep 2022 • Mengli Cheng, Yue Gao, Guoqiang Liu, Hongsheng Jin, Xiaowen Zhang
We present EasyRec, an easy-to-use, extendable and efficient recommendation framework for building industrial recommendation systems.
no code implementations • 14 Sep 2020 • Chengyu Wang, Mengli Cheng, Xu Hu, Jun Huang
We present EasyASR, a distributed machine learning platform for training and serving large-scale Automatic Speech Recognition (ASR) models, as well as collecting and processing audio data at scale.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
1 code implementation • 9 Sep 2020 • Mengli Cheng, Minghui Qiu, Xing Shi, Jun Huang, Wei. Lin
Existing learning based methods for text labeling task usually require a large amount of labeled examples to train a specific model for each type of document.
no code implementations • 4 Aug 2020 • Mengli Cheng, Chengyu Wang, Xu Hu, Jun Huang, Xiaobo Wang
Building Automatic Speech Recognition (ASR) systems from scratch is significantly challenging, mostly due to the time-consuming and financially-expensive process of annotating a large amount of audio data with transcripts.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +4
no code implementations • 3 May 2018 • Qiangpeng Yang, Mengli Cheng, Wenmeng Zhou, Yan Chen, Minghui Qiu, Wei. Lin, Wei Chu
To solve this problem, we propose a novel end-to-end scene text detector IncepText from an instance-aware segmentation perspective.