Search Results for author: Mengli Cheng

Found 8 papers, 3 papers with code

EasyAnimate: A High-Performance Long Video Generation Method based on Transformer Architecture

1 code implementation • 29 May 2024 • Jiaqi Xu, Xinyi Zou, Kunzhe Huang, Yunkuo Chen, Bo Liu, Mengli Cheng, Xing Shi, Jun Huang

The motion module can be adapted to various DiT baseline methods to generate video with different styles.

Image Generation Video Generation

493

Paper
Code

DiffPoint: Single and Multi-view Point Cloud Reconstruction with ViT Based Diffusion Model

no code implementations • 17 Feb 2024 • Yu Feng, Xing Shi, Mengli Cheng, Yun Xiong

As the task of 2D-to-3D reconstruction has gained significant attention in various real-world scenarios, it becomes crucial to be able to generate high-quality point clouds.

Point cloud reconstruction

Paper
Add Code

MuLTI: Efficient Video-and-Language Understanding with Text-Guided MultiWay-Sampler and Multiple Choice Modeling

no code implementations • 10 Mar 2023 • Jiaqi Xu, Bo Liu, Yunkuo Chen, Mengli Cheng, Xing Shi

Specifically, we design a Text-Guided MultiWay-Sampler based on adapt-pooling residual mapping and self-attention modules to sample long sequences and fuse multi-modal features, which reduces the computational costs and addresses performance degradation caused by previous samplers.

Ranked #1 on TGIF-Transition on TGIF-QA (using extra training data)

Multi-Label Classification Multiple-choice +8

Paper
Add Code

EasyRec: An easy-to-use, extendable and efficient framework for building industrial recommendation systems

1 code implementation • 26 Sep 2022 • Mengli Cheng, Yue Gao, Guoqiang Liu, Hongsheng Jin, Xiaowen Zhang

We present EasyRec, an easy-to-use, extendable and efficient recommendation framework for building industrial recommendation systems.

feature selection Recommendation Systems

1,557

Paper
Code

EasyASR: A Distributed Machine Learning Platform for End-to-end Automatic Speech Recognition

no code implementations • 14 Sep 2020 • Chengyu Wang, Mengli Cheng, Xu Hu, Jun Huang

We present EasyASR, a distributed machine learning platform for training and serving large-scale Automatic Speech Recognition (ASR) models, as well as collecting and processing audio data at scale.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

One-shot Text Field Labeling using Attention and Belief Propagation for Structure Information Extraction

1 code implementation • 9 Sep 2020 • Mengli Cheng, Minghui Qiu, Xing Shi, Jun Huang, Wei. Lin

Existing learning based methods for text labeling task usually require a large amount of labeled examples to train a specific model for each type of document.

One-Shot Learning Text Detection

Paper
Code

Weakly Supervised Construction of ASR Systems with Massive Video Data

no code implementations • 4 Aug 2020 • Mengli Cheng, Chengyu Wang, Xu Hu, Jun Huang, Xiaobo Wang

Building Automatic Speech Recognition (ASR) systems from scratch is significantly challenging, mostly due to the time-consuming and financially-expensive process of annotating a large amount of audio data with transcripts.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Add Code

IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection

no code implementations • 3 May 2018 • Qiangpeng Yang, Mengli Cheng, Wenmeng Zhou, Yan Chen, Minghui Qiu, Wei. Lin, Wei Chu

To solve this problem, we propose a novel end-to-end scene text detector IncepText from an instance-aware segmentation perspective.

Multi-Oriented Scene Text Detection object-detection +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.