Search Results for author: Jianguo Wei

Found 7 papers, 3 papers with code

Attention Guidance Mechanism for Handwritten Mathematical Expression Recognition

no code implementations • 4 Mar 2024 • Yutian Liu, Wenjun Ke, Jianguo Wei

Handwritten mathematical expression recognition (HMER) is challenging in image-to-text tasks due to the complex layouts of mathematical expressions and suffers from problems including over-parsing and under-parsing.

Paper
Add Code

Constructing Holistic Spatio-Temporal Scene Graph for Video Semantic Role Labeling

no code implementations • 9 Aug 2023 • Yu Zhao, Hao Fei, Yixin Cao, Bobo Li, Meishan Zhang, Jianguo Wei, Min Zhang, Tat-Seng Chua

A scene-event mapping mechanism is first designed to bridge the gap between the underlying scene structure and the high-level event semantic structure, resulting in an overall hierarchical scene-event (termed ICE) graph structure.

Semantic Role Labeling

Paper
Add Code

Generating Visual Spatial Description via Holistic 3D Scene Understanding

1 code implementation • 19 May 2023 • Yu Zhao, Hao Fei, Wei Ji, Jianguo Wei, Meishan Zhang, Min Zhang, Tat-Seng Chua

With an external 3D scene extractor, we obtain the 3D objects and scene features for input images, based on which we construct a target object-centered 3D spatial scene graph (Go3D-S2G), such that we model the spatial semantics of target objects within the holistic 3D scenes.

Scene Understanding Text Generation

Paper
Code

Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation

1 code implementation • 20 Oct 2022 • Yu Zhao, Jianguo Wei, Zhichao Lin, Yueheng Sun, Meishan Zhang, Min Zhang

Accordingly, we manually annotate a dataset to facilitate the investigation of the newly-introduced task and build several benchmark encoder-decoder models by using VL-BART and VL-T5 as backbones.

Decoder Image Captioning +1

Paper
Code

TeKo: Text-Rich Graph Neural Networks with External Knowledge

no code implementations • 15 Jun 2022 • Zhizhi Yu, Di Jin, Jianguo Wei, Ziyang Liu, Yue Shang, Yun Xiao, Jiawei Han, Lingfei Wu

Graph Neural Networks (GNNs) have gained great popularity in tackling various analytical tasks on graph-structured data (i. e., networks).

Paper
Add Code

TMS: A Temporal Multi-scale Backbone Design for Speaker Embedding

no code implementations • 17 Mar 2022 • Ruiteng Zhang, Jianguo Wei, Xugang Lu, Wenhuan Lu, Di Jin, Junhai Xu, Lin Zhang, Yantao Ji, Jianwu Dang

Therefore, in the most current state-of-the-art network architectures, only a few branches corresponding to a limited number of temporal scales could be designed for speaker embeddings.

Speaker Verification

Paper
Add Code

CS-Rep: Making Speaker Verification Networks Embracing Re-parameterization

1 code implementation • 26 Oct 2021 • Ruiteng Zhang, Jianguo Wei, Wenhuan Lu, Lin Zhang, Yantao Ji, Junhai Xu, Xugang Lu

Automatic speaker verification (ASV) systems, which determine whether two speeches are from the same speaker, mainly focus on verification accuracy while ignoring inference speed.

Speaker Verification

7,955

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.