Search Results for author: MingYu Liu

Found 13 papers, 4 papers with code

A Transformer variant for multi-step forecasting of water level and hydrometeorological sensitivity analysis based on explainable artificial intelligence technology

no code implementations • 22 May 2024 • MingYu Liu, Nana Bao, Xingting Yan, Chenyang Li, Kai Peng

The variant model is utilized for multi-step forecasting of water level, by considering meteorological and hydrological factors simultaneously.

Decoder Explainable artificial intelligence +1

Paper
Add Code

GraphRelate3D: Context-Dependent 3D Object Detection with Inter-Object Relationship Graphs

no code implementations • 10 May 2024 • MingYu Liu, Ekim Yurtsever, Marc Brede, Jun Meng, Walter Zimmer, Xingcheng Zhou, Bare Luka Zagar, Yuning Cui, Alois Knoll

In this study, we introduce an object relation module, consisting of a graph generator and a graph neural network (GNN), to learn the spatial information from certain patterns to improve 3D object detection.

3D Object Detection Autonomous Vehicles +3

Paper
Add Code

PointCompress3D -- A Point Cloud Compression Framework for Roadside LiDARs in Intelligent Transportation Systems

no code implementations • 2 May 2024 • Walter Zimmer, Ramandika Pranamulia, Xingcheng Zhou, MingYu Liu, Alois C. Knoll

We achieve a frame rate of 10 FPS while keeping compression sizes below 105 Kb, a reduction of 50 times, and maintaining object detection performance on par with the original data.

Data Compression object-detection +1

Paper
Add Code

Diffusion Models Trained with Large Data Are Transferable Visual Models

no code implementations • 10 Mar 2024 • Guangkai Xu, Yongtao Ge, MingYu Liu, Chengxiang Fan, Kangyang Xie, Zhiyue Zhao, Hao Chen, Chunhua Shen

We show that, simply initializing image understanding models using a pre-trained UNet (or transformer) of diffusion models, it is possible to achieve remarkable transferable performance on fundamental vision perception tasks using a moderate amount of target data (even synthetic data only), including monocular depth, surface normal, image segmentation, matting, human pose estimation, among virtually many others.

Image Matting Image Segmentation +2

Paper
Add Code

A Survey on Autonomous Driving Datasets: Statistics, Annotation Quality, and a Future Outlook

2 code implementations • 2 Jan 2024 • MingYu Liu, Ekim Yurtsever, Jonathan Fossaert, Xingcheng Zhou, Walter Zimmer, Yuning Cui, Bare Luka Zagar, Alois C. Knoll

Autonomous driving has rapidly developed and shown promising performance due to recent advances in hardware and deep learning techniques.

Autonomous Driving

Paper
Code

Vision Language Models in Autonomous Driving and Intelligent Transportation Systems

1 code implementation • 22 Oct 2023 • Xingcheng Zhou, MingYu Liu, Bare Luka Zagar, Ekim Yurtsever, Alois C. Knoll

The applications of Vision-Language Models (VLMs) in the fields of Autonomous Driving (AD) and Intelligent Transportation Systems (ITS) have attracted widespread attention due to their outstanding performance and the ability to leverage Large Language Models (LLMs).

Autonomous Driving

Paper
Code

3D Understanding of Deformable Linear Objects: Datasets and Transferability Benchmark

no code implementations • 13 Oct 2023 • Bare Luka Žagar, Tim Hertel, MingYu Liu, Ekim Yurtsever, Alois C. Knoll

Finally, we analyzed the generalization capabilities of these methods by conducting transferability experiments on the PointWire and PointVessel datasets.

Object

Paper
Add Code

Implementing a new fully stepwise decomposition-based sampling technique for the hybrid water level forecasting model in real-world application

no code implementations • 19 Sep 2023 • Ziqian Zhang, Nana Bao, Xingting Yan, Aokai Zhu, Chenyang Li, MingYu Liu

Results of VMD-based hybrid model using FSDB sampling technique show that Nash-Sutcliffe Efficiency (NSE) coefficient is increased by 6. 4%, 28. 8% and 7. 0% in three stations respectively, compared with those obtained from the currently most advanced sampling technique.

Time Series Time Series Forecasting

Paper
Add Code

Looking and Listening: Audio Guided Text Recognition

1 code implementation • 6 Jun 2023 • Wenwen Yu, MingYu Liu, Biao Yang, Enming Zhang, Deqiang Jiang, Xing Sun, Yuliang Liu, Xiang Bai

Text recognition in the wild is a long-standing problem in computer vision.

Decoder Scene Text Recognition

Paper
Code

ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images

no code implementations • 5 Jun 2023 • Wenwen Yu, Chengquan Zhang, Haoyu Cao, Wei Hua, Bohan Li, Huang Chen, MingYu Liu, Mingrui Chen, Jianfeng Kuang, Mengjun Cheng, Yuning Du, Shikun Feng, Xiaoguang Hu, Pengyuan Lyu, Kun Yao, Yuechen Yu, Yuliang Liu, Wanxiang Che, Errui Ding, Cheng-Lin Liu, Jiebo Luo, Shuicheng Yan, Min Zhang, Dimosthenis Karatzas, Xing Sun, Jingdong Wang, Xiang Bai

It is hoped that this competition will attract many researchers in the field of CV and NLP, and bring some new thoughts to the field of Document AI.

Document AI Entity Linking +1

Paper
Add Code

Neuro-Causal Factor Analysis

no code implementations • 31 May 2023 • Alex Markham, MingYu Liu, Bryon Aragam, Liam Solus

Factor analysis (FA) is a statistical tool for studying how observed variables with some mutual dependences can be expressed as functions of mutually independent unobserved factors, and it is widely applied throughout the psychological, biological, and physical sciences.

Causal Discovery

Paper
Add Code

ICDAR 2023 Competition on Reading the Seal Title

no code implementations • 24 Apr 2023 • Wenwen Yu, MingYu Liu, Mingrui Chen, Ning Lu, Yinlong Wen, Yuliang Liu, Dimosthenis Karatzas, Xiang Bai

To promote research in this area, we organized ICDAR 2023 competition on reading the seal title (ReST), which included two tasks: seal title text detection (Task 1) and end-to-end seal title recognition (Task 2).

Optical Character Recognition (OCR) Task 2 +1

Paper
Add Code

3D Object Detection with a Self-supervised Lidar Scene Flow Backbone

1 code implementation • 2 May 2022 • Ekim Yurtsever, Emeç Erçelik, MingYu Liu, Zhijie Yang, Hanzhen Zhang, Pınar Topçam, Maximilian Listl, Yılmaz Kaan Çaylı, Alois Knoll

Our main contribution leverages learned flow and motion representations and combines a self-supervised backbone with a supervised 3D detection head.

3D Object Detection Object +3

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.