no code implementations • 22 May 2024 • MingYu Liu, Nana Bao, Xingting Yan, Chenyang Li, Kai Peng
The variant model is utilized for multi-step forecasting of water level, by considering meteorological and hydrological factors simultaneously.
no code implementations • 10 May 2024 • MingYu Liu, Ekim Yurtsever, Marc Brede, Jun Meng, Walter Zimmer, Xingcheng Zhou, Bare Luka Zagar, Yuning Cui, Alois Knoll
In this study, we introduce an object relation module, consisting of a graph generator and a graph neural network (GNN), to learn the spatial information from certain patterns to improve 3D object detection.
no code implementations • 2 May 2024 • Walter Zimmer, Ramandika Pranamulia, Xingcheng Zhou, MingYu Liu, Alois C. Knoll
We achieve a frame rate of 10 FPS while keeping compression sizes below 105 Kb, a reduction of 50 times, and maintaining object detection performance on par with the original data.
no code implementations • 10 Mar 2024 • Guangkai Xu, Yongtao Ge, MingYu Liu, Chengxiang Fan, Kangyang Xie, Zhiyue Zhao, Hao Chen, Chunhua Shen
We show that, simply initializing image understanding models using a pre-trained UNet (or transformer) of diffusion models, it is possible to achieve remarkable transferable performance on fundamental vision perception tasks using a moderate amount of target data (even synthetic data only), including monocular depth, surface normal, image segmentation, matting, human pose estimation, among virtually many others.
2 code implementations • 2 Jan 2024 • MingYu Liu, Ekim Yurtsever, Jonathan Fossaert, Xingcheng Zhou, Walter Zimmer, Yuning Cui, Bare Luka Zagar, Alois C. Knoll
Autonomous driving has rapidly developed and shown promising performance due to recent advances in hardware and deep learning techniques.
1 code implementation • 22 Oct 2023 • Xingcheng Zhou, MingYu Liu, Bare Luka Zagar, Ekim Yurtsever, Alois C. Knoll
The applications of Vision-Language Models (VLMs) in the fields of Autonomous Driving (AD) and Intelligent Transportation Systems (ITS) have attracted widespread attention due to their outstanding performance and the ability to leverage Large Language Models (LLMs).
no code implementations • 13 Oct 2023 • Bare Luka Žagar, Tim Hertel, MingYu Liu, Ekim Yurtsever, Alois C. Knoll
Finally, we analyzed the generalization capabilities of these methods by conducting transferability experiments on the PointWire and PointVessel datasets.
no code implementations • 19 Sep 2023 • Ziqian Zhang, Nana Bao, Xingting Yan, Aokai Zhu, Chenyang Li, MingYu Liu
Results of VMD-based hybrid model using FSDB sampling technique show that Nash-Sutcliffe Efficiency (NSE) coefficient is increased by 6. 4%, 28. 8% and 7. 0% in three stations respectively, compared with those obtained from the currently most advanced sampling technique.
1 code implementation • 6 Jun 2023 • Wenwen Yu, MingYu Liu, Biao Yang, Enming Zhang, Deqiang Jiang, Xing Sun, Yuliang Liu, Xiang Bai
Text recognition in the wild is a long-standing problem in computer vision.
no code implementations • 5 Jun 2023 • Wenwen Yu, Chengquan Zhang, Haoyu Cao, Wei Hua, Bohan Li, Huang Chen, MingYu Liu, Mingrui Chen, Jianfeng Kuang, Mengjun Cheng, Yuning Du, Shikun Feng, Xiaoguang Hu, Pengyuan Lyu, Kun Yao, Yuechen Yu, Yuliang Liu, Wanxiang Che, Errui Ding, Cheng-Lin Liu, Jiebo Luo, Shuicheng Yan, Min Zhang, Dimosthenis Karatzas, Xing Sun, Jingdong Wang, Xiang Bai
It is hoped that this competition will attract many researchers in the field of CV and NLP, and bring some new thoughts to the field of Document AI.
no code implementations • 31 May 2023 • Alex Markham, MingYu Liu, Bryon Aragam, Liam Solus
Factor analysis (FA) is a statistical tool for studying how observed variables with some mutual dependences can be expressed as functions of mutually independent unobserved factors, and it is widely applied throughout the psychological, biological, and physical sciences.
no code implementations • 24 Apr 2023 • Wenwen Yu, MingYu Liu, Mingrui Chen, Ning Lu, Yinlong Wen, Yuliang Liu, Dimosthenis Karatzas, Xiang Bai
To promote research in this area, we organized ICDAR 2023 competition on reading the seal title (ReST), which included two tasks: seal title text detection (Task 1) and end-to-end seal title recognition (Task 2).
1 code implementation • 2 May 2022 • Ekim Yurtsever, Emeç Erçelik, MingYu Liu, Zhijie Yang, Hanzhen Zhang, Pınar Topçam, Maximilian Listl, Yılmaz Kaan Çaylı, Alois Knoll
Our main contribution leverages learned flow and motion representations and combines a self-supervised backbone with a supervised 3D detection head.