no code implementations • 9 May 2024 • Yiheng Huang, Hui Yang, Chuanchen Luo, Yuxi Wang, Shibiao Xu, Zhaoxiang Zhang, Man Zhang, Junran Peng
The effect of the design of each component is still unclear.
1 code implementation • 24 Jan 2024 • Zengbin Wang, Saihui Hou, Man Zhang, Xu Liu, Chunshui Cao, Yongzhen Huang, Peipei Li, Shibiao Xu
Gait recognition is a promising biometric method that aims to identify pedestrians from their unique walking patterns.
no code implementations • 7 Jan 2024 • Genghao Zhang, Yuxi Wang, Chuanchen Luo, Shibiao Xu, Zhaoxiang Zhang, Man Zhang, Junran Peng
Indoor scene generation has attracted significant attention recently, as it is crucial for applications such as gaming, virtual reality, and interior design.
1 code implementation • 20 Dec 2023 • Wenhao Xu, Rongtao Xu, Changwei Wang, Shibiao Xu, Li Guo, Man Zhang, Xiaopeng Zhang
Recently, CLIP has found practical utility in the domain of pixel-level zero-shot segmentation tasks.
1 code implementation • 1 Nov 2023 • Jinzhou Lin, Han Gao, Xuxiang Feng, Rongtao Xu, Changwei Wang, Man Zhang, Li Guo, Shibiao Xu
This article offers an exhaustive summary of the symbiosis between LLMs and embodied intelligence with a focus on navigation.
no code implementations • 8 Oct 2023 • Peipei Li, Xing Cui, Yibo Hu, Man Zhang, Ting Yao, Tao Mei
Directly employing small models may result in a significant drop in performance since it is difficult for a small model to adequately capture local structure and global shape information simultaneously, which are essential clues for point cloud analysis.
1 code implementation • 8 Oct 2023 • Chengjie Lu, Tao Yue, Man Zhang, Shaukat Ali
In addition, existing ADS testing techniques have limited effectiveness in ensuring the realism of test scenarios, especially the realism of weather conditions and their changes over time.
1 code implementation • 1 Oct 2023 • Zekun Moore Wang, Zhongyuan Peng, Haoran Que, Jiaheng Liu, Wangchunshu Zhou, Yuhan Wu, Hongcheng Guo, Ruitong Gan, Zehao Ni, Jian Yang, Man Zhang, Zhaoxiang Zhang, Wanli Ouyang, Ke Xu, Stephen W. Huang, Jie Fu, Junran Peng
The advent of Large Language Models (LLMs) has paved the way for complex tasks such as role-playing, which enhances user interactions by enabling models to imitate various characters.
1 code implementation • ICCV 2023 • Xiaojun Tang, Junsong Fan, Chuanchen Luo, Zhaoxiang Zhang, Man Zhang, Zongyuan Yang
Considering this phenomenon, we propose Discriminability-Driven Graph Network (DDG-Net), which explicitly models ambiguous snippets and discriminative snippets with well-designed connections, preventing the transmission of ambiguous information and enhancing the discriminability of snippet-level representations.
Weakly-supervised Temporal Action Localization
no code implementations • 22 Apr 2022 • Si Yang, Lihua Zheng, Xieyuanli Chen, Laura Zabawa, Man Zhang, Minjuan Wang
In the first step, we finetune an instance segmentation network pretrained on a source domain (the MS COCO dataset) using a synthetic target domain (an in-vitro soybean pods dataset).
no code implementations • 12 Sep 2017 • Lingxiao Song, Man Zhang, Xiang Wu, Ran He
This framework integrates cross-spectral face hallucination and discriminative feature learning into an end-to-end adversarial network.
no code implementations • 28 Nov 2014 • Ran He, Man Zhang, Liang Wang, Ye Ji, Qiyue Yin
For unsupervised learning, we propose a cross-modal subspace clustering method to learn a common structure for different modalities.