1 code implementation • 24 Apr 2024 • Jiawei Yao, Qi Qian, Juhua Hu
Traditionally, aligning a user's brief keyword of interest with the corresponding vision components was challenging, but the emergence of multi-modal and large language models (LLMs) has begun to bridge this gap.
1 code implementation • 7 Feb 2024 • Jiawei Yao, Juhua Hu
In the E-step, the disentanglement learning module employs coarse-grained and fine-grained disentangled representations to obtain a more diverse set of latent factors from the data.
1 code implementation • 20 Dec 2023 • Jiawei Yao, Xiaochao Pan, Tong Wu, Xiaofeng Zhang
In this paper, we introduce for the first time a large-scale aerial image dataset built for lane detection, with high-quality polyline lane annotations on high-resolution images of around 80 kilometers of road.
no code implementations • 28 Nov 2023 • Jiawei Yao, Jusheng Zhang
The task of 3D semantic scene completion with monocular cameras is gaining increasing attention in the field of autonomous driving.
no code implementations • 9 Oct 2023 • Jiawei Yao, Chen Wang, Tong Wu, Chuming Li
In this paper, we propose a novel method for 3D scene and object reconstruction from sparse multi-view images.
no code implementations • 7 Oct 2023 • Jiawei Yao, Yingxin Lai
3D object detection is crucial for applications like autonomous driving and robotics.
1 code implementation • ICCV 2023 • Jiawei Yao, Chuming Li, Keqiang Sun, Yingjie Cai, Hao Li, Wanli Ouyang, Hongsheng Li
Monocular 3D Semantic Scene Completion (SSC) has garnered significant attention in recent years due to its potential to predict complex semantics and geometry shapes from a single image, requiring no 3D inputs.
no code implementations • 16 Aug 2023 • Jiawei Yao, Tong Wu, Xiaofeng Zhang
To explore the differences between Transformers and CNNs, we employ a sparse pixel approach to compare the two.
1 code implementation • 22 Jun 2023 • Jiawei Yao, Enbei Liu, Maham Rashid, Juhua Hu
Thereafter, multiple clusterings based on different aspects of the data can be obtained.
1 code implementation • IEEE Transactions on Circuits and Systems for Video Technology 2022 • Cairong Zhao, Zhicheng Chen, Shuguang Dou, Zefan Qu, Jiawei Yao, Jun Wu, Duoqian Miao
For human-introduced noise, we propose a noise-discovery and noise-suppression training process for mislabeling-robust person search.
no code implementations • 1 May 2017 • Wei Luo, Lingzhou Xue, Jiawei Yao, Xiufan Yu
Assuming that the predictors affect the response through the latent factors, we propose to first conduct factor analysis and then apply sufficient dimension reduction on the estimated factors, to derive the reduced data for subsequent forecasting.
no code implementations • 27 May 2015 • Jianqing Fan, Lingzhou Xue, Jiawei Yao
Our method and theory allow the number of predictors to be larger than the number of observations.