no code implementations • COLING 2022 • Xiao Song, Xiaodan Zhang, Junzhong Ji, Ying Liu, Pengxu Wei
Medical report automatic generation has gained increasing interest recently as a way to help radiologists write reports more efficiently.
no code implementations • 12 Mar 2024 • Hongcheng Zhang, Liu Liang, Pengxin Zeng, Xiao Song, Zhe Wang
Sparse 3D detectors have received significant attention since the query-based paradigm embraces low latency without explicit dense BEV feature construction.
no code implementations • 22 Nov 2023 • Xiao Song, Jiafan Liu, Yun Li, Wenbin Lei, Ruxin Wang
Radiology Report Generation (RRG) draws attention as an interaction between vision and language fields.
no code implementations • 28 Oct 2022 • Zezhong Jin, Dading Zhong, Xiao Song, Zhaoyi Liu, Naipeng Ye, Qingcheng Zeng
The model is iteratively updated to correct the unreliable pseudo labels to minimize the effect of noisy labels.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 9 Dec 2021 • Xiao Song, Guorun Yang, Xinge Zhu, Hui Zhou, Yuexin Ma, Zhe Wang, Jianping Shi
Compared to previous methods, our AdaStereo realizes a more standard, complete and effective domain adaptation pipeline.
no code implementations • 17 Aug 2021 • Lin Zhao, Hui Zhou, Xinge Zhu, Xiao Song, Hongsheng Li, Wenbing Tao
However, two major issues of the fusion between camera and LiDAR hinder its performance, \ie, how to effectively fuse these two modalities and how to precisely align them (suffering from the weak spatiotemporal synchronization problem).
no code implementations • 27 Nov 2020 • Zhenxun Yuan, Xiao Song, Lei Bai, Wengang Zhou, Zhe Wang, Wanli Ouyang
As a special design of this transformer, the information encoded in the encoder is different from that in the decoder, i. e. the encoder encodes temporal-channel information of multiple frames while the decoder decodes the spatial-channel information for the current frame in a voxel-wise manner.
3 code implementations • 4 Aug 2020 • Hui Zhou, Xinge Zhu, Xiao Song, Yuexin Ma, Zhe Wang, Hongsheng Li, Dahua Lin
A straightforward solution to tackle the issue of 3D-to-2D projection is to keep the 3D representation and process the points in the 3D space.
Ranked #11 on LIDAR Semantic Segmentation on nuScenes
no code implementations • CVPR 2021 • Xiao Song, Guorun Yang, Xinge Zhu, Hui Zhou, Zhe Wang, Jianping Shi
Compared to previous methods for adaptive stereo matching, our AdaStereo realizes a more standard, complete and effective domain adaptation pipeline.
no code implementations • 5 Mar 2019 • Xiao Song, Xu Zhao, Liangji Fang, Hanwen Hu
EdgeStereo also achieves comparable generalization performance for disparity estimation because of the incorporation of edge cues.
no code implementations • 27 Aug 2018 • Xiao Song, Xu Zhao, Liangji Fang, Tianwei Lin
Secondly we utilize the SSD, which is a deep learning framework for detection, to excavate context cues and conduct end-to-end face presentation attack detection.
no code implementations • 14 Mar 2018 • Xiao Song, Xu Zhao, Hanwen Hu, Liangji Fang
Recent convolutional neural networks, especially end-to-end disparity estimation models, achieve remarkable performance on stereo matching task.
no code implementations • 13 Mar 2018 • Xiao Song, Xu Zhao, Tianwei Lin
The second one is a high-level micro-texture based feature called Spatial Pyramid Coding Micro-Texture (SPMT) feature.