Search Results for author: Xiao Song

Found 13 papers, 1 paper with code

Cross-modal Contrastive Attention Model for Medical Report Generation

no code implementations • COLING 2022 • Xiao Song, Xiaodan Zhang, Junzhong Ji, Ying Liu, Pengxu Wei

Automatic medical report generation has gained increasing interest recently as a way to help radiologists write reports more efficiently.

Medical Report Generation

SparseLIF: High-Performance Sparse LiDAR-Camera Fusion for 3D Object Detection

no code implementations • 12 Mar 2024 • Hongcheng Zhang, Liu Liang, Pengxin Zeng, Xiao Song, Zhe Wang

Sparse 3D detectors have received significant attention since the query-based paradigm embraces low latency without explicit dense BEV feature construction.

3D Object Detection Object Detection

Rethinking Radiology Report Generation via Causal Reasoning and Counterfactual Augmentation

no code implementations • 22 Nov 2023 • Xiao Song, Jiafan Liu, Yun Li, Wenbin Lei, Ruxin Wang

Radiology Report Generation (RRG) draws attention as an interaction between vision and language fields.

Counterfactual Sentence +1

Filter and evolve: progressive pseudo label refining for semi-supervised automatic speech recognition

no code implementations • 28 Oct 2022 • Zezhong Jin, Dading Zhong, Xiao Song, Zhaoyi Liu, Naipeng Ye, Qingcheng Zeng

The model is iteratively updated to correct the unreliable pseudo labels to minimize the effect of noisy labels.

Automatic Speech Recognition (ASR) +2

AdaStereo: An Efficient Domain-Adaptive Stereo Matching Approach

no code implementations • 9 Dec 2021 • Xiao Song, Guorun Yang, Xinge Zhu, Hui Zhou, Yuexin Ma, Zhe Wang, Jianping Shi

Compared to previous methods, our AdaStereo realizes a more standard, complete and effective domain adaptation pipeline.

Domain Adaptation Stereo Matching

LIF-Seg: LiDAR and Camera Image Fusion for 3D LiDAR Semantic Segmentation

no code implementations • 17 Aug 2021 • Lin Zhao, Hui Zhou, Xinge Zhu, Xiao Song, Hongsheng Li, Wenbing Tao

However, two major issues of the fusion between camera and LiDAR hinder its performance, i.e., how to effectively fuse these two modalities and how to precisely align them (suffering from the weak spatiotemporal synchronization problem).

Autonomous Driving LIDAR Semantic Segmentation +1

Temporal-Channel Transformer for 3D Lidar-Based Video Object Detection in Autonomous Driving

no code implementations • 27 Nov 2020 • Zhenxun Yuan, Xiao Song, Lei Bai, Wengang Zhou, Zhe Wang, Wanli Ouyang

As a special design of this transformer, the information encoded in the encoder is different from that in the decoder, i.e., the encoder encodes temporal-channel information of multiple frames while the decoder decodes the spatial-channel information for the current frame in a voxel-wise manner.

3D Object Detection Autonomous Driving +4

Cylinder3D: An Effective 3D Framework for Driving-scene LiDAR Semantic Segmentation

3 code implementations • 4 Aug 2020 • Hui Zhou, Xinge Zhu, Xiao Song, Yuexin Ma, Zhe Wang, Hongsheng Li, Dahua Lin

A straightforward solution to tackle the issue of 3D-to-2D projection is to keep the 3D representation and process the points in the 3D space.

Ranked #11 on LIDAR Semantic Segmentation on nuScenes

3D Semantic Segmentation LIDAR Semantic Segmentation

AdaStereo: A Simple and Efficient Approach for Adaptive Stereo Matching

no code implementations • CVPR 2021 • Xiao Song, Guorun Yang, Xinge Zhu, Hui Zhou, Zhe Wang, Jianping Shi

Compared to previous methods for adaptive stereo matching, our AdaStereo realizes a more standard, complete and effective domain adaptation pipeline.

Domain Adaptation Stereo Matching

EdgeStereo: An Effective Multi-Task Learning Network for Stereo Matching and Edge Detection

no code implementations • 5 Mar 2019 • Xiao Song, Xu Zhao, Liangji Fang, Hanwen Hu

EdgeStereo also achieves comparable generalization performance for disparity estimation because of the incorporation of edge cues.

Disparity Estimation Edge Detection +3

Discriminative Representation Combinations for Accurate Face Spoofing Detection

no code implementations • 27 Aug 2018 • Xiao Song, Xu Zhao, Liangji Fang, Tianwei Lin

Secondly, we utilize SSD, a deep learning framework for detection, to excavate context cues and conduct end-to-end face presentation attack detection.

Face Presentation Attack Detection

EdgeStereo: A Context Integrated Residual Pyramid Network for Stereo Matching

no code implementations • 14 Mar 2018 • Xiao Song, Xu Zhao, Hanwen Hu, Liangji Fang

Recent convolutional neural networks, especially end-to-end disparity estimation models, achieve remarkable performance on the stereo matching task.

Disparity Estimation Edge Detection +2

Face Spoofing Detection by Fusing Binocular Depth and Spatial Pyramid Coding Micro-Texture Features

no code implementations • 13 Mar 2018 • Xiao Song, Xu Zhao, Tianwei Lin

The second one is a high-level micro-texture based feature called Spatial Pyramid Coding Micro-Texture (SPMT) feature.
