Search Results for author: Mingyi He

Found 27 papers, 6 papers with code

A Revisit of the Normalized Eight-Point Algorithm and A Self-Supervised Deep Solution

no code implementations21 Apr 2023 Bin Fan, Yuchao Dai, Yongduek Seo, Mingyi He

The normalized eight-point algorithm has been widely viewed as the cornerstone in two-view geometry computation, where the seminal Hartley's normalization has greatly improved the performance of the direct linear transformation algorithm.

Self-Supervised Learning

Masked Representation Learning for Domain Generalized Stereo Matching

no code implementations CVPR 2023 Zhibo Rao, Bangshu Xiong, Mingyi He, Yuchao Dai, Renjie He, Zhelun Shen, Xing Li

Experimental results on multi-datasets show that: (1) our method can be easily plugged into the current various stereo matching models to improve generalization performance; (2) our method can reduce the significant volatility of generalization performance among different training epochs; (3) we find that the current methods prefer to choose the best results among different training epochs as generalization performance, but it is impossible to select the best performance by ground truth in practice.

Image Reconstruction Multi-Task Learning +2

Learning a Task-specific Descriptor for Robust Matching of 3D Point Clouds

no code implementations26 Oct 2022 Zhiyuan Zhang, Yuchao Dai, Bin Fan, Jiadai Sun, Mingyi He

In this paper, we propose to learn a robust task-specific feature descriptor to consistently describe the correct point correspondence under interference.

Context-Aware Video Reconstruction for Rolling Shutter Cameras

1 code implementation CVPR 2022 Bin Fan, Yuchao Dai, Zhiyuan Zhang, Qi Liu, Mingyi He

Then, a refinement scheme is proposed to guide the GS frame synthesis along with bilateral occlusion masks to produce high-fidelity GS video frames at arbitrary times.

Motion Compensation Video Reconstruction

VRNet: Learning the Rectified Virtual Corresponding Points for 3D Point Cloud Registration

no code implementations24 Mar 2022 Zhiyuan Zhang, Jiadai Sun, Yuchao Dai, Bin Fan, Mingyi He

3D point cloud registration is fragile to outliers, which are labeled as the points without corresponding points.

Point Cloud Registration

A Representation Separation Perspective to Correspondences-free Unsupervised 3D Point Cloud Registration

no code implementations24 Mar 2022 Zhiyuan Zhang, Jiadai Sun, Yuchao Dai, Dingfu Zhou, Xibin Song, Mingyi He

Existing correspondences-free methods generally learn the holistic representation of the entire point cloud, which is fragile for partial and noisy point clouds.

Point Cloud Registration

End-to-end Learning the Partial Permutation Matrix for Robust 3D Point Cloud Registration

no code implementations28 Oct 2021 Zhiyuan Zhang, Jiadai Sun, Yuchao Dai, Dingfu Zhou, Xibin Song, Mingyi He

Even though considerable progress has been made in deep learning-based 3D point cloud processing, how to obtain accurate correspondences for robust registration remains a major challenge because existing hard assignment methods cannot deal with outliers naturally.

Point Cloud Registration

Dense Uncertainty Estimation

1 code implementation13 Oct 2021 Jing Zhang, Yuchao Dai, Mochu Xiang, Deng-Ping Fan, Peyman Moghadam, Mingyi He, Christian Walder, Kaihao Zhang, Mehrtash Harandi, Nick Barnes

Deep neural networks can be roughly divided into deterministic neural networks and stochastic neural networks. The former is usually trained to achieve a mapping from input space to output space via maximum likelihood estimation for the weights, which leads to deterministic predictions during testing.

Decision Making

SUNet: Symmetric Undistortion Network for Rolling Shutter Correction

1 code implementation ICCV 2021 Bin Fan, Yuchao Dai, Mingyi He

The vast majority of modern consumer-grade cameras employ a rolling shutter mechanism, leading to image distortions if the camera moves during image acquisition.

Rolling Shutter Correction

Pattern Ensembling for Spatial Trajectory Reconstruction

no code implementations25 Jan 2021 Shivam Pathak, Mingyi He, Sergey Malinchik, Stanislav Sobolevsky

Digital sensing provides an unprecedented opportunity to assess and understand mobility.

Class Attention Network for Semantic Segmentation of Remote Sensing Images

no code implementations31 Dec 2020 Zhibo Rao, Mingyi He, Yuchao Dai

In this paper, we proposed a novel class attention module and decomposition-fusion strategy to cope with imbalanced labels.

Earth Observation Scene Parsing +2

NLCA-Net v2 for Stereo Matching in ECCV'20 Robust Vision Challenge

no code implementations1 Nov 2020 Zhibo Rao, Mingyi He, Bo Li, Renjie He

The network architecture used in this RVC, called as NLCA-Net v2, is consists of four parts: feature extraction, cost volume construction, feature matching, and refinement, as shown in Fig.

Stereo Matching

Monocular Human Pose Estimation: A Survey of Deep Learning-based Methods

no code implementations2 Jun 2020 Yu-cheng Chen, YingLi Tian, Mingyi He

Vision-based monocular human pose estimation, as one of the most fundamental and challenging problems in computer vision, aims to obtain posture of the human body from input images or video sequences.

3D Human Pose Estimation

Self-supervised Modal and View Invariant Feature Learning

no code implementations28 May 2020 Longlong Jing, Yu-cheng Chen, Ling Zhang, Mingyi He, YingLi Tian

By exploring the inherent multi-modality attributes of 3D objects, in this paper, we propose to jointly learn modal-invariant and view-invariant features from different modalities including image, point cloud, and mesh with heterogeneous networks for 3D data.

Cross-Modal Retrieval Retrieval

Self-supervised Feature Learning by Cross-modality and Cross-view Correspondences

no code implementations13 Apr 2020 Longlong Jing, Yu-cheng Chen, Ling Zhang, Mingyi He, YingLi Tian

Specifically, 2D image features of rendered images from different views are extracted by a 2D convolutional neural network, and 3D point cloud features are extracted by a graph convolution neural network.

3D Part Segmentation 3D Shape Classification +4

MSDC-Net: Multi-Scale Dense and Contextual Networks for Automated Disparity Map for Stereo Matching

no code implementations25 Apr 2019 Zhibo Rao, Mingyi He, Yuchao Dai, Zhidong Zhu, Bo Li, Renjie He

The multi-scale residual 3D convolution module learns the different scale geometry context from the cost volume which aggregated by the multi-scale fusion 2D convolution module.

Autonomous Driving object-detection +3

Multi-scale Cross-form Pyramid Network for Stereo Matching

no code implementations25 Apr 2019 Zhidong Zhu, Mingyi He, Yuchao Dai, Zhibo Rao, Bo Li

The network consists of three modules: Multi-Scale 2D local feature extraction module, Cross-form spatial pyramid module and Multi-Scale 3D Feature Matching and Fusion module.

3D Feature Matching 3D Scene Reconstruction +3

Deep Edge-Aware Saliency Detection

no code implementations15 Aug 2017 Jing Zhang, Yuchao Dai, Fatih Porikli, Mingyi He

There has been profound progress in visual saliency thanks to the deep learning architectures, however, there still exist three major challenges that hinder the detection performance for scenes with complex compositions, multiple salient objects, and salient objects of diverse scales.

Descriptive Saliency Detection

Monocular Depth Estimation with Hierarchical Fusion of Dilated CNNs and Soft-Weighted-Sum Inference

1 code implementation2 Aug 2017 Bo Li, Yuchao Dai, Mingyi He

Extensive experiments on the NYU Depth V2 and KITTI datasets show the superiority of our method compared with current state-of-the-art methods.

Monocular Depth Estimation Quantization +1

Dense Non-rigid Structure-from-Motion Made Easy - A Spatial-Temporal Smoothness based Solution

no code implementations27 Jun 2017 Yuchao Dai, Huizhong Deng, Mingyi He

Second, we propose to exploit the spatial smoothness by resorting to the Laplacian of the 3D non-rigid shape.

Integrated Deep and Shallow Networks for Salient Object Detection

no code implementations2 Jun 2017 Jing Zhang, Bo Li, Yuchao Dai, Fatih Porikli, Mingyi He

Then the results from deep FCNN and RBD are concatenated to feed into a shallow network to map the concatenated feature maps to saliency maps.

Object object-detection +3

Single image depth estimation by dilated deep residual convolutional neural network and soft-weight-sum inference

1 code implementation27 Apr 2017 Bo Li, Yuchao Dai, Huahui Chen, Mingyi He

This paper proposes a new residual convolutional neural network (CNN) architecture for single image depth estimation.

Depth Estimation

Skeleton Boxes: Solving skeleton based action detection with a single deep convolutional neural network

no code implementations19 Apr 2017 Bo Li, Huahui Chen, Yu-cheng Chen, Yuchao Dai, Mingyi He

However, due to the difficulty in representing the 3D skeleton video and the lack of training data, action detection from streaming 3D skeleton video still lags far behind its recognition counterpart and image based object detection.

Action Detection Action Recognition +3

Skeleton based action recognition using translation-scale invariant image mapping and multi-scale deep cnn

no code implementations19 Apr 2017 Bo Li, Mingyi He, Xuelian Cheng, Yu-cheng Chen, Yuchao Dai

Especially on the largest and challenge NTU RGB+D, UTD-MHAD, and MSRC-12 dataset, our method outperforms other methods by a large margion, which proves the efficacy of the proposed method.

Action Recognition Image Classification +3

Cannot find the paper you are looking for? You can Submit a new open access paper.