Search Results for author: Mingyi He

Found 27 papers, 6 papers with code

Transferable Attack for Semantic Segmentation

1 code implementation • 31 Jul 2023 • Mengqi He, Jing Zhang, Zhaoyuan Yang, Mingyi He, Nick Barnes, Yuchao Dai

We analysis performance of semantic segmentation models wrt.

Paper
Code

Joint Salient Object Detection and Camouflaged Object Detection via Uncertainty-aware Learning

no code implementations • 10 Jul 2023 • Aixuan Li, Jing Zhang, Yunqiu Lv, Tong Zhang, Yiran Zhong, Mingyi He, Yuchao Dai

In this case, salient objects are typically non-camouflaged, and camouflaged objects are usually not salient.

Attribute Contrastive Learning +5

Paper
Add Code

A Revisit of the Normalized Eight-Point Algorithm and A Self-Supervised Deep Solution

no code implementations • 21 Apr 2023 • Bin Fan, Yuchao Dai, Yongduek Seo, Mingyi He

The normalized eight-point algorithm has been widely viewed as the cornerstone in two-view geometry computation, where the seminal Hartley's normalization has greatly improved the performance of the direct linear transformation algorithm.

Self-Supervised Learning

Paper
Add Code

Masked Representation Learning for Domain Generalized Stereo Matching

no code implementations • CVPR 2023 • Zhibo Rao, Bangshu Xiong, Mingyi He, Yuchao Dai, Renjie He, Zhelun Shen, Xing Li

Experimental results on multi-datasets show that: (1) our method can be easily plugged into the current various stereo matching models to improve generalization performance; (2) our method can reduce the significant volatility of generalization performance among different training epochs; (3) we find that the current methods prefer to choose the best results among different training epochs as generalization performance, but it is impossible to select the best performance by ground truth in practice.

Image Reconstruction Multi-Task Learning +2

Paper
Add Code

Learning a Task-specific Descriptor for Robust Matching of 3D Point Clouds

no code implementations • 26 Oct 2022 • Zhiyuan Zhang, Yuchao Dai, Bin Fan, Jiadai Sun, Mingyi He

In this paper, we propose to learn a robust task-specific feature descriptor to consistently describe the correct point correspondence under interference.

Paper
Add Code

Context-Aware Video Reconstruction for Rolling Shutter Cameras

1 code implementation • CVPR 2022 • Bin Fan, Yuchao Dai, Zhiyuan Zhang, Qi Liu, Mingyi He

Then, a refinement scheme is proposed to guide the GS frame synthesis along with bilateral occlusion masks to produce high-fidelity GS video frames at arbitrary times.

Motion Compensation Video Reconstruction

Paper
Code

VRNet: Learning the Rectified Virtual Corresponding Points for 3D Point Cloud Registration

no code implementations • 24 Mar 2022 • Zhiyuan Zhang, Jiadai Sun, Yuchao Dai, Bin Fan, Mingyi He

3D point cloud registration is fragile to outliers, which are labeled as the points without corresponding points.

Point Cloud Registration

Paper
Add Code

A Representation Separation Perspective to Correspondences-free Unsupervised 3D Point Cloud Registration

no code implementations • 24 Mar 2022 • Zhiyuan Zhang, Jiadai Sun, Yuchao Dai, Dingfu Zhou, Xibin Song, Mingyi He

Existing correspondences-free methods generally learn the holistic representation of the entire point cloud, which is fragile for partial and noisy point clouds.

Point Cloud Registration

Paper
Add Code

End-to-end Learning the Partial Permutation Matrix for Robust 3D Point Cloud Registration

no code implementations • 28 Oct 2021 • Zhiyuan Zhang, Jiadai Sun, Yuchao Dai, Dingfu Zhou, Xibin Song, Mingyi He

Even though considerable progress has been made in deep learning-based 3D point cloud processing, how to obtain accurate correspondences for robust registration remains a major challenge because existing hard assignment methods cannot deal with outliers naturally.

Point Cloud Registration

Paper
Add Code

Dense Uncertainty Estimation

1 code implementation • 13 Oct 2021 • Jing Zhang, Yuchao Dai, Mochu Xiang, Deng-Ping Fan, Peyman Moghadam, Mingyi He, Christian Walder, Kaihao Zhang, Mehrtash Harandi, Nick Barnes

Deep neural networks can be roughly divided into deterministic neural networks and stochastic neural networks. The former is usually trained to achieve a mapping from input space to output space via maximum likelihood estimation for the weights, which leads to deterministic predictions during testing.

Decision Making

Paper
Code

SUNet: Symmetric Undistortion Network for Rolling Shutter Correction

1 code implementation • ICCV 2021 • Bin Fan, Yuchao Dai, Mingyi He

The vast majority of modern consumer-grade cameras employ a rolling shutter mechanism, leading to image distortions if the camera moves during image acquisition.

Rolling Shutter Correction

Paper
Code

Pattern Ensembling for Spatial Trajectory Reconstruction

no code implementations • 25 Jan 2021 • Shivam Pathak, Mingyi He, Sergey Malinchik, Stanislav Sobolevsky

Digital sensing provides an unprecedented opportunity to assess and understand mobility.

Paper
Add Code

Class Attention Network for Semantic Segmentation of Remote Sensing Images

no code implementations • 31 Dec 2020 • Zhibo Rao, Mingyi He, Yuchao Dai

In this paper, we proposed a novel class attention module and decomposition-fusion strategy to cope with imbalanced labels.

Earth Observation Scene Parsing +2

Paper
Add Code

NLCA-Net v2 for Stereo Matching in ECCV'20 Robust Vision Challenge

no code implementations • 1 Nov 2020 • Zhibo Rao, Mingyi He, Bo Li, Renjie He

The network architecture used in this RVC, called as NLCA-Net v2, is consists of four parts: feature extraction, cost volume construction, feature matching, and refinement, as shown in Fig.

Stereo Matching

Paper
Add Code

Monocular Human Pose Estimation: A Survey of Deep Learning-based Methods

no code implementations • 2 Jun 2020 • Yu-cheng Chen, YingLi Tian, Mingyi He

Vision-based monocular human pose estimation, as one of the most fundamental and challenging problems in computer vision, aims to obtain posture of the human body from input images or video sequences.

3D Human Pose Estimation

Paper
Add Code

Self-supervised Modal and View Invariant Feature Learning

no code implementations • 28 May 2020 • Longlong Jing, Yu-cheng Chen, Ling Zhang, Mingyi He, YingLi Tian

By exploring the inherent multi-modality attributes of 3D objects, in this paper, we propose to jointly learn modal-invariant and view-invariant features from different modalities including image, point cloud, and mesh with heterogeneous networks for 3D data.

Cross-Modal Retrieval Retrieval

Paper
Add Code

Self-supervised Feature Learning by Cross-modality and Cross-view Correspondences

no code implementations • 13 Apr 2020 • Longlong Jing, Yu-cheng Chen, Ling Zhang, Mingyi He, YingLi Tian

Specifically, 2D image features of rendered images from different views are extracted by a 2D convolutional neural network, and 3D point cloud features are extracted by a graph convolution neural network.

3D Part Segmentation 3D Shape Classification +4

Paper
Add Code

MSDC-Net: Multi-Scale Dense and Contextual Networks for Automated Disparity Map for Stereo Matching

no code implementations • 25 Apr 2019 • Zhibo Rao, Mingyi He, Yuchao Dai, Zhidong Zhu, Bo Li, Renjie He

The multi-scale residual 3D convolution module learns the different scale geometry context from the cost volume which aggregated by the multi-scale fusion 2D convolution module.

Autonomous Driving object-detection +3

Paper
Add Code

Multi-scale Cross-form Pyramid Network for Stereo Matching

no code implementations • 25 Apr 2019 • Zhidong Zhu, Mingyi He, Yuchao Dai, Zhibo Rao, Bo Li

The network consists of three modules: Multi-Scale 2D local feature extraction module, Cross-form spatial pyramid module and Multi-Scale 3D Feature Matching and Fusion module.

3D Feature Matching 3D Scene Reconstruction +3

Paper
Add Code

Deep Edge-Aware Saliency Detection

no code implementations • 15 Aug 2017 • Jing Zhang, Yuchao Dai, Fatih Porikli, Mingyi He

There has been profound progress in visual saliency thanks to the deep learning architectures, however, there still exist three major challenges that hinder the detection performance for scenes with complex compositions, multiple salient objects, and salient objects of diverse scales.

Descriptive Saliency Detection

Paper
Add Code

Monocular Depth Estimation with Hierarchical Fusion of Dilated CNNs and Soft-Weighted-Sum Inference

1 code implementation • 2 Aug 2017 • Bo Li, Yuchao Dai, Mingyi He

Extensive experiments on the NYU Depth V2 and KITTI datasets show the superiority of our method compared with current state-of-the-art methods.

Monocular Depth Estimation Quantization +1

Paper
Code

Dense Non-rigid Structure-from-Motion Made Easy - A Spatial-Temporal Smoothness based Solution

no code implementations • 27 Jun 2017 • Yuchao Dai, Huizhong Deng, Mingyi He

Second, we propose to exploit the spatial smoothness by resorting to the Laplacian of the 3D non-rigid shape.

Paper
Add Code

Integrated Deep and Shallow Networks for Salient Object Detection

no code implementations • 2 Jun 2017 • Jing Zhang, Bo Li, Yuchao Dai, Fatih Porikli, Mingyi He

Then the results from deep FCNN and RBD are concatenated to feed into a shallow network to map the concatenated feature maps to saliency maps.

Object object-detection +3

Paper
Add Code

Single image depth estimation by dilated deep residual convolutional neural network and soft-weight-sum inference

1 code implementation • 27 Apr 2017 • Bo Li, Yuchao Dai, Huahui Chen, Mingyi He

This paper proposes a new residual convolutional neural network (CNN) architecture for single image depth estimation.

Depth Estimation

Paper
Code

Skeleton Boxes: Solving skeleton based action detection with a single deep convolutional neural network

no code implementations • 19 Apr 2017 • Bo Li, Huahui Chen, Yu-cheng Chen, Yuchao Dai, Mingyi He

However, due to the difficulty in representing the 3D skeleton video and the lack of training data, action detection from streaming 3D skeleton video still lags far behind its recognition counterpart and image based object detection.

Action Detection Action Recognition +3

Paper
Add Code

Skeleton based action recognition using translation-scale invariant image mapping and multi-scale deep cnn

no code implementations • 19 Apr 2017 • Bo Li, Mingyi He, Xuelian Cheng, Yu-cheng Chen, Yuchao Dai

Especially on the largest and challenge NTU RGB+D, UTD-MHAD, and MSRC-12 dataset, our method outperforms other methods by a large margion, which proves the efficacy of the proposed method.

Ranked #80 on Skeleton Based Action Recognition on NTU RGB+D

Action Recognition Image Classification +3

Paper
Add Code

Depth and Surface Normal Estimation From Monocular Images Using Regression on Deep Features and Hierarchical CRFs

no code implementations • CVPR 2015 • Bo Li, Chunhua Shen, Yuchao Dai, Anton Van Den Hengel, Mingyi He

Predicting the depth (or surface normal) of a scene from single monocular color images is a challenging task.

regression Surface Normal Estimation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.