Search Results for author: Lihe Zhang

Found 38 papers, 28 papers with code

Spatial Semantic Recurrent Mining for Referring Image Segmentation

no code implementations • 15 May 2024 • Jiaxing Yang, Lihe Zhang, Jiayu Sun, Huchuan Lu

In this paper, we propose Spatial Semantic Recurrent Mining (S²RM) to achieve high-quality cross-modality fusion.

Image Segmentation Semantic Segmentation

Spider: A Unified Framework for Context-dependent Concept Segmentation

1 code implementation • 2 May 2024 • Xiaoqi Zhao, Youwei Pang, Wei Ji, Baicheng Sheng, Jiaming Zuo, Lihe Zhang, Huchuan Lu

Unlike context-independent (CI) concepts such as human, car, and airplane, context-dependent (CD) concepts, such as camouflaged objects and medical lesions, require higher visual understanding ability.

Transparent objects

Multi-view Aggregation Network for Dichotomous Image Segmentation

2 code implementations • 11 Apr 2024 • Qian Yu, Xiaoqi Zhao, Youwei Pang, Lihe Zhang, Huchuan Lu

Dichotomous Image Segmentation (DIS) has recently emerged as a task aimed at high-precision object segmentation from high-resolution natural images.

Ranked #1 on Dichotomous Image Segmentation on DIS-VD

Decoder Dichotomous Image Segmentation +2

Catastrophic Overfitting: A Potential Blessing in Disguise

no code implementations • 28 Feb 2024 • Mengnan Zhao, Lihe Zhang, Yuqiu Kong, BaoCai Yin

To tackle this issue, we initially employ the feature activation differences between clean and adversarial examples to analyze the underlying causes of CO. Intriguingly, our findings reveal that CO can be attributed to the feature coverage induced by a few specific pathways.

Adversarial Robustness

Separable Multi-Concept Erasure from Diffusion Models

no code implementations • 3 Feb 2024 • Mengnan Zhao, Lihe Zhang, Tianhang Zheng, Yuqiu Kong, BaoCai Yin

Large-scale diffusion models, known for their impressive image generation capabilities, have raised concerns among researchers regarding social impacts, such as the imitation of copyrighted artistic styles.

Image Generation Machine Unlearning

EipFormer: Emphasizing Instance Positions in 3D Instance Segmentation

no code implementations • 9 Dec 2023 • Mengnan Zhao, Lihe Zhang, Yuqiu Kong, BaoCai Yin

It enhances the initial instance positions through weighted farthest point sampling and further refines the instance positions and proposals using aggregation averaging and center matching.

3D Instance Segmentation Position +1

Towards Automatic Power Battery Detection: New Challenge, Benchmark Dataset and Baseline

1 code implementation • 5 Dec 2023 • Xiaoqi Zhao, Youwei Pang, Zhenyu Chen, Qian Yu, Lihe Zhang, Hanqi Liu, Jiaming Zuo, Huchuan Lu

We conduct a comprehensive study on a new task named power battery detection (PBD), which aims to localize the endpoints of the dense cathode and anode plates in X-ray images to evaluate the quality of power batteries.

Crowd Counting object-detection +2

Open-Vocabulary Camouflaged Object Segmentation

no code implementations • 19 Nov 2023 • Youwei Pang, Xiaoqi Zhao, Jiaming Zuo, Lihe Zhang, Huchuan Lu

With the proposed dataset and baseline, we hope that this new task with more practical value can further expand the research on open-vocabulary dense prediction tasks.

Camouflaged Object Segmentation Image Segmentation +4

ZoomNeXt: A Unified Collaborative Pyramid Network for Camouflaged Object Detection

1 code implementation • 31 Oct 2023 • Youwei Pang, Xiaoqi Zhao, Tian-Zhu Xiang, Lihe Zhang, Huchuan Lu

Apart from the high intrinsic similarity between camouflaged objects and their background, objects are usually diverse in scale, fuzzy in appearance, and even severely occluded.

Ranked #1 on Camouflaged Object Segmentation on Camouflaged Animal Dataset (using extra training data)

Camouflaged Object Segmentation

Referring Image Segmentation Using Text Supervision

1 code implementation • ICCV 2023 • Fang Liu, Yuhao Liu, Yuqiu Kong, Ke Xu, Lihe Zhang, BaoCai Yin, Gerhard Hancke, Rynson Lau

Hence, we propose a novel weakly-supervised RIS framework to formulate the target localization problem as a classification process to differentiate between positive and negative text expressions.

Image Segmentation Object Localization +4

Fast Adversarial Training with Smooth Convergence

1 code implementation • ICCV 2023 • Mengnan Zhao, Lihe Zhang, Yuqiu Kong, BaoCai Yin

To address this, we analyze the training process of prior FAT work and observe that catastrophic overfitting is accompanied by the appearance of loss convergence outliers.

Adversarial Robustness

ComPtr: Towards Diverse Bi-source Dense Prediction Tasks via A Simple yet General Complementary Transformer

1 code implementation • 23 Jul 2023 • Youwei Pang, Xiaoqi Zhao, Lihe Zhang, Huchuan Lu

Specifically, unlike existing methods that over-specialize in a single task or a subset of tasks, ComPtr starts from the more general concept of bi-source dense prediction.

Ranked #14 on Semantic Segmentation on NYU Depth v2

Change Detection Crowd Counting +4

3rd Place Solution for PVUW2023 VSS Track: A Large Model for Semantic Segmentation on VSPW

1 code implementation • 4 Jun 2023 • Shijie Chang, Zeqi Hao, Ben Kang, Xiaoqi Zhao, Jiawen Zhu, Zhenyu Chen, Lihe Zhang, Lu Zhang, Huchuan Lu

In this paper, we introduce our 3rd place solution for the PVUW2023 VSS track.

Position Segmentation +2

M²SNet: Multi-scale in Multi-scale Subtraction Network for Medical Image Segmentation

2 code implementations • 20 Mar 2023 • Xiaoqi Zhao, Hongpeng Jia, Youwei Pang, Long Lv, Feng Tian, Lihe Zhang, Weibing Sun, Huchuan Lu

Next, we expand the single-scale SU to the intra-layer multi-scale SU, which can provide the decoder with both pixel-level and structure-level difference information.

Computed Tomography (CT) Decoder +4

Adaptive Multi-source Predictor for Zero-shot Video Object Segmentation

1 code implementation • 18 Mar 2023 • Xiaoqi Zhao, Shijie Chang, Youwei Pang, Jiaxing Yang, Lihe Zhang, Huchuan Lu

In the static object predictor, the RGB source is simultaneously converted to depth and static saliency sources.

Ranked #1 on Unsupervised Video Object Segmentation on YouTube-Objects

Object Optical Flow Estimation +4

Towards Diverse Binary Segmentation via A Simple yet General Gated Network

2 code implementations • 18 Mar 2023 • Xiaoqi Zhao, Youwei Pang, Lihe Zhang, Huchuan Lu, Lei Zhang

They ignore two key problems when the encoder exchanges information with the decoder: one is the lack of an interference control mechanism between them; the other is the failure to consider the disparity in the contributions from different encoder levels.

Decoder Segmentation +1

Adaptive Illumination Mapping for Shadow Detection in Raw Images

1 code implementation • ICCV 2023 • Jiayu Sun, Ke Xu, Youwei Pang, Lihe Zhang, Huchuan Lu, Gerhard Hancke, Rynson Lau

In this paper, we propose a novel method to detect shadows from raw images.

Shadow Detection

Deeply Interleaved Two-Stream Encoder for Referring Video Segmentation

no code implementations • 30 Mar 2022 • Guang Feng, Lihe Zhang, Zhiwei Hu, Huchuan Lu

To address this task, we first design a two-stream encoder to extract CNN-based visual features and transformer-based linguistic features hierarchically, and a vision-language mutual guidance (VLMG) module is inserted into the encoder multiple times to promote the hierarchical and progressive fusion of multi-modal features.

Ranked #3 on Referring Expression Segmentation on J-HMDB

Referring Expression Segmentation Video Segmentation +2

Joint Learning of Salient Object Detection, Depth Estimation and Contour Extraction

1 code implementation • 9 Mar 2022 • Xiaoqi Zhao, Youwei Pang, Lihe Zhang, Huchuan Lu

In this paper, we propose a novel multi-task and multi-modal filtered transformer (MMFT) network for RGB-D salient object detection (SOD).

Depth Estimation object-detection +2

Lane Detection with Versatile AtrousFormer and Local Semantic Guidance

no code implementations • 8 Mar 2022 • Jiaxing Yang, Lihe Zhang, Huchuan Lu

In this work, we propose Atrous Transformer (AtrousFormer) to solve the problem.

Ranked #25 on Lane Detection on CULane

Autonomous Driving Decoder +1

CAVER: Cross-Modal View-Mixed Transformer for Bi-Modal Salient Object Detection

1 code implementation • 4 Dec 2021 • Youwei Pang, Xiaoqi Zhao, Lihe Zhang, Huchuan Lu

Most of the existing bi-modal (RGB-D and RGB-T) salient object detection methods utilize the convolution operation and construct complex interweave fusion structures to achieve cross-modal information integration.

Decoder object-detection +2

Temporal Knowledge Graph Reasoning Triggered by Memories

1 code implementation • 17 Oct 2021 • Mengnan Zhao, Lihe Zhang, Yuqiu Kong, BaoCai Yin

Specifically, the transient learning network considers transient memories as a static knowledge graph, and the time-aware recurrent evolution network learns representations through a sequence of recurrent evolution units from long-short-term memories.

Attribute Decision Making +2

MODNet-V: Improving Portrait Video Matting via Background Restoration

1 code implementation • 24 Sep 2021 • Jiayu Sun, Zhanghan Ke, Lihe Zhang, Huchuan Lu, Rynson W. H. Lau

In this work, we observe that instead of asking the user to explicitly provide a background image, we may recover it from the input video itself.

Image Matting Video Matting

Multi-Source Fusion and Automatic Predictor Selection for Zero-Shot Video Object Segmentation

1 code implementation • 11 Aug 2021 • Xiaoqi Zhao, Youwei Pang, Jiaxing Yang, Lihe Zhang, Huchuan Lu

In this paper, we propose a novel multi-source fusion network for zero-shot video object segmentation.

Ranked #1 on Video Object Segmentation on FBMS (Jaccard (Mean) metric)

Depth Estimation Object +3

Automatic Polyp Segmentation via Multi-scale Subtraction Network

2 code implementations • 11 Aug 2021 • Xiaoqi Zhao, Lihe Zhang, Huchuan Lu

Keywords: Colorectal Cancer, Automatic Polyp Segmentation, Subtraction, LossNet.

Segmentation

Encoder Fusion Network with Co-Attention Embedding for Referring Image Segmentation

no code implementations • CVPR 2021 • Guang Feng, Zhiwei Hu, Lihe Zhang, Huchuan Lu

In this work, we propose an encoder fusion network (EFN), which transforms the visual encoder into a multi-modal feature learning network, and uses language to refine the multi-modal features progressively.

Image Segmentation Semantic Segmentation

Self-Supervised Pretraining for RGB-D Salient Object Detection

1 code implementation • 29 Jan 2021 • Xiaoqi Zhao, Youwei Pang, Lihe Zhang, Huchuan Lu, Xiang Ruan

Existing CNN-based RGB-D salient object detection (SOD) networks all require pretraining on ImageNet to learn hierarchical features that provide a good initialization.

Object object-detection +3

Multi-scale Interactive Network for Salient Object Detection

1 code implementation • CVPR 2020 • Youwei Pang, Xiaoqi Zhao, Lihe Zhang, Huchuan Lu

To obtain more efficient multi-scale features from the integrated features, the self-interaction modules are embedded in each decoder unit.

Decoder Object +3

Suppress and Balance: A Simple Gated Network for Salient Object Detection

3 code implementations • ECCV 2020 • Xiaoqi Zhao, Youwei Pang, Lihe Zhang, Huchuan Lu, Lei Zhang

With the help of multilevel gate units, the valuable context information from the encoder can be optimally transmitted to the decoder.

Ranked #15 on Dichotomous Image Segmentation on DIS-TE4

Decoder Dichotomous Image Segmentation +2

A Single Stream Network for Robust and Real-time RGB-D Salient Object Detection

1 code implementation • ECCV 2020 • Xiaoqi Zhao, Lihe Zhang, Youwei Pang, Huchuan Lu, Lei Zhang

In this work, we design a single stream network to directly use the depth map to guide early fusion and middle fusion between RGB and depth, which saves the feature encoder of the depth stream and achieves a lightweight and real-time model.

Ranked #15 on Thermal Image Segmentation on RGB-T-Glass-Segmentation

Decoder object-detection +4

Hierarchical Dynamic Filtering Network for RGB-D Salient Object Detection

1 code implementation • ECCV 2020 • Youwei Pang, Lihe Zhang, Xiaoqi Zhao, Huchuan Lu

The main purpose of RGB-D salient object detection (SOD) is to better integrate and utilize cross-modal fusion information.

Ranked #5 on RGB-D Salient Object Detection on NJU2K

object-detection RGB-D Salient Object Detection +3

TF3P: Three-dimensional Force Fields Fingerprint Learned by Deep Capsular Network

1 code implementation • 24 Dec 2019 • Yanxing Wang, Jianxing Hu, Junyong Lai, Yibo Li, Hongwei Jin, Lihe Zhang, Liangren Zhang, Zhenming Liu

Molecular fingerprints are the workhorse in ligand-based drug discovery.

Drug Discovery

Joint Learning of Saliency Detection and Weakly Supervised Semantic Segmentation

1 code implementation • ICCV 2019 • Yu Zeng, Yunzhi Zhuge, Huchuan Lu, Lihe Zhang

SSNet consists of a segmentation network (SN) and a saliency aggregation module (SAM).

Multi-Task Learning Saliency Detection +4

Multi-source weak supervision for saliency detection

1 code implementation • CVPR 2019 • Yu Zeng, Yunzhi Zhuge, Huchuan Lu, Lihe Zhang, Mingyang Qian, Yizhou Yu

To this end, we propose a unified framework to train saliency detection models with diverse weak supervision sources.

Caption Generation Saliency Prediction

Learning to Promote Saliency Detectors

1 code implementation • CVPR 2018 • Yu Zeng, Huchuan Lu, Lihe Zhang, Mengyang Feng, Ali Borji

The categories and appearance of salient objects vary from image to image; therefore, saliency detection is an image-specific task.

Saliency Detection Small Data Image Classification +1

Detect Globally, Refine Locally: A Novel Approach to Saliency Detection

no code implementations • CVPR 2018 • Tiantian Wang, Lihe Zhang, Shuo Wang, Huchuan Lu, Gang Yang, Xiang Ruan, Ali Borji

Moreover, to effectively recover object boundaries, we propose a local Boundary Refinement Network (BRN) to adaptively learn the local contextual information for each spatial position.

Ranked #13 on RGB Salient Object Detection on DUTS-TE

object-detection RGB Salient Object Detection +2

A Stagewise Refinement Model for Detecting Salient Objects in Images

1 code implementation • ICCV 2017 • Tiantian Wang, Ali Borji, Lihe Zhang, Pingping Zhang, Huchuan Lu

To remedy this problem, here we propose to augment feedforward neural networks with a novel pyramid pooling module and a multi-stage refinement mechanism for saliency detection.

Ranked #14 on RGB Salient Object Detection on DUTS-TE (max F-measure metric)

object-detection RGB Salient Object Detection +2

Saliency Detection via Graph-Based Manifold Ranking

no code implementations • CVPR 2013 • Chuan Yang, Lihe Zhang, Huchuan Lu, Xiang Ruan, Ming-Hsuan Yang

The saliency of the image elements is defined based on their relevance to the given seeds or queries.

Saliency Detection Superpixels
