Search Results for author: Ping Wei

Found 15 papers, 4 papers with code

Task-Driven Exploration: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection

1 code implementation • 14 Apr 2024 • Jin Yang, Ping Wei, Huan Li, Ziyang Ren

Video moment retrieval and highlight detection are two highly valuable tasks in video understanding, but until recently they have been jointly studied.

Highlight Detection Moment Retrieval +2

Paper
Code

Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement

1 code implementation • 24 Mar 2024 • Xiuquan Hou, Meiqin Liu, Senlin Zhang, Ping Wei, Badong Chen

DETR-like methods have significantly increased detection performance in an end-to-end manner.

Ranked #2 on Object Detection on COCO 2017 val

Computational Efficiency Dense Object Detection

Paper
Code

Efficient Monaural Speech Enhancement using Spectrum Attention Fusion

no code implementations • 4 Aug 2023 • Jinyu Long, Jetic Gū, Binhao Bai, Zhibo Yang, Ping Wei, Junli Li

Speech enhancement is a demanding task in automated speech processing pipelines, focusing on separating clean speech from noisy channels.

Speech Enhancement

Paper
Add Code

Generative Steganographic Flow

no code implementations • 10 May 2023 • Ping Wei, Ge Luo, Qi Song, Xinpeng Zhang, Zhenxing Qian, Sheng Li

In the forward mapping, secret data is hidden in the input latent of Glow model to generate stego images.

Image Generation

Paper
Add Code

Generative Steganography Diffusion

no code implementations • 5 May 2023 • Ping Wei, Qing Zhou, Zichi Wang, Zhenxing Qian, Xinpeng Zhang, Sheng Li

However, existing GAN-based GS methods cannot completely recover the hidden secret data due to the lack of network invertibility, while Flow-based methods produce poor image quality due to the stringent reversibility restriction in each module.

Image Generation

Paper
Add Code

IntentQA: Context-aware Video Intent Reasoning

1 code implementation • ICCV 2023 • Jiapeng Li, Ping Wei, Wenjuan Han, Lifeng Fan

In this paper, we propose a novel task IntentQA, a special VideoQA task focusing on video intent reasoning, which has become increasingly important for AI with its advantages in equipping AI agents with the capability of reasoning beyond mere recognition in daily tasks.

Contrastive Learning

Paper
Code

Inverse Compositional Learning for Weakly-supervised Relation Grounding

no code implementations • ICCV 2023 • Huan Li, Ping Wei, Zeyu Ma, Nanning Zheng

In this study, we introduce a novel approach called inverse compositional learning (ICL) for weakly-supervised video relation grounding.

Relation Video Understanding

Paper
Add Code

Generative Steganography Network

no code implementations • 28 Jul 2022 • Ping Wei, Sheng Li, Xinpeng Zhang, Ge Luo, Zhenxing Qian, Qing Zhou

A new steganographic approach called generative steganography (GS) has emerged recently, in which stego images (images containing secret data) are generated from secret data directly without cover media.

Image Generation Steganalysis

Paper
Add Code

Multi-sensor joint target detection, tracking and classification via Bernoulli filter

no code implementations • 23 Sep 2021 • Gaiyou Li, Ping Wei, Giorgio Battistelli, Luigi Chisci, Lin Gao

This paper focuses on \textit{joint detection, tracking and classification} (JDTC) of a target via multi-sensor fusion.

Sensor Fusion

Paper
Add Code

Inverse Discriminative Networks for Handwritten Signature Verification

1 code implementation • CVPR 2019 • Ping Wei, Huan Li, Ping Hu

We test our method on the Chinese signature dataset and other three signature datasets of different languages: CEDAR, BHSig-B, and BHSig-H.

Paper
Code

Where and Why Are They Looking? Jointly Inferring Human Attention and Intentions in Complex Tasks

no code implementations • CVPR 2018 • Ping Wei, Yang Liu, Tianmin Shu, Nanning Zheng, Song-Chun Zhu

We built a new video dataset of tasks, intentions, and attention.

Paper
Add Code

Inferring Shared Attention in Social Scene Videos

no code implementations • CVPR 2018 • Lifeng Fan, Yixin Chen, Ping Wei, Wenguan Wang, Song-Chun Zhu

We collect a new dataset VideoCoAtt from public TV show videos, containing 380 complex video sequences with more than 492, 000 frames that include diverse social scenes for shared attention study.

Scene Understanding

Paper
Add Code

Monocular 3D Human Pose Estimation by Predicting Depth on Joints

no code implementations • ICCV 2017 • Bruce Xiaohan Nie, Ping Wei, Song-Chun Zhu

This paper aims at estimating full-body 3D human poses from monocular images of which the biggest challenge is the inherent ambiguity introduced by lifting the 2D pose into 3D space.

Ranked #113 on 3D Human Pose Estimation on Human3.6M (PA-MPJPE metric)

Depth Estimation Depth Prediction +1

Paper
Add Code

Jointly Recognizing Object Fluents and Tasks in Egocentric Videos

no code implementations • ICCV 2017 • Yang Liu, Ping Wei, Song-Chun Zhu

Given an egocentric video, a beam search algorithm is applied to jointly recognizing the object fluents in each frame, and the task of the entire video.

Object

Paper
Add Code

Predicting Human Activities Using Stochastic Grammar

no code implementations • ICCV 2017 • Siyuan Qi, Siyuan Huang, Ping Wei, Song-Chun Zhu

This paper presents a novel method to predict future human activities from partially observed RGB-D videos.

Activity Prediction

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.