1 code implementation • 14 Apr 2024 • Jin Yang, Ping Wei, Huan Li, Ziyang Ren
Video moment retrieval and highlight detection are two highly valuable tasks in video understanding, but only recently have they been jointly studied.
1 code implementation • 24 Mar 2024 • Xiuquan Hou, Meiqin Liu, Senlin Zhang, Ping Wei, Badong Chen
DETR-like methods have significantly increased detection performance in an end-to-end manner.
Ranked #2 on Object Detection on COCO 2017 val
no code implementations • 4 Aug 2023 • Jinyu Long, Jetic Gū, Binhao Bai, Zhibo Yang, Ping Wei, Junli Li
Speech enhancement is a demanding task in automated speech processing pipelines, focusing on separating clean speech from noisy channels.
no code implementations • 10 May 2023 • Ping Wei, Ge Luo, Qi Song, Xinpeng Zhang, Zhenxing Qian, Sheng Li
In the forward mapping, secret data is hidden in the input latent of the Glow model to generate stego images.
no code implementations • 5 May 2023 • Ping Wei, Qing Zhou, Zichi Wang, Zhenxing Qian, Xinpeng Zhang, Sheng Li
However, existing GAN-based GS methods cannot completely recover the hidden secret data due to the lack of network invertibility, while Flow-based methods produce poor image quality due to the stringent reversibility restriction in each module.
1 code implementation • ICCV 2023 • Jiapeng Li, Ping Wei, Wenjuan Han, Lifeng Fan
In this paper, we propose a novel task, IntentQA, a special VideoQA task focusing on video intent reasoning, which has become increasingly important for AI because it equips AI agents with the capability of reasoning beyond mere recognition in daily tasks.
no code implementations • ICCV 2023 • Huan Li, Ping Wei, Zeyu Ma, Nanning Zheng
In this study, we introduce a novel approach called inverse compositional learning (ICL) for weakly-supervised video relation grounding.
no code implementations • 28 Jul 2022 • Ping Wei, Sheng Li, Xinpeng Zhang, Ge Luo, Zhenxing Qian, Qing Zhou
A new steganographic approach called generative steganography (GS) has emerged recently, in which stego images (images containing secret data) are generated from secret data directly without cover media.
no code implementations • 23 Sep 2021 • Gaiyou Li, Ping Wei, Giorgio Battistelli, Luigi Chisci, Lin Gao
This paper focuses on joint detection, tracking and classification (JDTC) of a target via multi-sensor fusion.
1 code implementation • CVPR 2019 • Ping Wei, Huan Li, Ping Hu
We test our method on the Chinese signature dataset and three other signature datasets in different languages: CEDAR, BHSig-B, and BHSig-H.
no code implementations • CVPR 2018 • Ping Wei, Yang Liu, Tianmin Shu, Nanning Zheng, Song-Chun Zhu
We built a new video dataset of tasks, intentions, and attention.
no code implementations • CVPR 2018 • Lifeng Fan, Yixin Chen, Ping Wei, Wenguan Wang, Song-Chun Zhu
We collect a new dataset VideoCoAtt from public TV show videos, containing 380 complex video sequences with more than 492,000 frames that include diverse social scenes for shared attention study.
no code implementations • ICCV 2017 • Bruce Xiaohan Nie, Ping Wei, Song-Chun Zhu
This paper aims at estimating full-body 3D human poses from monocular images, where the biggest challenge is the inherent ambiguity introduced by lifting the 2D pose into 3D space.
Ranked #113 on 3D Human Pose Estimation on Human3.6M (PA-MPJPE metric)
no code implementations • ICCV 2017 • Yang Liu, Ping Wei, Song-Chun Zhu
Given an egocentric video, a beam search algorithm is applied to jointly recognize the object fluents in each frame and the task of the entire video.
no code implementations • ICCV 2017 • Siyuan Qi, Siyuan Huang, Ping Wei, Song-Chun Zhu
This paper presents a novel method to predict future human activities from partially observed RGB-D videos.