Search Results for author: Simon See

We innovatively utilize Gabor filters as a powerful extractor to exploit texture features, motivated by the capability of Gabor filters in effectively capturing multi-frequency features and detailed local information.
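
As a rough illustration of why Gabor filters suit multi-frequency texture extraction, the sketch below builds a small bank of real-valued Gabor kernels (a Gaussian envelope multiplied by a cosine carrier) at several wavelengths and orientations. The function name `gabor_kernel`, its parameters, and the chosen wavelengths and angles are assumptions for illustration and are not taken from the paper.

```python
import math

def gabor_kernel(size, wavelength, theta, sigma, gamma=1.0, psi=0.0):
    """Real part of a 2D Gabor filter as a size x size list of lists (size must be odd)."""
    if size % 2 == 0:
        raise ValueError("size must be odd")
    half = size // 2
    kernel = []
    for y in range(-half, half + 1):
        row = []
        for x in range(-half, half + 1):
            # Rotate coordinates so the carrier wave runs along direction theta.
            xr = x * math.cos(theta) + y * math.sin(theta)
            yr = -x * math.sin(theta) + y * math.cos(theta)
            # Gaussian envelope localizes the response in space.
            envelope = math.exp(-(xr ** 2 + (gamma * yr) ** 2) / (2 * sigma ** 2))
            # Cosine carrier selects one spatial frequency (1 / wavelength).
            carrier = math.cos(2 * math.pi * xr / wavelength + psi)
            row.append(envelope * carrier)
        kernel.append(row)
    return kernel

# Hypothetical bank: 2 wavelengths x 4 orientations = 8 kernels.
bank = [
    gabor_kernel(7, wavelength=w, theta=t, sigma=2.0)
    for w in (3.0, 6.0)
    for t in (0.0, math.pi / 4, math.pi / 2, 3 * math.pi / 4)
]
```

Convolving an image with each kernel in such a bank yields one response map per frequency and orientation, which is the sense in which the filters capture multi-frequency, local detail.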

CopyRNeRF: Protecting the CopyRight of Neural Radiance Fields

1 code implementation • ICCV 2023 • Ziyuan Luo, Qing Guo, Ka Chun Cheung, Simon See, Renjie Wan

Neural Radiance Fields (NeRF) have the potential to be a major representation of media.

Towards Balanced Active Learning for Multimodal Classification

1 code implementation • 14 Jun 2023 • Meng Shen, Yizheng Huang, Jianxiong Yin, Heqing Zou, Deepu Rajan, Simon See

Our studies demonstrate that the proposed method achieves more balanced multimodal learning by avoiding greedy sample selection from the dominant modality.

Active Learning Classification +1

TrajectoryFormer: 3D Object Tracking Transformer with Predictive Trajectory Hypotheses

1 code implementation • ICCV 2023 • Xuesong Chen, Shaoshuai Shi, Chao Zhang, Benjin Zhu, Qiang Wang, Ka Chun Cheung, Simon See, Hongsheng Li

3D multi-object tracking (MOT) is vital for many applications including autonomous driving vehicles and service robots.

3D Multi-Object Tracking 3D Object Tracking +2

A Unified Framework for Factorizing Distributional Value Functions for Multi-Agent Reinforcement Learning

1 code implementation • 4 Jun 2023 • Wei-Fang Sun, Cheng-Kuang Lee, Simon See, Chun-Yi Lee

In fully cooperative multi-agent reinforcement learning (MARL) settings, environments are highly stochastic due to the partial observability of each agent and the continuously changing policies of other agents.

Ranked #1 on SMAC on SMAC 26m_vs_30m

reinforcement-learning SMAC +1

COLA: Contextualized Commonsense Causal Reasoning from the Causal Inference Perspective

1 code implementation • 9 May 2023 • Zhaowei Wang, Quyet V. Do, Hongming Zhang, Jiayao Zhang, Weiqi Wang, Tianqing Fang, Yangqiu Song, Ginny Y. Wong, Simon See

This paper proposes a new task to detect commonsense causation between two events in an event sequence (i.e., context), called contextualized commonsense causal reasoning.

Causal Inference CoLA +1

DiscoPrompt: Path Prediction Prompt Tuning for Implicit Discourse Relation Recognition

1 code implementation • 6 May 2023 • Chunkit Chan, Xin Liu, Jiayang Cheng, Zihan Li, Yangqiu Song, Ginny Y. Wong, Simon See

Implicit Discourse Relation Recognition (IDRR) is a sophisticated and challenging task: recognizing the discourse relations between arguments in the absence of discourse connectives.

Relation text-classification +1

Wasserstein-Fisher-Rao Embedding: Logical Query Embeddings with Local Comparison and Global Transport

1 code implementation • 6 May 2023 • ZiHao Wang, Weizhi Fei, Hang Yin, Yangqiu Song, Ginny Y. Wong, Simon See

In contrast to existing scoring functions motivated by local comparison or global transport, this work investigates the local and global trade-off with unbalanced optimal transport theory.

Knowledge Graphs Logical Reasoning

Continual Semantic Segmentation with Automatic Memory Sample Selection

no code implementations • CVPR 2023 • Lanyun Zhu, Tianrun Chen, Jianxiong Yin, Simon See, Jun Liu

Continual Semantic Segmentation (CSS) extends static semantic segmentation by incrementally introducing new classes for training.

Continual Semantic Segmentation Decision Making +1

VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow Estimation

1 code implementation • ICCV 2023 • Xiaoyu Shi, Zhaoyang Huang, Weikang Bian, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li

We first propose a TRi-frame Optical Flow (TROF) module that estimates bi-directional optical flows for the center frame in a three-frame manner.

Optical Flow Estimation

FlowFormer++: Masked Cost Volume Autoencoding for Pretraining Optical Flow Estimation

1 code implementation • CVPR 2023 • Xiaoyu Shi, Zhaoyang Huang, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li

FlowFormer introduces a transformer architecture into optical flow estimation and achieves state-of-the-art performance.

Optical Flow Estimation

Logical Message Passing Networks with One-hop Inference on Atomic Formulas

1 code implementation • 21 Jan 2023 • ZiHao Wang, Yangqiu Song, Ginny Y. Wong, Simon See

On top of the query graph, we propose the Logical Message Passing Neural Network (LMPNN) that connects the local one-hop inferences on atomic formulas to the global logical reasoning for complex query answering.

Complex Query Answering Graph Representation Learning +1

MIGPerf: A Comprehensive Benchmark for Deep Learning Training and Inference Workloads on Multi-Instance GPUs

1 code implementation • 1 Jan 2023 • Huaizheng Zhang, Yuanming Li, Wencong Xiao, Yizheng Huang, Xing Di, Jianxiong Yin, Simon See, Yong Luo, Chiew Tong Lau, Yang You

The vision of this paper is to provide a more comprehensive and practical benchmark study for MIG in order to eliminate the need for tedious manual benchmarking and tuning efforts.

Benchmarking

Inductive Attention for Video Action Anticipation

no code implementations • 17 Dec 2022 • Tsung-Ming Tai, Giuseppe Fiameni, Cheng-Kuang Lee, Simon See, Oswald Lanz

Consequently, existing solutions based on the action recognition models are only suboptimal.

Action Anticipation Action Recognition +1

NAS-LID: Efficient Neural Architecture Search with Local Intrinsic Dimension

1 code implementation • 23 Nov 2022 • Xin He, Jiangchao Yao, Yuxin Wang, Zhenheng Tang, Ka Chu Cheung, Simon See, Bo Han, Xiaowen Chu

One-shot neural architecture search (NAS) substantially improves the search efficiency by training one supernet to estimate the performance of every possible child architecture (i.e., subnet).

Neural Architecture Search

Complex Hyperbolic Knowledge Graph Embeddings with Fast Fourier Transform

1 code implementation • 7 Nov 2022 • Huiru Xiao, Xin Liu, Yangqiu Song, Ginny Y. Wong, Simon See

However, the performance of the hyperbolic KG embedding models for non-transitive relations is still unpromising, while the complex hyperbolic embeddings do not deal with multi-relations.

Knowledge Graph Embeddings

CAMANet: Class Activation Map Guided Attention Network for Radiology Report Generation

1 code implementation • 2 Nov 2022 • Jun Wang, Abhir Bhalerao, Terry Yin, Simon See, Yulan He

Radiology report generation (RRG) has gained increasing research attention because of its huge potential to mitigate medical resource shortages and aid the process of disease decision making by radiologists.

Decision Making

PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Population

1 code implementation • 14 Oct 2022 • Tianqing Fang, Quyet V. Do, Hongming Zhang, Yangqiu Song, Ginny Y. Wong, Simon See

We propose PseudoReasoner, a semi-supervised learning framework for CSKB population that uses a teacher model pre-trained on CSKBs to provide pseudo labels on the unlabeled candidate dataset for a student model to learn from.

Domain Generalization Knowledge Base Population

SubeventWriter: Iterative Sub-event Sequence Generation with Coherence Controller

1 code implementation • 13 Oct 2022 • Zhaowei Wang, Hongming Zhang, Tianqing Fang, Yangqiu Song, Ginny Y. Wong, Simon See

In this paper, we propose a new task of sub-event generation for an unseen process to evaluate the understanding of the coherence of sub-event actions and objects.

Attributed Abnormality Graph Embedding for Clinically Accurate X-Ray Report Generation

no code implementations • 4 Jul 2022 • Sixing Yan, William K. Cheung, Keith Chiu, Terence M. Tong, Charles K. Cheung, Simon See

In this paper, we introduce a novel fine-grained knowledge graph structure called an attributed abnormality graph (ATAG).

Attribute Decoder +2

NVIDIA-UNIBZ Submission for EPIC-KITCHENS-100 Action Anticipation Challenge 2022

no code implementations • 22 Jun 2022 • Tsung-Ming Tai, Oswald Lanz, Giuseppe Fiameni, Yi-Kwan Wong, Sze-Sen Poon, Cheng-Kuang Lee, Ka-Chun Cheung, Simon See

In this report, we describe the technical details of our submission for the EPIC-KITCHENS-100 action anticipation challenge.

Action Anticipation

A Simple Baseline for Video Restoration with Grouped Spatial-temporal Shift

1 code implementation • CVPR 2023 • Dasong Li, Xiaoyu Shi, Yi Zhang, Ka Chun Cheung, Simon See, Xiaogang Wang, Hongwei Qin, Hongsheng Li

In this study, we propose a simple yet effective framework for video restoration.

Ranked #1 on Deblurring on GoPro (using extra training data)

Deblurring Denoising +3

Unified Recurrence Modeling for Video Action Anticipation

1 code implementation • 2 Jun 2022 • Tsung-Ming Tai, Giuseppe Fiameni, Cheng-Kuang Lee, Simon See, Oswald Lanz

To this end, we propose unified recurrence modeling for video action anticipation via a message passing framework.

Action Anticipation Decision Making

Speech Representation Learning Through Self-supervised Pretraining And Multi-task Finetuning

no code implementations • 18 Oct 2021 • Yi-Chen Chen, Shu-wen Yang, Cheng-Kuang Lee, Simon See, Hung-Yi Lee

It has been shown that an SSL pretraining model can achieve excellent performance in various downstream tasks of speech processing.

Multi-Task Learning Representation Learning +1

Self-Supervised Video Representation Learning by Video Incoherence Detection

no code implementations • 26 Sep 2021 • Haozhi Cao, Yuecong Xu, Jianfei Yang, Kezhi Mao, Lihua Xie, Jianxiong Yin, Simon See

This paper introduces a novel self-supervised method that leverages incoherence detection for video representation learning.

Action Recognition Contrastive Learning +3

Aligning Correlation Information for Domain Adaptation in Action Recognition

no code implementations • 11 Jul 2021 • Yuecong Xu, Jianfei Yang, Haozhi Cao, Kezhi Mao, Jianxiong Yin, Simon See

Yet correlation features of the same action would differ across domains due to domain shift.

Action Recognition Domain Adaptation +1

Mathematical Vocoder Algorithm : Modified Spectral Inversion for Efficient Neural Speech Synthesis

no code implementations • 6 Jun 2021 • Hyun Gon Ryu, Jeong-Hoon Kim, Simon See

The proposed method is expected to be useful for research on neural network models capable of synthesizing speech at studio-recording quality.

Speech Synthesis

Effective Action Recognition with Embedded Key Point Shifts

no code implementations • 26 Aug 2020 • Haozhi Cao, Yuecong Xu, Jianfei Yang, Kezhi Mao, Jianxiong Yin, Simon See

Temporal feature extraction is an essential technique in video-based action recognition.

Action Recognition Skeleton Based Action Recognition

PNL: Efficient Long-Range Dependencies Extraction with Pyramid Non-Local Module for Action Recognition

no code implementations • 9 Jun 2020 • Yuecong Xu, Haozhi Cao, Jianfei Yang, Kezhi Mao, Jianxiong Yin, Simon See

Empirical results prove the effectiveness and efficiency of our PNL module, which achieves state-of-the-art performance of 83.09% on the Mini-Kinetics dataset, with decreased computation cost compared to the non-local block.

Action Recognition

ARID: A New Dataset for Recognizing Action in the Dark

1 code implementation • 6 Jun 2020 • Yuecong Xu, Jianfei Yang, Haozhi Cao, Kezhi Mao, Jianxiong Yin, Simon See

We address the lack of data for this task by collecting a new dataset: the Action Recognition in the Dark (ARID) dataset.

Action Recognition

Exploiting Inter-Frame Regional Correlation for Efficient Action Recognition

no code implementations • 6 May 2020 • Yuecong Xu, Jianfei Yang, Kezhi Mao, Jianxiong Yin, Simon See

Temporal feature extraction is an important issue in video-based action recognition.

Action Recognition Optical Flow Estimation

Dependently Typed Knowledge Graphs

no code implementations • 8 Mar 2020 • Zhangsheng Lai, Aik Beng Ng, Liang Ze Wong, Simon See, Shaowei Lin

Reasoning over knowledge graphs is traditionally built upon a hierarchy of languages in the Semantic Web Stack.

Knowledge Graphs

Understanding Top-k Sparsification in Distributed Deep Learning

1 code implementation • 20 Nov 2019 • Shaohuai Shi, Xiaowen Chu, Ka Chun Cheung, Simon See

Distributed stochastic gradient descent (SGD) algorithms are widely deployed in training large-scale deep learning models, while the communication overhead among workers becomes the new system bottleneck.
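
To make the communication saving concrete, here is a minimal sketch of generic Top-k gradient sparsification: each worker sends only the k largest-magnitude gradient entries and keeps the rest locally. The function name `topk_sparsify`, the dictionary encoding of the sparse message, and the local residual are illustrative assumptions, not the specific scheme analyzed in the paper.

```python
import heapq

def topk_sparsify(grad, k):
    """Split grad into the k largest-magnitude entries and the leftover residual."""
    if not 0 <= k <= len(grad):
        raise ValueError("k must be between 0 and len(grad)")
    # Indices of the k entries with the largest absolute value.
    keep = set(heapq.nlargest(k, range(len(grad)), key=lambda i: abs(grad[i])))
    # Sparse message a worker would communicate: index -> value.
    sparse = {i: grad[i] for i in sorted(keep)}
    # Entries not sent stay on the worker (assumed here to be accumulated later).
    residual = [0.0 if i in keep else g for i, g in enumerate(grad)]
    return sparse, residual

grad = [0.1, -2.0, 0.05, 1.5, -0.3]
sparse, residual = topk_sparsify(grad, k=2)
# sparse   == {1: -2.0, 3: 1.5}
# residual == [0.1, 0.0, 0.05, 0.0, -0.3]
```

With k much smaller than the gradient length, the message shrinks from one value per parameter to k index-value pairs, which is the source of the reduced communication overhead.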

Improving Deep Lesion Detection Using 3D Contextual and Spatial Attention

1 code implementation • 9 Jul 2019 • Qingyi Tao, ZongYuan Ge, Jianfei Cai, Jianxiong Yin, Simon See

Secondly, in CT scans, the lesions are often indistinguishable from the background since the lesion and non-lesion areas may have very similar appearances.

Computed Tomography (CT) Lesion Detection +2

Secure Deep Learning Engineering: A Software Quality Assurance Perspective

no code implementations • 10 Oct 2018 • Lei Ma, Felix Juefei-Xu, Minhui Xue, Qiang Hu, Sen Chen, Bo Li, Yang Liu, Jianjun Zhao, Jianxiong Yin, Simon See

Over the past decades, deep learning (DL) systems have achieved tremendous success and gained great popularity in various applications, such as intelligent machines, image processing, speech processing, and medical diagnostics.

DeepHunter: Hunting Deep Neural Network Defects via Coverage-Guided Fuzzing

no code implementations • 4 Sep 2018 • Xiaofei Xie, Lei Ma, Felix Juefei-Xu, Hongxu Chen, Minhui Xue, Bo Li, Yang Liu, Jianjun Zhao, Jianxiong Yin, Simon See

Along with the data explosion over the past decade, deep neural network (DNN) based software has experienced an unprecedented leap and is becoming the key driving force of many novel industrial applications, including many safety-critical scenarios such as autonomous driving.

Autonomous Driving Quantization