1 code implementation • 16 Feb 2024 • Zhaowei Wang, Wei Fan, Qing Zong, Hongming Zhang, Sehyun Choi, Tianqing Fang, Xin Liu, Yangqiu Song, Ginny Y. Wong, Simon See
Abstraction ability is crucial in human intelligence, which can also benefit various tasks in NLP study.
no code implementations • 29 Jan 2024 • Xiaoyu Shi, Zhaoyang Huang, Fu-Yun Wang, Weikang Bian, Dasong Li, Yi Zhang, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li
For the first stage, we propose a diffusion-based motion field predictor, which focuses on deducing the trajectories of the reference image's pixels.
no code implementations • 26 Jan 2024 • Xingzhi Zhou, Zhiliang Tian, Ka Chun Cheung, Simon See, Nevin L. Zhang
Test-time domain adaptation effectively adjusts the source domain model to accommodate unseen domain shifts in a target domain during inference.
1 code implementation • 8 Oct 2023 • Qing Zong, Zhaowei Wang, Baixuan Xu, Tianshi Zheng, Haochen Shi, Weiqi Wang, Yangqiu Song, Ginny Y. Wong, Simon See
A main goal of Argument Mining (AM) is to analyze an author's stance.
1 code implementation • 15 Sep 2023 • Chunkit Chan, Xin Liu, Tsz Ho Chan, Jiayang Cheng, Yangqiu Song, Ginny Wong, Simon See
However, the inter-sentential coherence and the model consistency have not been well exploited in the previous works on this task.
no code implementations • ICCV 2023 • Lanyun Zhu, Tianrun Chen, Jianxiong Yin, Simon See, Jun Liu
We innovatively utilize Gabor filters as a powerful extractor to exploit texture features, motivated by the capability of Gabor filters in effectively capturing multi-frequency features and detailed local information.
1 code implementation • ICCV 2023 • Ziyuan Luo, Qing Guo, Ka Chun Cheung, Simon See, Renjie Wan
Neural Radiance Fields (NeRF) have the potential to be a major representation of media.
1 code implementation • 14 Jun 2023 • Meng Shen, Yizheng Huang, Jianxiong Yin, Heqing Zou, Deepu Rajan, Simon See
Our studies demonstrate that the proposed method achieves more balanced multimodal learning by avoiding greedy sample selection from the dominant modality.
1 code implementation • ICCV 2023 • Xuesong Chen, Shaoshuai Shi, Chao Zhang, Benjin Zhu, Qiang Wang, Ka Chun Cheung, Simon See, Hongsheng Li
3D multi-object tracking (MOT) is vital for many applications including autonomous driving vehicles and service robots.
1 code implementation • 4 Jun 2023 • Wei-Fang Sun, Cheng-Kuang Lee, Simon See, Chun-Yi Lee
In fully cooperative multi-agent reinforcement learning (MARL) settings, environments are highly stochastic due to the partial observability of each agent and the continuously changing policies of other agents.
Ranked #1 on SMAC on SMAC 26m_vs_30m
1 code implementation • 9 May 2023 • Zhaowei Wang, Quyet V. Do, Hongming Zhang, Jiayao Zhang, Weiqi Wang, Tianqing Fang, Yangqiu Song, Ginny Y. Wong, Simon See
This paper proposes a new task to detect commonsense causation between two events in an event sequence (i. e., context), called contextualized commonsense causal reasoning.
1 code implementation • 6 May 2023 • Chunkit Chan, Xin Liu, Jiayang Cheng, Zihan Li, Yangqiu Song, Ginny Y. Wong, Simon See
Implicit Discourse Relation Recognition (IDRR) is a sophisticated and challenging task to recognize the discourse relations between the arguments with the absence of discourse connectives.
1 code implementation • 6 May 2023 • ZiHao Wang, Weizhi Fei, Hang Yin, Yangqiu Song, Ginny Y. Wong, Simon See
In contrast to existing scoring functions motivated by local comparison or global transport, this work investigates the local and global trade-off with unbalanced optimal transport theory.
no code implementations • CVPR 2023 • Lanyun Zhu, Tianrun Chen, Jianxiong Yin, Simon See, Jun Liu
Continual Semantic Segmentation (CSS) extends static semantic segmentation by incrementally introducing new classes for training.
1 code implementation • ICCV 2023 • Xiaoyu Shi, Zhaoyang Huang, Weikang Bian, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li
We first propose a TRi-frame Optical Flow (TROF) module that estimates bi-directional optical flows for the center frame in a three-frame manner.
1 code implementation • CVPR 2023 • Xiaoyu Shi, Zhaoyang Huang, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li
FlowFormer introduces a transformer architecture into optical flow estimation and achieves state-of-the-art performance.
1 code implementation • 21 Jan 2023 • ZiHao Wang, Yangqiu Song, Ginny Y. Wong, Simon See
On top of the query graph, we propose the Logical Message Passing Neural Network (LMPNN) that connects the local one-hop inferences on atomic formulas to the global logical reasoning for complex query answering.
1 code implementation • 1 Jan 2023 • Huaizheng Zhang, Yuanming Li, Wencong Xiao, Yizheng Huang, Xing Di, Jianxiong Yin, Simon See, Yong Luo, Chiew Tong Lau, Yang You
The vision of this paper is to provide a more comprehensive and practical benchmark study for MIG in order to eliminate the need for tedious manual benchmarking and tuning efforts.
no code implementations • 17 Dec 2022 • Tsung-Ming Tai, Giuseppe Fiameni, Cheng-Kuang Lee, Simon See, Oswald Lanz
Consequently, existing solutions based on the action recognition models are only suboptimal.
1 code implementation • 23 Nov 2022 • Xin He, Jiangchao Yao, Yuxin Wang, Zhenheng Tang, Ka Chu Cheung, Simon See, Bo Han, Xiaowen Chu
One-shot neural architecture search (NAS) substantially improves the search efficiency by training one supernet to estimate the performance of every possible child architecture (i. e., subnet).
1 code implementation • 7 Nov 2022 • Huiru Xiao, Xin Liu, Yangqiu Song, Ginny Y. Wong, Simon See
However, the performance of the hyperbolic KG embedding models for non-transitive relations is still unpromising, while the complex hyperbolic embeddings do not deal with multi-relations.
1 code implementation • 2 Nov 2022 • Jun Wang, Abhir Bhalerao, Terry Yin, Simon See, Yulan He
Radiology report generation (RRG) has gained increasing research attention because of its huge potential to mitigate medical resource shortages and aid the process of disease decision making by radiologists.
1 code implementation • 14 Oct 2022 • Tianqing Fang, Quyet V. Do, Hongming Zhang, Yangqiu Song, Ginny Y. Wong, Simon See
We propose PseudoReasoner, a semi-supervised learning framework for CSKB population that uses a teacher model pre-trained on CSKBs to provide pseudo labels on the unlabeled candidate dataset for a student model to learn from.
1 code implementation • 13 Oct 2022 • Zhaowei Wang, Hongming Zhang, Tianqing Fang, Yangqiu Song, Ginny Y. Wong, Simon See
In this paper, we propose a new task of sub-event generation for an unseen process to evaluate the understanding of the coherence of sub-event actions and objects.
no code implementations • 4 Jul 2022 • Sixing Yan, William K. Cheung, Keith Chiu, Terence M. Tong, Charles K. Cheung, Simon See
In this paper, we introduce a novel fined-grained knowledge graph structure called an attributed abnormality graph (ATAG).
no code implementations • 22 Jun 2022 • Tsung-Ming Tai, Oswald Lanz, Giuseppe Fiameni, Yi-Kwan Wong, Sze-Sen Poon, Cheng-Kuang Lee, Ka-Chun Cheung, Simon See
In this report, we describe the technical details of our submission for the EPIC-Kitchen-100 action anticipation challenge.
1 code implementation • CVPR 2023 • Dasong Li, Xiaoyu Shi, Yi Zhang, Ka Chun Cheung, Simon See, Xiaogang Wang, Hongwei Qin, Hongsheng Li
In this study, we propose a simple yet effective framework for video restoration.
Ranked #1 on Deblurring on GoPro (using extra training data)
1 code implementation • 2 Jun 2022 • Tsung-Ming Tai, Giuseppe Fiameni, Cheng-Kuang Lee, Simon See, Oswald Lanz
To this end, we propose a unified recurrence modeling for video action anticipation via message passing framework.
no code implementations • 18 Oct 2021 • Yi-Chen Chen, Shu-wen Yang, Cheng-Kuang Lee, Simon See, Hung-Yi Lee
It has been shown that an SSL pretraining model can achieve excellent performance in various downstream tasks of speech processing.
no code implementations • 26 Sep 2021 • Haozhi Cao, Yuecong Xu, Jianfei Yang, Kezhi Mao, Lihua Xie, Jianxiong Yin, Simon See
This paper introduces a novel self-supervised method that leverages incoherence detection for video representation learning.
no code implementations • 11 Jul 2021 • Yuecong Xu, Jianfei Yang, Haozhi Cao, Kezhi Mao, Jianxiong Yin, Simon See
Yet correlation features of the same action would differ across domains due to domain shift.
no code implementations • 6 Jun 2021 • Hyun Gon Ryu, Jeong-Hoon Kim, Simon See
The proposed method is expected to adapt for researching on neural network models capable of synthesizing speech at the studio recording level.
no code implementations • 26 Aug 2020 • Haozhi Cao, Yuecong Xu, Jianfei Yang, Kezhi Mao, Jianxiong Yin, Simon See
Temporal feature extraction is an essential technique in video-based action recognition.
no code implementations • 9 Jun 2020 • Yuecong Xu, Haozhi Cao, Jianfei Yang, Kezhi Mao, Jianxiong Yin, Simon See
Empirical results prove the effectiveness and efficiency of our PNL module, which achieves state-of-the-art performance of 83. 09% on the Mini-Kinetics dataset, with decreased computation cost compared to the non-local block.
1 code implementation • 6 Jun 2020 • Yuecong Xu, Jianfei Yang, Haozhi Cao, Kezhi Mao, Jianxiong Yin, Simon See
We bridge the gap of the lack of data for this task by collecting a new dataset: the Action Recognition in the Dark (ARID) dataset.
no code implementations • 6 May 2020 • Yuecong Xu, Jianfei Yang, Kezhi Mao, Jianxiong Yin, Simon See
Temporal feature extraction is an important issue in video-based action recognition.
no code implementations • 8 Mar 2020 • Zhangsheng Lai, Aik Beng Ng, Liang Ze Wong, Simon See, Shaowei Lin
Reasoning over knowledge graphs is traditionally built upon a hierarchy of languages in the Semantic Web Stack.
1 code implementation • 20 Nov 2019 • Shaohuai Shi, Xiaowen Chu, Ka Chun Cheung, Simon See
Distributed stochastic gradient descent (SGD) algorithms are widely deployed in training large-scale deep learning models, while the communication overhead among workers becomes the new system bottleneck.
1 code implementation • 9 Jul 2019 • Qingyi Tao, ZongYuan Ge, Jianfei Cai, Jianxiong Yin, Simon See
Secondly, in CT scans, the lesions are often indistinguishable from the background since the lesion and non-lesion areas may have very similar appearances.
no code implementations • 10 Oct 2018 • Lei Ma, Felix Juefei-Xu, Minhui Xue, Qiang Hu, Sen Chen, Bo Li, Yang Liu, Jianjun Zhao, Jianxiong Yin, Simon See
Over the past decades, deep learning (DL) systems have achieved tremendous success and gained great popularity in various applications, such as intelligent machines, image processing, speech processing, and medical diagnostics.
no code implementations • 4 Sep 2018 • Xiaofei Xie, Lei Ma, Felix Juefei-Xu, Hongxu Chen, Minhui Xue, Bo Li, Yang Liu, Jianjun Zhao, Jianxiong Yin, Simon See
In company with the data explosion over the past decade, deep neural network (DNN) based software has experienced unprecedented leap and is becoming the key driving force of many novel industrial applications, including many safety-critical scenarios such as autonomous driving.
1 code implementation • CVPR 2018 • Jason Kuen, Xiangfei Kong, Zhe Lin, Gang Wang, Jianxiong Yin, Simon See, Yap-Peng Tan
We propose a novel approach for cost-adjustable inference in CNNs - Stochastic Downsampling Point (SDPoint).
1 code implementation • 4 Jun 2017 • Xinyu Fu, Eugene Ch'ng, Uwe Aickelin, Simon See
We observe that MGU achieves the optimal runtime and comparable performance against GRU and LSTM.
no code implementations • 3 Aug 2016 • Jinghua Wang, Zhenhua Wang, DaCheng Tao, Simon See, Gang Wang
In this paper, we tackle the problem of RGB-D semantic segmentation of indoor images.