Search Results for author: Joo Hwee Lim

Found 17 papers, 6 papers with code

Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention

no code implementations • 17 Sep 2023 • Burak Satar, Hongyuan Zhu, Hanwang Zhang, Joo Hwee Lim

Many studies focus on improving pretraining or developing new backbones in text-video retrieval.

Action Recognition Graph Generation +4

Paper
Add Code

An Overview of Challenges in Egocentric Text-Video Retrieval

no code implementations • 7 Jun 2023 • Burak Satar, Hongyuan Zhu, Hanwang Zhang, Joo Hwee Lim

Text-video retrieval contains various challenges, including biases coming from diverse sources.

Retrieval Video Retrieval

Paper
Add Code

Is Bio-Inspired Learning Better than Backprop? Benchmarking Bio Learning vs. Backprop

no code implementations • 9 Dec 2022 • Manas Gupta, Sarthak Ketanbhai Modi, Hang Zhang, Joon Hei Lee, Joo Hwee Lim

Four of the five Bio-algorithms tested outperform BP by upto 5% accuracy when only 20% of the training dataset is available.

Benchmarking

Paper
Add Code

The Role of Robust Generalization in Continual Learning: Better Transfer and Less Forgetting

no code implementations • 21 Nov 2022 • Zenglin Shi, Ying Sun, Joo Hwee Lim, Mengmi Zhang

To the best of our knowledge, no existing technique can accomplish all of these objectives simultaneously.

Continual Learning Transfer Learning

Paper
Add Code

Portmanteauing Features for Scene Text Recognition

no code implementations • 9 Nov 2022 • Yew Lee Tan, Ernest Yu Kai Chew, Adams Wai-Kin Kong, Jung-jae Kim, Joo Hwee Lim

To generate the portmanteau feature, a non-linear input pipeline with a block matrix initialization is presented.

Scene Text Recognition

Paper
Add Code

Combined CNN Transformer Encoder for Enhanced Fine-grained Human Action Recognition

no code implementations • 3 Aug 2022 • Mei Chee Leong, Haosong Zhang, Hui Li Tan, Liyuan Li, Joo Hwee Lim

Fine-grained action recognition is a challenging task in computer vision.

Attribute Fine-grained Action Recognition +1

Paper
Add Code

Exploiting Semantic Role Contextualized Video Features for Multi-Instance Text-Video Retrieval EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022

1 code implementation • 29 Jun 2022 • Burak Satar, Hongyuan Zhu, Hanwang Zhang, Joo Hwee Lim

In this report, we present our approach for EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022.

Ranked #9 on Multi-Instance Retrieval on EPIC-KITCHENS-100

Multi-Instance Retrieval Retrieval +3

Paper
Code

Semantic Role Aware Correlation Transformer for Text to Video Retrieval

1 code implementation • 26 Jun 2022 • Burak Satar, Hongyuan Zhu, Xavier Bresson, Joo Hwee Lim

With the emergence of social media, voluminous video clips are uploaded every day, and retrieving the most relevant visual content with a language query becomes critical.

Ranked #13 on Video Retrieval on YouCook2

Retrieval Text to Video Retrieval +1

Paper
Code

RoME: Role-aware Mixture-of-Expert Transformer for Text-to-Video Retrieval

1 code implementation • 26 Jun 2022 • Burak Satar, Hongyuan Zhu, Hanwang Zhang, Joo Hwee Lim

Most methods consider only one joint embedding space between global visual and textual features without considering the local structures of each modality.

Ranked #12 on Video Retrieval on YouCook2

Retrieval Text to Video Retrieval +1

Paper
Code

FashionSearchNet-v2: Learning Attribute Representations with Localization for Image Retrieval with Attribute Manipulation

no code implementations • 28 Nov 2021 • Kenan E. Ak, Joo Hwee Lim, Ying Sun, Jo Yew Tham, Ashraf A. Kassim

A key challenge in e-commerce is that images have multiple attributes where users would like to manipulate and it is important to estimate discriminative feature representations for each of these attributes.

Attribute Image Retrieval +1

Paper
Add Code

Joint Learning On The Hierarchy Representation for Fine-Grained Human Action Recognition

no code implementations • 12 Oct 2021 • Mei Chee Leong, Hui Li Tan, Haosong Zhang, Liyuan Li, Feng Lin, Joo Hwee Lim

Inspired by the recently proposed hierarchy representation of fine-grained actions in FineGym and SlowFast network for action recognition, we propose a novel multi-task network which exploits the FineGym hierarchy representation to achieve effective joint learning and prediction for fine-grained human action recognition.

Action Recognition Multi-Task Learning +1

Paper
Add Code

Prototype Recalls for Continual Learning

no code implementations • 25 Sep 2019 • Mengmi Zhang, Tao Wang, Joo Hwee Lim, Jiashi Feng

Without tampering with the performance on initial tasks, our method learns novel concepts given a few training examples of each class in new tasks.

Continual Learning Metric Learning +1

Paper
Add Code

Variational Prototype Replays for Continual Learning

1 code implementation • 23 May 2019 • Mengmi Zhang, Tao Wang, Joo Hwee Lim, Gabriel Kreiman, Jiashi Feng

In each classification task, our method learns a set of variational prototypes with their means and variances, where embedding of the samples from the same class can be represented in a prototypical distribution and class-representative prototypes are separated apart.

Continual Learning General Classification +2

Paper
Code

Egocentric Spatial Memory

1 code implementation • 31 Jul 2018 • Mengmi Zhang, Keng Teck Ma, Shih-Cheng Yen, Joo Hwee Lim, Qi Zhao, Jiashi Feng

Egocentric spatial memory (ESM) defines a memory system with encoding, storing, recognizing and recalling the spatial information about the environment from an egocentric perspective.

Feature Engineering

Paper
Code

Learning Attribute Representations With Localization for Flexible Fashion Search

no code implementations • CVPR 2018 • Kenan E. Ak, Ashraf A. Kassim, Joo Hwee Lim, Jo Yew Tham

In this paper, we investigate ways of conducting a detailed fashion search using query images and attributes.

Attribute

Paper
Add Code

Egocentric Spatial Memory Network

no code implementations • ICLR 2018 • Mengmi Zhang, Keng Teck Ma, Joo Hwee Lim, Shih-Cheng Yen, Qi Zhao, Jiashi Feng

During the exploration, our proposed ESM network model updates belief of the global map based on local observations using a recurrent neural network.

Navigate Simultaneous Localization and Mapping

Paper
Add Code

Deep Future Gaze: Gaze Anticipation on Egocentric Videos Using Adversarial Networks

1 code implementation • CVPR 2017 • Mengmi Zhang, Keng Teck Ma, Joo Hwee Lim, Qi Zhao, Jiashi Feng

Through competition with discriminator, the generator progressively improves quality of the future frames and thus anticipates future gaze better.

Gaze Prediction

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.