no code implementations • 21 Mar 2024 • Mamshad Nayeem Rizve, Fan Fei, Jayakrishnan Unnikrishnan, Son Tran, Benjamin Z. Yao, Belinda Zeng, Mubarak Shah, Trishul Chilimbi
To effectively address this limitation, we instead keep the network architecture simple and use a set of data tokens that operate at different temporal resolutions in a hierarchical manner, accounting for the temporally hierarchical nature of videos.
no code implementations • 3 Sep 2023 • Son Tran, Cong Tran, Anh Tran, Cuong Pham
In this paper, we push forward the state-of-the-art performance of unsupervised MOT methods by proposing UnsMOT, a novel framework that explicitly combines the appearance and motion features of objects with geometric information to provide more accurate tracking.
no code implementations • 1 Aug 2023 • Steve J. Bickley, Ho Fai Chan, Bang Dao, Benno Torgler, Son Tran
This white paper presents our work on SurveyLM, a platform for analyzing augmented language models' (ALMs) emergent alignment behaviors through their dynamically evolving attitude and value perspectives in complex social contexts.
no code implementations • CVPR 2022 • Jiali Duan, Liqun Chen, Son Tran, Jinyu Yang, Yi Xu, Belinda Zeng, Trishul Chilimbi
Aligning signals from different modalities is an important step in vision-language representation learning as it affects the performance of later stages such as cross-modality fusion.
1 code implementation • CVPR 2022 • Jinyu Yang, Jiali Duan, Son Tran, Yi Xu, Sampath Chanda, Liqun Chen, Belinda Zeng, Trishul Chilimbi, Junzhou Huang
Besides CMA, TCL introduces an intra-modal contrastive objective to provide complementary benefits in representation learning.
Ranked #3 on Zero-Shot Cross-Modal Retrieval on COCO 2014
no code implementations • 19 Dec 2021 • Renjie Li, Son Tran, Saurabh Garg, Katherine Lawler, Jane Alty, Quan Bai
Keypoint detection plays an important role in a wide range of applications.
no code implementations • CVPR 2021 • Jiali Duan, Yen-Liang Lin, Son Tran, Larry S. Davis, C. -C. Jay Kuo
We first train a teacher model on the labeled data and use it to generate pseudo labels for the unlabeled data.
no code implementations • 24 Sep 2020 • Xinping Liu, Zehong Cao, Son Tran
Word embeddings can reflect the semantic representations, and the embedding qualities can be comprehensively evaluated with human natural reading-related cognitive data sources.
1 code implementation • CVPR 2020 • Yen-Liang Lin, Son Tran, Larry S. Davis
We evaluate our method on the outfit compatibility, FITB and new retrieval tasks.
no code implementations • 4 Jul 2019 • Son Tran, Ming Du, Sampath Chanda, R. Manmatha, Cj Taylor
In particular, Instagram and Twitter influencers often provide images of themselves wearing different outfits and their followers are often inspired to buy similar clothes. We propose a system to automatically find the closest visually similar clothes in the online Catalog (street-to-shop searching).