1 code implementation • 7 Mar 2024 • Shijie Ma, Fei Zhu, Zhun Zhong, Xu-Yao Zhang, Cheng-Lin Liu
Generalized Category Discovery (GCD) is a pragmatic and challenging open-world task, which endeavors to cluster unlabeled samples from both novel and old classes, leveraging some labeled data of old classes.
1 code implementation • NeurIPS 2023 • Yuxin Guo, Shijie Ma, Hu Su, Zhiqing Wang, Yuhao Zhao, Wei Zou, Siyang Sun, Yun Zheng
Audio-Visual Source Localization (AVSL) aims to locate sounding objects within video frames given the paired audio clips.
no code implementations • 5 Mar 2024 • Yuxin Guo, Shijie Ma, Yuhao Zhao, Hu Su, Wei Zou
Audio-Visual Source Localization (AVSL) is the task of identifying specific sounding objects in the scene given audio cues.
no code implementations • 4 Mar 2024 • Fei Zhu, Shijie Ma, Zhen Cheng, Xu-Yao Zhang, Zhaoxiang Zhang, Cheng-Lin Liu
This paper aims to provide a comprehensive introduction to the emerging open-world machine learning paradigm, to help researchers build more powerful AI systems in their respective fields, and to promote the development of artificial general intelligence.
no code implementations • 2 Nov 2023 • Shijie Ma, Huayi Xu, Mengjian Li, Weidong Geng, Meng Wang, Yaxiong Wang
This paper targets to enhance the diffusion-based text-to-video generation by improving the two input prompts, including the noise and the text.
1 code implementation • 18 Jul 2023 • Shijie Ma, Fei Zhu, Zhen Cheng, Xu-Yao Zhang
By distilling both InD samples and outliers, the condensed datasets are capable to train models competent in both InD classification and OOD detection.
1 code implementation • 2 Mar 2022 • Yihan Lin, Yifan Hu, Shijie Ma, Guoqi Li, Dongjie Yu
In this work, a new SNN training paradigm is proposed by combining the concepts of the two different training methods with the help of the pretrain technique and BP-based deep SNN training mechanism.