1 code implementation • 12 Dec 2022 • Jiawei Mao, Honggu Zhou, Xuesong Yin, Yuanqi Chang, Binling Nie, Rui Xu
This results in ViT performing worse than CNNs on small datasets, such as those from medicine and science.
no code implementations • 22 Nov 2022 • Honggu Zhou, Xiaogang Peng, Jiawei Mao, Zizhao Wu, Ming Zeng
To address this, we propose PointCMC, a novel cross-modal method that models multi-scale correspondences across modalities for self-supervised point cloud representation learning.
no code implementations • 21 May 2022 • Jiawei Mao, Xuesong Yin, Yuanqi Chang, Honggu Zhou
The MIM paradigm enables the model to learn the main object features of an image by masking the input image and predicting the masked part from the unmasked part.
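
The masking step described above can be illustrated with a minimal sketch. It is not the authors' implementation: the helper names (`patchify`, `random_mask`, `masked_mse`), the 8-pixel patch size, the 0.75 mask ratio, and the mean-of-visible-patches "prediction" are all assumptions chosen for illustration, with the placeholder standing in for a real encoder and decoder.

```python
import numpy as np

def patchify(image, patch):
    """Split an (H, W, C) image into (N, patch*patch*C) flattened patches."""
    h, w, c = image.shape
    assert h % patch == 0 and w % patch == 0
    gh, gw = h // patch, w // patch
    x = image.reshape(gh, patch, gw, patch, c)
    x = x.transpose(0, 2, 1, 3, 4)  # group the two grid axes together
    return x.reshape(gh * gw, patch * patch * c)

def random_mask(num_patches, mask_ratio, rng):
    """Return a boolean vector that is True where a patch is masked."""
    num_masked = int(round(num_patches * mask_ratio))
    mask = np.zeros(num_patches, dtype=bool)
    mask[rng.choice(num_patches, size=num_masked, replace=False)] = True
    return mask

def masked_mse(pred, target, mask):
    """Mean squared error computed over the masked patches only."""
    return float(np.mean((pred[mask] - target[mask]) ** 2))

rng = np.random.default_rng(0)
image = rng.random((32, 32, 3))          # hypothetical input image
patches = patchify(image, patch=8)       # 16 patches, 192 values each
mask = random_mask(len(patches), mask_ratio=0.75, rng=rng)
visible = patches[~mask]                 # the part a model would see

# Placeholder prediction (assumption): repeat the mean visible patch.
# A real MIM model would predict masked patches with a learned network.
pred = np.tile(visible.mean(axis=0), (len(patches), 1))
loss = masked_mse(pred, patches, mask)
```

Restricting the loss to masked patches is what makes the task predictive: the model gets no credit for copying the patches it was shown.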