no code implementations • 20 Mar 2024 • Jianhao Xie, Ruofan Liao, Ziang Zhang, Sida Yi, Yuesheng Zhu, Guibo Luo
To address these issues, we propose a segmentation model based on Prompt-Mamba, which incorporates the latest Vision-Mamba and prompt technologies.
no code implementations • 11 Mar 2024 • Jianhao Xie, Ziang Zhang, Guibo Luo, Yuesheng Zhu
Large pre-trained models with their numerous model parameters and extensive training datasets have shown excellent performance in various tasks.
1 code implementation • 8 Mar 2024 • Yuxi Liu, Guibo Luo, Yuesheng Zhu
The Segment Anything Model (SAM) serves as a powerful foundation model for visual segmentation and can be adapted for medical image segmentation.
1 code implementation • 4 Jan 2024 • Zhaokun Zhou, Kaiwei Che, Wei Fang, Keyu Tian, Yuesheng Zhu, Shuicheng Yan, Yonghong Tian, Li Yuan
To the best of our knowledge, this is the first time that the SNN achieves 80+% accuracy on ImageNet.
no code implementations • 14 Dec 2023 • Yuqing Wang, Zhenyu Weng, Zhaokun Zhou, Shuaijian Ji, Zhongjie Ye, Yuesheng Zhu
Over the past years, Printed Mathematical Expression Recognition (PMER) has progressed rapidly.
no code implementations • 24 May 2023 • Dongxu Yue, Qin Guo, Munan Ning, Jiaxi Cui, Yuesheng Zhu, Li Yuan
Despite the successful image reconstruction achieved by diffusion-based methods, there are still challenges in effectively manipulating fine-grained facial attributes with textual instructions. To address these issues and facilitate convenient manipulation of real facial images, we propose a novel approach that conducts text-driven image editing in the semantic latent space of a diffusion model.
1 code implementation • CVPR 2023 • Yuqing Wang, Yizhi Wang, Longhui Yu, Yuesheng Zhu, Zhouhui Lian
First, we adopt Transformers instead of RNNs to process sequential data and design a relaxation representation for vector outlines, markedly improving the model's capability and stability of synthesizing long and complex outlines.
no code implementations • 20 Nov 2022 • Jie Ruan, Yue Wu, Xiaojun Wan, Yuesheng Zhu
Sarcasm generation has been investigated in previous studies by considering it as a text-to-text generation problem, i.e., generating a sarcastic sentence for an input sentence.
2 code implementations • 29 Sep 2022 • Zhaokun Zhou, Yuesheng Zhu, Chao He, YaoWei Wang, Shuicheng Yan, Yonghong Tian, Li Yuan
Spikformer (66.3M parameters) with comparable size to SEW-ResNet-152 (60.2M, 69.26%) can achieve 74.81% top-1 accuracy on ImageNet using 4 time steps, which is the state-of-the-art in directly trained SNN models.
no code implementations • 3 Apr 2022 • Zhilin Huang, Chujun Qin, Zhenyu Weng, Yuesheng Zhu
Recent attention-based image inpainting methods have made inspiring progress by modeling long-range dependencies within a single image.
no code implementations • 23 Feb 2022 • Longhui Yu, Zhenyu Weng, Yuqing Wang, Yuesheng Zhu
However, distilling knowledge from two teacher models could result in the student model making some redundant predictions.
1 code implementation • 26 Jan 2022 • Wangbo Yu, Jinhao Du, Ruixin Liu, Yixuan Li, Yuesheng Zhu
Image inpainting approaches have achieved significant progress with the help of deep neural networks.
no code implementations • 5 Nov 2021 • Zhilin Huang, Chujun Qin, Ruixin Liu, Zhenyu Weng, Yuesheng Zhu
Recent works in image inpainting have shown that structural information plays an important role in recovering visually pleasing results.
no code implementations • 4 Jul 2021 • Zhenyu Weng, Yuesheng Zhu
In the proposed framework, the hash functions are fixed and a parametric similarity function for the binary codes is learnt online to adapt to the streaming data.
1 code implementation • 18 Sep 2020 • Zhenyu Weng, Yuesheng Zhu, Ruixin Liu
In this paper, a fast search algorithm is proposed to perform the non-exhaustive search for $K$ nearest binary codes by weighted Hamming distance.
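For reference, the exhaustive baseline that such a non-exhaustive search is meant to avoid can be sketched as below. This is a minimal illustration and not the algorithm proposed in the paper: it assumes codes are given as bit sequences and that the weighted Hamming distance is the sum of per-bit weights over the positions where two codes differ. The names `weighted_hamming` and `k_nearest` are hypothetical.

```python
import heapq
from typing import List, Sequence, Tuple


def weighted_hamming(query: Sequence[int], code: Sequence[int],
                     weights: Sequence[float]) -> float:
    """Sum the weights of the bit positions where query and code differ.

    Assumption: one non-negative weight per bit position.
    """
    return sum(w for q, c, w in zip(query, code, weights) if q != c)


def k_nearest(query: Sequence[int], codes: Sequence[Sequence[int]],
              weights: Sequence[float], k: int) -> List[Tuple[float, int]]:
    """Exhaustive baseline: score every code and keep the k smallest.

    Returns (distance, index) pairs sorted by increasing distance.
    """
    scored = ((weighted_hamming(query, code, weights), i)
              for i, code in enumerate(codes))
    return heapq.nsmallest(k, scored)


if __name__ == "__main__":
    codes = [[0, 0, 0, 0], [1, 0, 0, 0], [0, 0, 0, 1], [1, 1, 1, 1]]
    weights = [0.5, 1.0, 2.0, 4.0]
    print(k_nearest([0, 0, 0, 0], codes, weights, k=2))
```

The baseline touches every stored code for each query, which is the cost a non-exhaustive search tries to reduce.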
1 code implementation • 21 Nov 2019 • Zhenyu Weng, Yuesheng Zhu
In our method, based on the multi-index hash tables, two algorithms, the table bucket finding algorithm and the table merging algorithm, are proposed to select the nearest weighted binary codes of the query in a non-exhaustive and accurate way.
no code implementations • 20 Oct 2019 • Shuai Yang, Wenqi Zhu, Yuesheng Zhu
In the first stage, an affinity matrix is generated from data.
no code implementations • 12 Oct 2019 • Shuai Yang, Wenqi Zhu, Yuesheng Zhu
Subspace clustering aims to cluster unlabeled data that lies in a union of low-dimensional linear subspaces.
no code implementations • 2 May 2019 • Shuai Yang, Wenqi Zhu, Yuesheng Zhu
The affinity matrix is obtained in the first stage, then it goes through the second stage, where the proposed GBTO is applied to generate a reconstructed affinity matrix with more authentic similarity between data points.
no code implementations • 1 May 2019 • Wenqi Zhu, Yuesheng Zhu, Li Zhong, Shuai Yang
In this paper, we propose a noise-robust algorithm, Restricted Connection Orthogonal Matching Pursuit for Sparse Subspace Clustering (RCOMP-SSC), to improve the clustering accuracy and maintain the low computational time by restricting the number of connections of each data point during the iteration of OMP.
no code implementations • 5 Mar 2019 • Jiaqiyu Zhan, Zhiqiang Bai, Yuesheng Zhu
In our method, a parameter selection process is developed to adjust the parameters based on the data distribution for information representation.
no code implementations • CVPR 2016 • Guibo Luo, Yuesheng Zhu, Zhaotian Li, Liming Zhang
However, in the synthesis process, the background occluded by the foreground objects might be exposed in the new view, resulting in some holes in the synthesized video.