1 code implementation • 1 Apr 2024 • Jing Hao, Lei He, Kuo Feng Hung
To address this issue, we propose T-Mamba, integrating shared positional encoding and frequency-based features into vision mamba, to address limitations in spatial position preservation and feature enhancement in frequency domain.
1 code implementation • 27 Jan 2024 • Jing Hao, Moyun Liu, Kuo Feng Hung
To segment glass surfaces with higher accuracy, we make full use of two visual foundation models: Segment Anything (SAM) and Stable Diffusion. Specifically, we devise a simple glass surface segmentor named GEM, which only consists of a SAM backbone, a simple feature pyramid, a discerning query selection module, and a mask decoder.
1 code implementation • 22 Jul 2023 • Jing Hao, Moyun Liu, Jinrong Yang, Kuo Feng Hung
Comprehensive experiments are conducted on the large-scale glass segmentation dataset GSD-S. Our GEM establishes a new state-of-the-art performance with the help of these two VFMs, surpassing the best-reported method GlassSemNet with an IoU improvement of 2. 1%.