no code implementations • 29 Mar 2024 • Jiayu Li, Xuechao Zou, Shiying Wang, Ben Chen, Junliang Xing, Pin Tao
Thus, we create the first large-scale cattle face recognition dataset, ICRWE, for wild environments.
no code implementations • 6 Mar 2024 • Kaidi Chen, Ben Chen, Dehong Gao, Huangyu Dai, Wen Jiang, Wei Ning, Shanqing Yu, Libin Yang, Xiaoyan Cai
Existing Neural Machine Translation (NMT) models mainly handle translation in the general domain, while overlooking domains with special writing formulas, such as e-commerce and legal documents.
1 code implementation • 1 Jan 2024 • Yifei Chen, Chenyan Zhang, Ben Chen, Yiyu Huang, Yifei Sun, Changmiao Wang, Xianjun Fu, Yuxing Dai, Feiwei Qin, Yong Peng, Yu Gao
To address these issues, this paper proposes an innovative method of leukocyte detection: the Multi-level Feature Fusion and Deformable Self-attention DETR (MFDS-DETR).
1 code implementation • 16 Aug 2023 • Ben Chen, Xuechao Zou, Kai Li, Yu Zhang, Junliang Xing, Pin Tao
Lake extraction from remote sensing imagery is a complex challenge due to the varied lake shapes and data noise.
1 code implementation • 8 Aug 2023 • Ben Chen, Xuechao Zou, Yu Zhang, Jiayu Li, Kai Li, Junliang Xing, Pin Tao
LEFormer contains three main modules: CNN encoder, Transformer encoder, and cross-encoder fusion.
no code implementations • 7 Jul 2023 • Ben Chen, Caihua Xiong, QuanLin Li, Zhonghua Wan
Accurate detection and localization of X-corner on both planar and non-planar patterns is a core step in robotics and machine vision.
no code implementations • 5 May 2023 • Ben Chen, Ke Guan, Danping He, Pengxiang Xie, Zhangdui Zhong, Jianwu Dou, Shahid Mumtaz, Wael Bazzi
In this paper, a three-dimensional (3D) stochastic model is proposed to characterize the THz scattering on rough surfaces.
no code implementations • 10 Feb 2023 • Ben Chen, Caihua Xiong, Qi Zhang
Aiming to improve the checkerboard corner detection robustness against the images with poor quality, such as lens distortion, extreme poses, and noise, we propose a novel detection algorithm which can maintain high accuracy on inputs under multiply scenarios without any prior knowledge of the checkerboard pattern.
no code implementations • 10 Feb 2023 • Ben Chen, Linbo Jin, Xinxin Wang, Dehong Gao, Wen Jiang, Wei Ning
Same-style products retrieval plays an important role in e-commerce platforms, aiming to identify the same products which may have different text descriptions or images.
no code implementations • 1 Jul 2021 • Qiwei Zhong, Guanxiong Zeng, Danqing Zhu, Yang Zhang, Wangli Lin, Ben Chen, Jiayu Tang
In this paper, we consider both the domain agnostic and specific knowledge, and propose a Hierarchical Dual-path BERT method coined hdBERT to capture the general fine-grained and high-level specific representations for acronym disambiguation.
1 code implementation • CVPR 2021 • Mingchen Zhuge, Dehong Gao, Deng-Ping Fan, Linbo Jin, Ben Chen, Haoming Zhou, Minghui Qiu, Ling Shao
We present a new vision-language (VL) pre-training model dubbed Kaleido-BERT, which introduces a novel kaleido strategy for fashion cross-modality representations from transformers.
no code implementations • 18 Jan 2021 • Yong Huang, Ben Chen, Daiming Qu
Eye gaze estimation has become increasingly significant in computer vision. In this paper, we systematically study the mainstream of eye gaze estimation methods, propose a novel methodology to estimate eye gaze points and eye gaze directions simultaneously. First, we construct a local sharing network for feature extraction of gaze points and gaze directions estimation, which can reduce network computational parameters and converge quickly;Second, we propose a Multiview Multitask Learning (MTL) framework, for gaze directions, a coplanar constraint is proposed for the left and right eyes, for gaze points, three views data input indirectly introduces eye position information, a cross-view pooling module is designed, propose joint loss which handle both gaze points and gaze directions estimation. Eventually, we collect a dataset to use of gaze points, which have three views to exist public dataset. The experiment show our method is state-of-the-art the current mainstream methods on two indicators of gaze points and gaze directions.
no code implementations • 14 Jan 2021 • Ben Chen, Bin Chen, Dehong Gao, Qijin Chen, Chengfu Huo, Xiaonan Meng, Weijun Ren, Yang Zhou
However, universal language models may perform weakly in these fake news detection for lack of large-scale annotated data and sufficient semantic understanding of domain-specific knowledge.
3 code implementations • 20 May 2020 • Dehong Gao, Linbo Jin, Ben Chen, Minghui Qiu, Peng Li, Yi Wei, Yi Hu, Hao Wang
In this paper, we address the text and image matching in cross-modal retrieval of the fashion industry.