Search Results for author: Ben Chen

Found 14 papers, 5 papers with code

A Parallel Attention Network for Cattle Face Recognition

no code implementations • 29 Mar 2024 • Jiayu Li, Xuechao Zou, Shiying Wang, Ben Chen, Junliang Xing, Pin Tao

Thus, we create the first large-scale cattle face recognition dataset, ICRWE, for wild environments.

Paper
Add Code

General2Specialized LLMs Translation for E-commerce

no code implementations • 6 Mar 2024 • Kaidi Chen, Ben Chen, Dehong Gao, Huangyu Dai, Wen Jiang, Wei Ning, Shanqing Yu, Libin Yang, Xiaoyan Cai

Existing Neural Machine Translation (NMT) models mainly handle translation in the general domain, while overlooking domains with special writing formulas, such as e-commerce and legal documents.

Machine Translation NMT +1

Paper
Add Code

Accurate Leukocyte Detection Based on Deformable-DETR and Multi-Level Feature Fusion for Aiding Diagnosis of Blood Diseases

1 code implementation • 1 Jan 2024 • Yifei Chen, Chenyan Zhang, Ben Chen, Yiyu Huang, Yifei Sun, Changmiao Wang, Xianjun Fu, Yuxing Dai, Feiwei Qin, Yong Peng, Yu Gao

To address these issues, this paper proposes an innovative method of leukocyte detection: the Multi-level Feature Fusion and Deformable Self-attention DETR (MFDS-DETR).

Paper
Code

High-Fidelity Lake Extraction via Two-Stage Prompt Enhancement: Establishing a Novel Baseline and Benchmark

1 code implementation • 16 Aug 2023 • Ben Chen, Xuechao Zou, Kai Li, Yu Zhang, Junliang Xing, Pin Tao

Lake extraction from remote sensing imagery is a complex challenge due to the varied lake shapes and data noise.

Decoder

Paper
Code

LEFormer: A Hybrid CNN-Transformer Architecture for Accurate Lake Extraction from Remote Sensing Imagery

1 code implementation • 8 Aug 2023 • Ben Chen, Xuechao Zou, Yu Zhang, Jiayu Li, Kai Li, Junliang Xing, Pin Tao

LEFormer contains three main modules: CNN encoder, Transformer encoder, and cross-encoder fusion.

Paper
Code

RCDN -- Robust X-Corner Detection Algorithm based on Advanced CNN Model

no code implementations • 7 Jul 2023 • Ben Chen, Caihua Xiong, QuanLin Li, Zhonghua Wan

Accurate detection and localization of X-corner on both planar and non-planar patterns is a core step in robotics and machine vision.

Camera Calibration Pose Estimation

Paper
Add Code

A 3D Modeling Method for Scattering on Rough Surfaces at the Terahertz Band

no code implementations • 5 May 2023 • Ben Chen, Ke Guan, Danping He, Pengxiang Xie, Zhangdui Zhong, Jianwu Dou, Shahid Mumtaz, Wael Bazzi

In this paper, a three-dimensional (3D) stochastic model is proposed to characterize the THz scattering on rough surfaces.

Paper
Add Code

CCDN: Checkerboard Corner Detection Network for Robust Camera Calibration

no code implementations • 10 Feb 2023 • Ben Chen, Caihua Xiong, Qi Zhang

Aiming to improve the checkerboard corner detection robustness against the images with poor quality, such as lens distortion, extreme poses, and noise, we propose a novel detection algorithm which can maintain high accuracy on inputs under multiply scenarios without any prior knowledge of the checkerboard pattern.

Camera Calibration

Paper
Add Code

Unified Vision-Language Representation Modeling for E-Commerce Same-Style Products Retrieval

no code implementations • 10 Feb 2023 • Ben Chen, Linbo Jin, Xinxin Wang, Dehong Gao, Wen Jiang, Wei Ning

Same-style products retrieval plays an important role in e-commerce platforms, aiming to identify the same products which may have different text descriptions or images.

Attribute Language Modelling +3

Paper
Add Code

Leveraging Domain Agnostic and Specific Knowledge for Acronym Disambiguation

no code implementations • 1 Jul 2021 • Qiwei Zhong, Guanxiong Zeng, Danqing Zhu, Yang Zhang, Wangli Lin, Ben Chen, Jiayu Tang

In this paper, we consider both the domain agnostic and specific knowledge, and propose a Hierarchical Dual-path BERT method coined hdBERT to capture the general fine-grained and high-level specific representations for acronym disambiguation.

document understanding Word Embeddings

Paper
Add Code

Kaleido-BERT: Vision-Language Pre-training on Fashion Domain

1 code implementation • CVPR 2021 • Mingchen Zhuge, Dehong Gao, Deng-Ping Fan, Linbo Jin, Ben Chen, Haoming Zhou, Minghui Qiu, Ling Shao

We present a new vision-language (VL) pre-training model dubbed Kaleido-BERT, which introduces a novel kaleido strategy for fashion cross-modality representations from transformers.

Image Retrieval Retrieval +1

264

Paper
Code

LNSMM: Eye Gaze Estimation With Local Network Share Multiview Multitask

no code implementations • 18 Jan 2021 • Yong Huang, Ben Chen, Daiming Qu

Eye gaze estimation has become increasingly significant in computer vision. In this paper, we systematically study the mainstream of eye gaze estimation methods, propose a novel methodology to estimate eye gaze points and eye gaze directions simultaneously. First, we construct a local sharing network for feature extraction of gaze points and gaze directions estimation, which can reduce network computational parameters and converge quickly;Second, we propose a Multiview Multitask Learning (MTL) framework, for gaze directions, a coplanar constraint is proposed for the left and right eyes, for gaze points, three views data input indirectly introduces eye position information, a cross-view pooling module is designed, propose joint loss which handle both gaze points and gaze directions estimation. Eventually, we collect a dataset to use of gaze points, which have three views to exist public dataset. The experiment show our method is state-of-the-art the current mainstream methods on two indicators of gaze points and gaze directions.

Gaze Estimation

Paper
Add Code

Transformer-based Language Model Fine-tuning Methods for COVID-19 Fake News Detection

no code implementations • 14 Jan 2021 • Ben Chen, Bin Chen, Dehong Gao, Qijin Chen, Chengfu Huo, Xiaonan Meng, Weijun Ren, Yang Zhou

However, universal language models may perform weakly in these fake news detection for lack of large-scale annotated data and sufficient semantic understanding of domain-specific knowledge.

Fake News Detection Language Modelling

Paper
Add Code

FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal Retrieval

3 code implementations • 20 May 2020 • Dehong Gao, Linbo Jin, Ben Chen, Minghui Qiu, Peng Li, Yi Wei, Yi Hu, Hao Wang

In this paper, we address the text and image matching in cross-modal retrieval of the fashion industry.

Cross-Modal Retrieval Retrieval

1,959

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.