no code implementations • 19 Apr 2024 • Danqing Ma, Meng Wang, Ao Xiang, Zongqing Qi, Qin Yang
This study proposes a multi-modal fusion framework Multitrans based on the Transformer architecture and self-attention mechanism.
no code implementations • 13 Mar 2024 • Ao Xiang, Zongqing Qi, Han Wang, Qin Yang, Danqing Ma
This paper introduces a new multi-modal model based on the Transformer architecture and tensor product fusion strategy, combining BERT's text vectors and ViT's image vectors to classify students' psychological conditions, with an accuracy of 93. 65%.
no code implementations • 13 Mar 2024 • Zongqing Qi, Danqing Ma, Jingyu Xu, Ao Xiang, Hedi Qu
In recent years, there have been frequent incidents of foreign objects intruding into railway and Airport runways.