5 Jan 2024 • Ikumi Okubo, Keisuke Sugiura, Hiroki Matsutani
To mitigate the computational complexity, a hybrid approach has recently been proposed that uses ResNet as the backbone architecture and replaces some of its convolution layers with a Multi-Head Self-Attention (MHSA) mechanism.
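As a rough illustration of the hybrid idea, the sketch below applies multi-head self-attention to a convolutional feature map in place of a spatial convolution, inside a ResNet-style residual connection. It is a minimal NumPy sketch under stated assumptions, not the implementation from the paper: the function names `mhsa_2d` and `hybrid_block`, the projection matrices `w_q`, `w_k`, `w_v`, and the tensor sizes are all hypothetical, and details such as positional encodings, normalization, and the surrounding 1x1 convolutions are omitted.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def mhsa_2d(x, w_q, w_k, w_v, num_heads):
    """Multi-head self-attention over a feature map.

    x: feature map of shape (C, H, W).
    w_q, w_k, w_v: hypothetical projection matrices of shape (C, C).
    Returns an array with the same shape as x.
    """
    c, h, w = x.shape
    assert c % num_heads == 0, "channels must be divisible by num_heads"
    d = c // num_heads
    n = h * w

    # Treat every spatial position as one token with C features: (N, C).
    tokens = x.reshape(c, n).T

    def split_heads(t):
        # (N, C) -> (heads, N, d)
        return t.reshape(n, num_heads, d).transpose(1, 0, 2)

    q = split_heads(tokens @ w_q)
    k = split_heads(tokens @ w_k)
    v = split_heads(tokens @ w_v)

    # Scaled dot-product attention per head: (heads, N, N).
    attn = softmax(q @ k.transpose(0, 2, 1) / np.sqrt(d), axis=-1)
    out = attn @ v  # (heads, N, d)

    # Merge heads and restore the (C, H, W) layout.
    out = out.transpose(1, 0, 2).reshape(n, c)
    return out.T.reshape(c, h, w)


def hybrid_block(x, w_q, w_k, w_v, num_heads=4):
    # ResNet-style skip connection around the attention layer, which
    # stands in for the convolution that a plain ResNet block would use.
    return x + mhsa_2d(x, w_q, w_k, w_v, num_heads)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feat = rng.standard_normal((8, 4, 4))  # assumed (C, H, W) sizes
    wq, wk, wv = (rng.standard_normal((8, 8)) * 0.1 for _ in range(3))
    print(hybrid_block(feat, wq, wk, wv, num_heads=4).shape)
```

The attention matrix has N x N entries per head, with N = H x W, so its cost grows quadratically with the spatial size of the feature map. This is one plausible reason to swap in attention only for some layers, typically where the feature map is already small, rather than for the whole network.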