TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Scene Text Recognition	CUTE80	DPAN	Accuracy	91.9	# 17
Scene Text Recognition	ICDAR2013	DPAN	Accuracy	97.7	# 13
Scene Text Recognition	ICDAR2015	DPAN	Accuracy	85.5	# 14
Scene Text Recognition	IIIT5k	DPAN	Accuracy	96.2	# 17
Scene Text Recognition	SVT	DPAN	Accuracy	93.9	# 18
Scene Text Recognition	SVTP	DPAN	Accuracy	89.0	# 17

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/look-back-again-dual-parallel-attention/scene-text-recognition-on-icdar2013)](https://paperswithcode.com/sota/scene-text-recognition-on-icdar2013?p=look-back-again-dual-parallel-attention)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/look-back-again-dual-parallel-attention/scene-text-recognition-on-icdar2015)](https://paperswithcode.com/sota/scene-text-recognition-on-icdar2015?p=look-back-again-dual-parallel-attention)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/look-back-again-dual-parallel-attention/scene-text-recognition-on-cute80)](https://paperswithcode.com/sota/scene-text-recognition-on-cute80?p=look-back-again-dual-parallel-attention)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/look-back-again-dual-parallel-attention/scene-text-recognition-on-iiit5k)](https://paperswithcode.com/sota/scene-text-recognition-on-iiit5k?p=look-back-again-dual-parallel-attention)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/look-back-again-dual-parallel-attention/scene-text-recognition-on-svtp)](https://paperswithcode.com/sota/scene-text-recognition-on-svtp?p=look-back-again-dual-parallel-attention)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/look-back-again-dual-parallel-attention/scene-text-recognition-on-svt)](https://paperswithcode.com/sota/scene-text-recognition-on-svt?p=look-back-again-dual-parallel-attention)`

Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition

ICMR 2021 · Zilong Fu, Guoqing Jin, Hongtao Xie, Junbo Guo ·

Nowadays, it is a trend that using a parallel-decoupled encoderdecoder (PDED) framework in scene text recognition for its flexibility and efficiency. However, due to the inconsistent information content between queries and keys in the parallel positional attention module (PPAM) used in this kind of framework(queries: position information, keys: context and position information), visual misalignment tends to appear when confronting hard samples(e.g., blurred texts, irregular texts, or low-quality images). To tackle this issue, in this paper, we propose a dual parallel attention network (DPAN), in which a newly designed parallel context attention module (PCAM) is cascaded with the original PPAM, using linguistic contextual information to compensate for the information inconsistency between queries and keys. Specifically, in PCAM, we take the visual features from PPAM as inputs and present a bidirectional language model to enhance them with linguistic contexts to produce queries. In this way, we make the information content of the queries and keys consistent in PCAM, which helps to generate more precise visual glimpses to improve the entire PDED framework’s accuracy and robustness. Experimental results verify the effectiveness of the proposed PCAM, showing the necessity of keeping the information consistency between queries and keys in the attention mechanism. On six benchmarks, including regular text and irregular text, the performance of DPAN surpasses the existing leading methods by large margins, achieving new state-of-the-art performance. The code is available on https://github.com/Jackandrome/DPAN.

PDF Abstract

Code

Add Remove Mark official

Jackandrome/DPAN official

siddagra/DPAN-look-back-Again-Dual-…

Tasks

Add Remove

Language Modelling

Position

Scene Text Recognition

Datasets

WikiText-2

WikiText-103

ICDAR 2013

SVT CUTE80

IIIT5k SVTP

Results from the Paper

Add Remove

Ranked #13 on Scene Text Recognition on ICDAR2013

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Scene Text Recognition	CUTE80	DPAN	Accuracy	91.9	# 17	Compare
Scene Text Recognition	ICDAR2013	DPAN	Accuracy	97.7	# 13	Compare
Scene Text Recognition	ICDAR2015	DPAN	Accuracy	85.5	# 14	Compare
Scene Text Recognition	IIIT5k	DPAN	Accuracy	96.2	# 17	Compare
Scene Text Recognition	SVT	DPAN	Accuracy	93.9	# 18	Compare
Scene Text Recognition	SVTP	DPAN	Accuracy	89.0	# 17	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove