Printed Text Recognition
1 paper with code • 3 benchmarks • 1 dataset
Most implemented papers
UTRNet: High-Resolution Urdu Text Recognition In Printed Documents
To address the limitations of previous works, which struggle to generalize to the intricacies of the Urdu script and suffer from a lack of sufficient annotated real-world data, we introduce UTRSet-Real, a large-scale annotated real-world dataset comprising over 11,000 lines, and UTRSet-Synth, a synthetic dataset of 20,000 lines closely resembling real-world data. We have also corrected the ground truth of the existing IIITH dataset, making it a more reliable resource for future research.