1 code implementation • 1 Dec 2023 • Bin Xiao, Murat Simsek, Burak Kantarci, Ala Abu Alkheir
However, existing detection-based models usually cannot perform as well as other types of solutions regarding cell-level TSR metrics, such as TEDS, and the underlying reasons limiting the performance of these models on the TSR task are also not well-explored.
1 code implementation • 30 May 2023 • Bin Xiao, Murat Simsek, Burak Kantarci, Ala Abu Alkheir
Table Detection (TD) is a fundamental task to enable visually rich document understanding, which requires the model to extract information without information loss.
no code implementations • 4 May 2023 • Bin Xiao, Murat Simsek, Burak Kantarci, Ala Abu Alkheir
Moreover, to enrich the data sources, we propose a new ICT-TD dataset using the PDF files of Information and Communication Technologies (ICT) commodities, a different domain containing unique samples that hardly appear in open datasets.
no code implementations • 3 Nov 2022 • Bin Xiao, Yakup Akkaya, Murat Simsek, Burak Kantarci, Ala Abu Alkheir
Table Structure Recognition (TSR) aims to represent tables with complex structures in a machine-interpretable format so that the tabular data can be processed automatically.
no code implementations • 11 Aug 2022 • Bin Xiao, Murat Simsek, Burak Kantarci, Ala Abu Alkheir
To transform the tabular data in electronic documents into a machine-interpretable format and provide layout and semantic information for information extraction and interpretation, we define a Table Structure Recognition (TSR) task and a Table Cell Type Classification (CTC) task.
no code implementations • 8 Mar 2022 • Bin Xiao, Murat Simsek, Burak Kantarci, Ala Abu Alkheir
Table Structure Recognition (TSR) problem aims to recognize the structure of a table and transform the unstructured tables into a structured and machine-readable format so that the tabular data can be further analysed by the down-stream tasks, such as semantic modeling and information retrieval.