Search Results for author: Madison May

Found 2 papers, 1 papers with code

RealKIE: Five Novel Datasets for Enterprise Key Information Extraction

no code implementations29 Mar 2024 Benjamin Townsend, Madison May, Christopher Wells

We introduce RealKIE, a benchmark of five challenging datasets aimed at advancing key information extraction methods, with an emphasis on enterprise applications.

Key Information Extraction Optical Character Recognition (OCR)

Doc2Dict: Information Extraction as Text Generation

1 code implementation16 May 2021 Benjamin Townsend, Eamon Ito-Fisher, Lily Zhang, Madison May

Typically, information extraction (IE) requires a pipeline approach: first, a sequence labeling model is trained on manually annotated documents to extract relevant spans; then, when a new document arrives, a model predicts spans which are then post-processed and standardized to convert the information into a database entry.

Language Modelling Text Generation

Cannot find the paper you are looking for? You can Submit a new open access paper.