no code implementations • EMNLP 2021 • Harsh Gupta, Luciano del Corro, Samuel Broscheit, Johannes Hoffart, Eliot Brenner
We investigate post-OCR correction in a setting where we have access to different OCR views of the same document.
no code implementations • 18 Nov 2023 • Arindam Mitra, Luciano del Corro, Shweti Mahajan, Andres Codas, Clarisse Simoes, Sahaj Agarwal, Xuxi Chen, Anastasia Razdaibiedina, Erik Jones, Kriti Aggarwal, Hamid Palangi, Guoqing Zheng, Corby Rosset, Hamed Khanpour, Ahmed Awadallah
Research on training small LMs has often relied on imitation learning to replicate the output of more capable models.
Ranked #1 on Crass AI on BIG-bench
1 code implementation • 3 Oct 2023 • Canwen Xu, Corby Rosset, Ethan C. Chau, Luciano del Corro, Shweti Mahajan, Julian McAuley, Jennifer Neville, Ahmed Hassan Awadallah, Nikhil Rao
Remarkably, our automatic contrastive post-training further improves the performance of Orca, already a state-of-the-art instruction learning model tuned with GPT-4 outputs, to outperform ChatGPT.
no code implementations • 5 Jul 2023 • Luciano del Corro, Allie Del Giorno, Sahaj Agarwal, Bin Yu, Ahmed Awadallah, Subhabrata Mukherjee
While existing token-level early exit methods show promising results for online inference, they cannot be readily applied for batch inferencing and Key-Value caching.
no code implementations • EMNLP (ECONLP) 2021 • Luciano Del Corro, Johannes Hoffart
We present a method to automatically identify financially relevant news using stock price movements and news headlines as input.
1 code implementation • EMNLP 2018 • Marco Ponza, Luciano del Corro, Gerhard Weikum
This work introduces fact salience: The task of generating a machine-readable representation of the most prominent information in a text document as a set of facts.
no code implementations • ACL 2018 • Prabal Agarwal, Jannik Str{\"o}tgen, Luciano del Corro, Johannes Hoffart, Gerhard Weikum
Named Entity Disambiguation (NED) systems perform well on news articles and other texts covering a specific time interval.
no code implementations • ACL 2018 • Dominic Seyler, Tatiana Dembelova, Luciano del Corro, Johannes Hoffart, Gerhard Weikum
In this work, we discuss the importance of external knowledge for performing Named Entity Recognition (NER).
no code implementations • 11 Sep 2017 • Dominic Seyler, Tatiana Dembelova, Luciano del Corro, Johannes Hoffart, Gerhard Weikum
KnowNER is a multilingual Named Entity Recognition (NER) system that leverages different degrees of external knowledge.
Multilingual Named Entity Recognition named-entity-recognition +2
1 code implementation • EMNLP 2017 • Kiril Gashteovski, Rainer Gemulla, Luciano del Corro
The goal of Open Information Extraction (OIE) is to extract surface relations and their arguments from natural-language text in an unsupervised, domain-independent manner.