Search Results for author: Noam Ordan

Found 11 papers, 1 papers with code

The Knesset Corpus: An Annotated Corpus of Hebrew Parliamentary Proceedings

no code implementations • 28 May 2024 • Gili Goldin, Nick Howell, Noam Ordan, Ella Rabinovich, Shuly Wintner

We present the Knesset Corpus, a corpus of Hebrew parliamentary proceedings containing over 30 million sentences (over 384 million tokens) from all the (plenary and committee) protocols held in the Israeli parliament between 1998 and 2022.

Paper
Add Code

A Second Wave of UD Hebrew Treebanking and Cross-Domain Parsing

2 code implementations • 14 Oct 2022 • Amir Zeldes, Nick Howell, Noam Ordan, Yifat Ben Moshe

Foundational Hebrew NLP tasks such as segmentation, tagging and parsing, have relied to date on various versions of the Hebrew Treebank (HTB, Sima'an et al. 2001).

Language Modelling

Paper
Code

Found in Translation: Reconstructing Phylogenetic Language Trees from Translations

no code implementations • ACL 2017 • Ella Rabinovich, Noam Ordan, Shuly Wintner

Translation has played an important role in trade, law, commerce, politics, and literature for thousands of years.

Translation

Paper
Add Code

On the Similarities Between Native, Non-native and Translated Texts

no code implementations • ACL 2016 • Ella Rabinovich, Sergiu Nisioi, Noam Ordan, Shuly Wintner

We present a computational analysis of three language varieties: native, advanced non-native, and translation.

Translation

Paper
Add Code

Statistical Machine Translation with Automatic Identification of Translationese

no code implementations • WS 2015 • Naama Twitto, Noam Ordan, Shuly Wintner

Language Modelling Machine Translation +2

Paper
Add Code

USAAR-CHRONOS: Crawling the Web for Temporal Annotations

no code implementations • SEMEVAL 2015 • Liling Tan, Noam Ordan

Paper
Add Code

Data Mining with Shallow vs. Linguistic Features to Study Diversification of Scientific Registers

no code implementations • LREC 2014 • Stefania Degaetano-Ortlieb, Peter Fankhauser, Hannah Kermes, Ekaterina Lapshinova-Koltunski, Noam Ordan, Elke Teich

We present a methodology to analyze the linguistic evolution of scientific registers with data mining techniques, comparing the insights gained from shallow vs. linguistic features.

Text Categorization