no code implementations • 28 May 2024 • Gili Goldin, Nick Howell, Noam Ordan, Ella Rabinovich, Shuly Wintner
We present the Knesset Corpus, a corpus of Hebrew parliamentary proceedings containing over 30 million sentences (over 384 million tokens) from all the (plenary and committee) protocols held in the Israeli parliament between 1998 and 2022.
2 code implementations • 14 Oct 2022 • Amir Zeldes, Nick Howell, Noam Ordan, Yifat Ben Moshe
Foundational Hebrew NLP tasks such as segmentation, tagging and parsing, have relied to date on various versions of the Hebrew Treebank (HTB, Sima'an et al. 2001).
no code implementations • ACL 2017 • Ella Rabinovich, Noam Ordan, Shuly Wintner
Translation has played an important role in trade, law, commerce, politics, and literature for thousands of years.
no code implementations • ACL 2016 • Ella Rabinovich, Sergiu Nisioi, Noam Ordan, Shuly Wintner
We present a computational analysis of three language varieties: native, advanced non-native, and translation.
no code implementations • LREC 2014 • Stefania Degaetano-Ortlieb, Peter Fankhauser, Hannah Kermes, Ekaterina Lapshinova-Koltunski, Noam Ordan, Elke Teich
We present a methodology to analyze the linguistic evolution of scientific registers with data mining techniques, comparing the insights gained from shallow vs. linguistic features.