Paper tables with annotated results for The Danish Gigaword Project

Paper

The Danish Gigaword Project

Danish language technology has been hindered by a lack of broad-coverage corpora at the scale modern NLP prefers. This paper describes the Danish Gigaword Corpus, the result of a focused effort to provide a diverse and freely-available one billion word corpus of Danish text. The Danish Gigaword corpus covers a wide array of time periods, domains, speakers' socio-economic status, and Danish dialects.

PDF Paper record

Results in Papers With Code

(↓ scroll down to see all results)

The Danish Gigaword Project

Reader Guidelines

Editor Guidelines