-
CEN
Corpus of Economic News (CEN) contains 797 documents from Polish Wikipedia annotated with 65 categories of proper names in ccl format.... -
Polish Corpus of Wrocław University of Technology 1.3 Korpus Języka Polskieg...
KPWr (Polish Corpus of Wrocław University of Technology, pol. Korpus Języka Polskiego Politechniki Wrocławskiej) is a corpus of written and spoken documents available on the... -
Polish Corpus of Wrocław University of Technology 1.2 Korpus Języka Polskieg...
KPWr (Polish Corpus of Wrocław University of Technology, pol. Korpus Języka Polskiego Politechniki Wrocławskiej) is a corpus of written and spoken documents available on the... -
High-Coverage Multi-Level Text Corpus for Non-Professional Voice Conservation
This text corpus contains a carefully optimized set of sentences that could be used in the process of preparing a speech corpus for the development of personalized... -
Amharic Web Corpus
Amharic web corpus. Crawled by SpiderLing in August 2013 and October 2015 and January 2016. Encoded in UTF-8, cleaned, deduplicated. Tagged by TreeTagger trained on Amharic WIC...