-
Corpus for the epidemiomonitoring of plant
The corpus is the collection of 165 documents on plant health to which the manual annotations of the 'Training and development dataset for information extraction in plant... -
CEN
Corpus of Economic News (CEN) contains 797 documents from Polish Wikipedia annotated with 65 categories of proper names in ccl format.... -
Polish Corpus of Wrocław University of Technology 1.3 Korpus Języka Polskieg...
KPWr (Polish Corpus of Wrocław University of Technology, pol. Korpus Języka Polskiego Politechniki Wrocławskiej) is a corpus of written and spoken documents available on the... -
Polish Corpus of Wrocław University of Technology 1.2 Korpus Języka Polskieg...
KPWr (Polish Corpus of Wrocław University of Technology, pol. Korpus Języka Polskiego Politechniki Wrocławskiej) is a corpus of written and spoken documents available on the... -
Amharic Web Corpus
Amharic web corpus. Crawled by SpiderLing in August 2013 and October 2015 and January 2016. Encoded in UTF-8, cleaned, deduplicated. Tagged by TreeTagger trained on Amharic WIC... -
High-Coverage Multi-Level Text Corpus for Non-Professional Voice Conservation
This text corpus contains a carefully optimized set of sentences that could be used in the process of preparing a speech corpus for the development of personalized... -
INEL Nenets Corpus
Corpus Citation Budzisch, Josefina; Wagner-Nagy, Beáta. 2024. INEL Nenets Corpus. Version 1.0. Publication date 2024-12-31.... -
INEL Enets Corpus
Corpus Citation Shluinsky, Andrey; Khanina, Olesya; Wagner-Nagy, Beáta. 2024. INEL Enets Corpus. Version 1.0. Publication date 2024-11-30....