-
CALEM (Comprehensive Arabic LEMmas)
Comprehensive Arabic LEMmas is a lexicon covering a large list of Arabic lemmas and their corresponding inflected word forms (stems) with details (POS + Root). Each lexical... -
Prague Dependency Treebank - Consolidated 1.0 (PDT-C 1.0)
A richly annotated and genre-diversified language resource, The Prague Dependency Treebank – Consolidated 1.0 (PDT-C 1.0, or PDT-C in short in the sequel) is a consolidated... -
The Diorisis Ancient Greek Corpus
An annotated corpus of literary Ancient Greek sourced from the Perseus Canonical Greek Lit repository (https://github.com/PerseusDL/canonical-greekLit), “The Little Sailing”... -
Universal Dependencies 2.3 Models for UDPipe (2018-11-15)
Tokenizer, POS Tagger, Lemmatizer and Parser models for 84 treebanks of 56 languages of Universal Depenencies 2.3 Treebanks, created solely using UD 2.3 data... -
UDPipe 2
UDPipe 2 is a POS tagger, lemmatizer and dependency parser. Compared to UDPipe 1: UDPipe 2 is Python-only and tested only in Linux, UDPipe 2 is meant as a research tool,... -
CoNLL 2018 Shared Task - UDPipe Baseline Models and Supplementary Materials
Baseline UDPipe models for CoNLL 2018 Shared Task in UD Parsing, and supplementary material. The models require UDPipe version at least 1.2 and are evaluated using the official... -
Universal Dependencies 1.2 Models for UDPipe
Tokenizer, POS Tagger, Lemmatizer and Parser models for all Universal Depenencies 1.2 Treebanks, created solely using UD 1.2 data (http://hdl.handle.net/11234/1-1548). To use... -
POS Tagging and Lemmatization (Czech model)
Model trained for Czech POS Tagging and Lemmatization using Czech version of BERT model, RobeCzech. Model is trained on data from Prague Dependency Treebank 3.5. Model is a part... -
Czech Verbal MWEs
Lexicon of Czech verbal multiword expressions (VMWEs) used in Parseme Shared Task 2017....