434 datasets found

Language: Czech

Filter Results
  • NameTag 2 Models (2021-09-16)

    NER models for NameTag 2, named entity recognition tool, for English, German, Dutch, Spanish and Czech. Model documentation including performance can be found here:...
  • CzeDLex 1.0

    CzeDLex 1.0 is the first production version (the fourth development version) of the Lexicon of Czech discourse connectives. The lexicon contains connectives partially...
  • THEaiTRobot 2.0

    The THEaiTRobot 2.0 tool allows the user to interactively generate scripts for individual theatre play scenes. The previous version of the tool...
  • MorfoCzech

    A dictionary of morphologically segmented word forms in Czech. Rules of manual segmentation are described in Pelegrinová, K., Mačutek, J., Čech, R. (2021). The Menzerath-Altmann...
  • RobeCzech Base

    RobeCzech is a monolingual RoBERTa language representation model trained on Czech data. RoBERTa is a robustly optimized Transformer-based pretraining approach. We show that...
  • Facebook Data for Sentiment Analysis

    Corpus consisting of 10,000 Facebook posts manually annotated on sentiment (2,587 positive, 5,174 neutral, 1,991 negative and 248 bipolar posts). The archive contains data and...
  • DeriNet 1.6 (2018-09-24)

    DeriNet is a lexical network which models derivational relations in the lexicon of Czech. Nodes of the network correspond to Czech lexemes, while edges represent derivational...
  • ParaCrawl Corpus version 1.0

    The January 2018 release of the ParaCrawl is the first version of the corpus. It contains parallel corpora for 11 languages paired with English, crawled from a large number of...
  • MorfFlex CZ 160310

    Czech morphological dictionary developed originally by Jan Hajič as a spelling checker and lemmatization dictionary. Currently it contains full morphological information for...
  • Semantic Features and Their Role In Conceptual Representation In School Age C...

    Language acquisition is one of the currently much discussed topics in the field of psycholinguistics. Considerable space for future research can be seen in the development of...
  • Multiword expressions in the Prague Dependency Treebank 2.0

    This dataset adds annotation of multiword expressions and multiword named entities to the original PDT 2.0 data. The annotation is stand-off, stored in the same PML format as...
  • Artificial Treebank with Ellipsis

    Artificially created treebank of elliptical constructions (gapping), in the annotation style of Universal Dependencies. Data taken from UD 2.1 release, and from large web...
  • Czech Models (MorfFlex CZ 2.0 + PDT-C 1.0) for MorphoDiTa 220710

    Czech models for MorphoDiTa, providing morphological analysis, morphological generation and part-of-speech tagging. The morphological dictionary is created from MorfFlex CZ 2.0,...
  • Automatic Paraphrases of Czech Reference Sentences for WMT11, 13 and 14

    This dataset contains automatic paraphrases of Czech official reference translations for the Workshop on Statistical Machine Translation shared task. The data covers the years...
  • AKCES 2 ver. 2

    Corpus AKCES 2 ver. 2 consists of full, unabridged trancripts of recordings of classes at Czech elementary and secondary schools (AKCES/CLAC - Czech Language Acquisition...
  • Czech Models (MorfFlex CZ 161115 + PDT 3.0) for MorphoDiTa 161115

    Czech models for MorphoDiTa, providing morphological analysis, morphological generation and part-of-speech tagging. The morphological dictionary is created from MorfFlex CZ...
  • Czech Verbal MWEs

    Lexicon of Czech verbal multiword expressions (VMWEs) used in Parseme Shared Task 2017....
  • Universal Dependencies 2.9

    Universal Dependencies is a project that seeks to develop cross-linguistically consistent treebank annotation for many languages, with the goal of facilitating multilingual...
  • Moses Web Demo

    An interactive web demo of selected ÚFAL MT systems.
  • English-Czech Corpus from Wikipedia

    Sentence-parallel corpus made from English and Czech Wikipedias based on translated articles from English into Czech. The work done is described in the paper: ŠTROMAJEROVÁ,...