-
Czech Malach Cross-lingual Speech Retrieval Test Collection
The package contains Czech recordings of the Visual History Archive which consists of the interviews with the Holocaust survivors. The archive consists of audio recordings, four... -
PDT-Vallex: Czech Valency lexicon linked to treebanks
The valency lexicon PDT-Vallex has been built in close connection with the annotation of the Prague Dependency Treebank project (PDT) and its successors (mainly the Prague... -
Corpus of contemporary blogs
In NLP Centre, dividing text into sentences is currently done with a tool which uses rule-based system. In order to make enough training data for machine learning, annotators... -
Annotate
Annotate is a web and desktop application that should simplify the process of transforming photos of manuscripts to a browsable collection. It also allows users to annotate... -
SiR 1.0
SiR 1.0 is a corpus of Czech articles published on iRozhlas, a news server of a Czech public radio (https://www.irozhlas.cz/). It is a collection of 1 718 articles (42 890... -
TrEd
Tree Editor TrEd is a fully customizable and programmable graphical editor and viewer for tree-like structures. Among other projects, it was used as the main annotation tool for... -
Quality and Efficiency of Manual Annotation: Data from the Pre-annotation Bia...
Input data, individual experimental annotations, and a complete and detailed overview of the measured results related to the experiment described in the referenced paper. -
Czech Court Decisions Dataset
We present the Czech Court Decisions Dataset (CCDD) -- a dataset of 300 manually annotated court decisions published by The Supreme Court of the Czech Republic and the... -
PiRATE: a Pipeline to Retrieve and Annotate Transposable Elements
To date, genome assembly of non-model organisms is usually not at chromosomal level and higly fragmented. This fragmentation is recognized to be, in part, the result of a bad... -
Transkriptionskonventionen im Vergleich
Synopsis of transcription conventions used in six international sign language research projects including annotation tool and tiers in transcripts, divided into conventional... -
Die Erstellung von Fachgebärdenlexika am Institut für Deutsche Gebärdensprach...
Detailed description of how six corpus-based LSP dictionaries German – German Sign Language (DGS) were produced including elicitation methods, annotation and... -
Quickstart: Annotation in the EXMARaLDA Partitur Editor
A quickstart introduction into annotation in the EXMARaLDA Partitur Editor