Polish-Russian Parallel Corpus
Polish-Russian Parallel Corpus -
Chunker WS
Chunker-WS provides shallow parsing of Polish. The parser can be run on plain text (input format: text, in which case WCRFT is run first for tagging) or on already tagged input (other input... -
Khresmoi Query Translation Test Data 2.0
This package contains data sets for development and testing of machine translation of medical queries between Czech, English, French, German, Hungarian, Polish, Spanish ans... -
Khresmoi Summary Translation Test Data 2.0
This package contains data sets for development (Section dev) and testing (Section test) of machine translation of sentences from summaries of medical articles between Czech,... -
SimDiK
Data from the SimDiK project. -
Hamburg Corpus of Polish in Germany (HamCoPoliG)
This corpus version is deprecated in favor of version 0.2. -
EXMARaLDA Demo corpus 1.1
A selection of short audio and video recordings in various languages to be used for instruction or demonstration of the EXMARaLDA system. The EXMARaLDA Demo Corpus is a small... -
Hamburg Corpus of Polish in Germany (HamCoPoliG)
Audio recordings of German/Polish bilingual and Polish monolingual adults (16-46 years). Recordings of semi-spontaneous data (3 topics) and renarration of a picture story. The... -
Hamburg Corpus of Polish in Germany (HamCoPoliG)
Original Data: Audio recordings of German/Polish bilingual and Polish monolingual adults (16-46 years). Recordings of semi-spontaneous data (3 topics) and renarration of a... -
Community Interpreting Database Pilot Corpus (ComInDat)
Audio and video recordings of various types of community interpreted discourse (doctor-patient communication, simulated doctor-patient communication, courtroom communication) in...