-
The corpus of older Slovenian narrative prose PriLit 1.0
The PriLit corpus contains 37 texts of older Slovenian narrative prose by 12 authors. One text, Sreča v nesreči (Fortune in Misfortune) by Janez Cigler (first published in... -
Digital library and corpus of historical Slovene IMP 1.1
The IMP digital library contains historical Slovene books and other publications, together 658 texts with over 45,000 pages from the period 1584-1919. Each text contains... -
Reference corpus of historical Slovene goo300k 1.2
goo300k is a manually annotated reference corpus of historical Slovene. It contains 1,100 pages (about 300,000 tokens) sampled from 89 texts from the period 1584-1899. Each text... -
Dataset of normalised Slovene text KonvNormSl 1.0
Data used in the experiments described in: Nikola Ljubešić, Katja Zupan, Darja Fišer and Tomaž Erjavec: Normalising Slovene data: historical texts vs. user-generated content.... -
Lexicon of historical Slovene imp25k 1.1
The imp25k lexicon of historical Slovene was created automatically from the goo300k and foo3M annotated corpora and contains attested and manually verified word forms and their... -
Words of the 16th-Century Slovenian Literary Language
This dictionary provides comprehensive information on the vocabulary used in the Slovenian literary language during the period of the Reformation. It was written based on... -
Concordances of Primož Trubar's "Ta evangeli sv. Matevža" (1555)
The 23603 concordances represent a transcription of the book "Ta evangeli sv. Matevža" (1555) by Primož Trubar. -
IMP corpus n-grams 1.0
This is a collection of n-grams extracted from the IMP corpus of historical Slovene (http://hdl.handle.net/11356/1031). In addition to the separate lists of n-grams for tokens... -
Slovenian-German Dictionary of Maks Pleteršnik (1894-1895)
The Slovenian-German Dictionary of Maks Pleteršnik was first published in 1894-1895. It contains 103,185 dictionary entries. Beside standard and dialect lexis of the 19th... -
Dictionary of the Slovenian Language in the Works of Janez Svetokriški
The Dictionary of the Slovenian Language in the Works of Janez Svetokriški (Slovar jezika Janeza Svetokriškega) presents and explains the lexis, including proper nouns, from 233...