-
Frequency lists of character-level n-grams from the GOS 1.0 corpus 1.1
Frequency lists of character-level n-grams were extracted from the GOS 1.0 Corpus of Spoken Slovene (http://hdl.handle.net/11356/1040) using the LIST corpus extraction tool... -
Frequency lists of character-level n-grams from the Gigafida 2.0 corpus
Frequency lists of character-level n-grams were extracted from the Gigafida 2.0 Corpus of Written Standard Slovene (https://viri.cjvt.si/gigafida/) using the LIST corpus... -
Frequency list of words from the Trendi corpus 2019
This frequency list of words was prepared by extracting words (i.e. lemmas with their lexical features) from the Trendi Monitor Corpus of Slovene (see e.g.... -
Frequency list of words by source from the Trendi corpus 2022-07
The frequency list of words by source was prepared in the following manner: words (i.e. lemmas with their lexical features) were extracted from 15 most frequent sources in the... -
Frequency list of words from the Trendi corpus 2021
This frequency list of words was prepared by extracting words (i.e. lemmas with their lexical features) from the Trendi Monitor Corpus of Slovene... -
Frequency lists of words from the GOS 1.0 corpus
Frequency lists of words were extracted from the GOS 1.0 Corpus of Spoken Slovene (http://hdl.handle.net/11356/1040) using the LIST corpus extraction tool... -
Consonant-vowel structures in the GOS 1.0 corpus 1.1
The lists contain consonant-vowel structures of all lemmas, word forms, and standardized word forms in the GOS 1.0 Corpus of Spoken Slovene (http://hdl.handle.net/11356/1040).... -
Frequency lists of words from the GOS 1.0 corpus 1.1
Frequency lists of words were extracted from the GOS 1.0 Corpus of Spoken Slovene (http://hdl.handle.net/11356/1040) using the LIST corpus extraction tool... -
Consonant-vowel structures in the Gigafida 2.0 corpus
The lists contain consonant-vowel structures of all lemmas and word forms in the Gigafida 2.0 corpus. In each unit, its characters were converted as follows: C - consonant (in... -
Consonant-vowel structures in the GOS 1.0 corpus
The lists contain consonant-vowel structures of all lemmas, word forms, and normalized word forms in the GOS 1.0 Corpus of Spoken Slovene (http://hdl.handle.net/11356/1040). In... -
Frequency list of words from the Trendi corpus 2020
This frequency list of words was prepared by extracting words (i.e. lemmas with their lexical features) from the Trendi Monitor Corpus of Slovene... -
Frequency lists of character-level n-grams from the GOS 1.0 corpus
Frequency lists of character-level n-grams were extracted from the GOS 1.0 Corpus of Spoken Slovene (http://hdl.handle.net/11356/1040) using the LIST corpus extraction tool... -
Frequency lists of word parts from the Gigafida 2.0 corpus
Frequency lists of words split into word parts were extracted from the Gigafida 2.0 Corpus of Written Standard Slovene (https://viri.cjvt.si/gigafida/) using the LIST corpus... -
Wordlist of Lemmas from the Joint Corpus of Lithuanian
The resource is a wordlist of lemmas from the Joint Corpus of Lithuanian (JCL). The JCL is a merge of three corpora: 1) Vilnius university corpus compiled out of the Lithuanian...