Dataset on Locative and Existential Predication
This dataset provides a comprehensive collection of annotated linguistic data on locative, existential, and possessive predication across various languages. Developed within the research project "Lokative und existentiale Prädikation in Sprachen des Ob-Jenissei-Areals: Typologie und Informationsstruktur" ("Locative and existential predication in languages of the Ob-Yenisei area: typology and information structure"; 07/2022-06/2025, DFG project no. 490822200), the dataset includes digital corpora from 16 Siberian languages, enabling typological and pragmatic analysis.
Key features of the dataset include:
Corpus-based annotation of existential, locative, and possessive clauses
Detailed linguistic categorization, including morphosyntactic structures and information status
Multi-layer annotation using the XML-based EXMARaLDA framework
Data ready for statistical analysis, structured in SPSS format
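As a rough illustration of how multi-layer annotation of this kind can be read, the sketch below parses a tiny EXMARaLDA basic transcription and pairs each text event with the annotation that covers the same time span. It relies only on the general EXMARaLDA layout (tier elements containing event elements anchored to timeline points). The tier categories "tx" and "pred", the speaker code, and the sample values are invented for illustration and are not taken from the dataset, whose actual tier inventory may differ.

```python
import xml.etree.ElementTree as ET

# Hypothetical miniature EXMARaLDA basic transcription (.exb content).
# Tier categories ("tx", "pred") and all values are illustrative assumptions.
SAMPLE_EXB = """<basic-transcription>
  <head/>
  <basic-body>
    <common-timeline>
      <tli id="T0"/>
      <tli id="T1"/>
      <tli id="T2"/>
    </common-timeline>
    <tier id="tx" category="tx" type="t" speaker="SPK0">
      <event start="T0" end="T1">clause one</event>
      <event start="T1" end="T2">clause two</event>
    </tier>
    <tier id="pred" category="pred" type="a" speaker="SPK0">
      <event start="T0" end="T1">existential</event>
      <event start="T1" end="T2">locative</event>
    </tier>
  </basic-body>
</basic-transcription>
"""


def read_tiers(xml_text):
    """Return {tier category: {(start, end): event text}}."""
    root = ET.fromstring(xml_text)
    tiers = {}
    for tier in root.iter("tier"):
        events = {}
        for event in tier.iter("event"):
            span = (event.get("start"), event.get("end"))
            events[span] = (event.text or "").strip()
        tiers[tier.get("category")] = events
    return tiers


def align(tiers, base, layer):
    """Pair each event of the base tier with the layer event on the same span."""
    return [(text, tiers[layer].get(span)) for span, text in tiers[base].items()]


tiers = read_tiers(SAMPLE_EXB)
pairs = align(tiers, "tx", "pred")
for text, label in pairs:
    print(f"{text}\t{label}")
```

Matching on identical start and end points is the simplest alignment strategy; annotations that span several base events would need an interval comparison against the timeline order instead.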
Languages covered in the dataset:
Uralic languages: Khanty, Mansi, Nenets, Enets, Nganasan, Selkup, Kamas
Turkic languages: Dolgan, Sakha, Chulym Turkic, Khakas
Tungusic languages: Evenki, Even
Yeniseian languages: Ket, Yugh
Yukaghir
The preparation of this dataset was supported by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation), project no. 490822200.