Replication Data for: The decade construction rivalry in Russian: Using a corpus to study historical linguistics

Dataset

DOI

This dataset contains 3 data files, 5 files with R code, and a short read-me file with documentation. The data files contain information about the development of two competing constructions in Russian temporal adverbials. The files with R code give the code for analysis of the databases.

ARTICLE ABSTRACT: What can a corpus do for the historical linguist? How can corpus data shed light on the diachronic development of so-called rival forms, i.e., words or grammatical constructions that appear to be synonyms? This article addresses these questions based on a detailed empirical analysis of two seemingly synonymous constructions in Russian. Corresponding to the English ‘decade construction’ in the twenties, Russian has two rival constructions, viz. v dvadcatye gody [lit. “in the twentieth years”] (with the numeral and noun in the accusative) and v dvadcatyx godax (with the numeral and noun in the locative case). Three hypotheses about rival forms are considered: leveling (whereby one form ousts its rival), sociolinguistic differentiation (whereby the two rivals survive in different varieties of a language) and semantic differentiation (whereby the two rivals develop different meanings over time). Contrary to what has been suggested in the literature, we find little evidence for semantic and sociolinguistic differentiation. Instead, we demonstrate that leveling is taking place, since the accusative construction is in the process of ousting its rival. While our study shows that corpus data facilitate detailed analysis of the interaction between leveling, sociolinguistic differentiation and semantic differentiation, our analysis also points to limitations, especially when it comes to corpus-based analysis of sociolinguistic and semantic factors.

Identifier
DOI	https://doi.org/10.18710/QKHCVE
Related Identifier	https://doi.org/10.1075/dia.16043.nes
Metadata Access	https://dataverse.no/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.18710/QKHCVE

Provenance
Creator	Nesset, Tore; Makarova, Anastasia
Publisher	DataverseNO
Contributor	Nesset, Tore; UiT The Arctic University of Norway; The Tromsø Repository of Language and Linguistics
Publication Year	2017
Rights	CC0 1.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/publicdomain/zero/1.0
OpenAccess	true
Contact	Nesset, Tore (UiT The Arctic University of Norway)

Representation
Resource Type	Corpus data; Dataset
Format	text/plain; text/csv; text/tab-separated-values
Size	6789; 6485190; 8037582; 3809376; 958895; 781; 805; 1069; 1721; 1169
Version	1.2
Discipline	Humanities; Linguistics