-
Whatsapp corpus Berntzen
WhatsApp conversations collected by master's students in Communication & Information Studies (2013-2014; 2014-2015). All participants in the conversations are over 18 and have signed... -
Whatsapp corpus Verheijen
WhatsApp data collected for the PhD research of Lieke Verheijen (Radboud University). Informed consent was obtained only from the contributor, not from the conversational partner. Consequently,... -
QLK Subkorpus Transstimmen
The sub-corpus "Transstimmen" is part of the project "QLK - Queerlinguistisches Korpus". This is a one-year pilot project funded by the Equal Opportunities... -
QLK Subkorpus CSD-Berichterstattung
The sub-corpus "CSD-Berichterstattung" is part of the project "QLK - Queerlinguistisches Korpus". This is a one-year pilot project funded by the Equal... -
QLK Subkorpus Queere Tiere
The sub-corpus "Queere Tiere" is part of the project "QLK - Queerlinguistisches Korpus". This is a one-year pilot project funded by the Equal Opportunities... -
Background data for: Regression and random forests: Synergies for variationis...
This dataset contains tabular files recording occurrences of the verb REGRET complemented by a that- or -ing-complement clause (CC) in the GloWbE corpus. Tokens were retrieved... -
Data on Terminological Semantic Variation between the (US and British) Press ...
The data set contains three spreadsheets, two of which are combined in a single Excel file. The first file, entitled "Cosine_Similarity_UN-Press", represents the cosine... -
Aufweichen, abbremsen, abschirmen – Wirtschaftsmetaphern zwischen politischer...
Our study examines the use of metaphors in the German discourse regarding European monetary and fiscal policy. Though this policy area sparked considerable interest among... -
Background data (adapted from Jenset & McGillivray 2017) for: Down-sampling f...
This dataset, which is adapted from Jenset and McGillivray (2017), contains tabular files documenting the alternating usage of -(e)th and -(e)s to mark... -
Replication data for: Big data in Russian linguistics? Another look at paucal...
This post contains a database of Russian numeral constructions from the RuTenTen corpus (https://www.sketchengine.co.uk/rutenten-russian-corpus/). The constructions are of the... -
Replication Data for: The decade construction rivalry in Russian: Using a cor...
This dataset contains 3 data files, 5 files with R code, and a short read-me file with documentation. The data files contain information about the development of two competing... -
Replication Data for: Zur Determiniererlosigkeit bei prädikativ verwendeten z...
This data set contains the replication data for the article "Zur Determiniererlosigkeit bei prädikativ verwendeten zählbaren Nomen im Deutschen: Korpusdaten und ihre... -
Q-CAT Corpus Annotation Tool 1.5
The Q-CAT (Querying-Supported Corpus Annotation Tool) is a tool for manual linguistic annotation of corpora, which also enables advanced queries on top of these annotations. The... -
Q-CAT Corpus Annotation Tool 1.4
The Q-CAT (Querying-Supported Corpus Annotation Tool) is a tool for manual linguistic annotation of corpora, which also enables advanced queries on top of these annotations. The... -
Q-CAT Corpus Annotation Tool 1.3
The Q-CAT (Querying-Supported Corpus Annotation Tool) is a computational tool for manual annotation of language corpora, which also enables advanced queries on top of these... -
Q-CAT Corpus Annotation Tool 1.2
The Q-CAT (Querying-Supported Corpus Annotation Tool) is a computational tool for manual annotation of language corpora, which also enables advanced queries on top of these... -
Q-CAT Corpus Annotation Tool 1.1
The Q-CAT (Querying-Supported Corpus Annotation Tool) is a computational tool for manual annotation of language corpora, which also enables advanced queries on top of these... -
Q-CAT Corpus Annotation Tool 1.0
The Q-CAT (Querying-Supported Corpus Annotation Tool) is a computational tool for manual annotation of language corpora, which also enables advanced queries on top of these... -
Corpus extraction tool LIST 1.0
The LIST corpus extraction tool is a Java program for extracting lists from text corpora on the levels of characters, word parts, words, and word sets. It supports VERT and TEI... -
Dependency tree extraction tool STARK 1.0
STARK is a Python-based command-line tool for extracting dependency trees from parsed corpora, aimed at corpus-driven linguistic investigations of syntactic phenomena of...