-
Background data for: Advancing our understanding of dispersion measures in co...
Dataset description This dataset contains background data and supplementary material for Sönning (forthcoming), a study that looks at the behavior of dispersion measures when... -
Background data for: Some obstacles to replication in corpus linguistics
This dataset contains tabular files recording occurrences and frequencies of modal verbs in the Brown family corpora; nine modal verbs (can, could, may, might, must, shall,... -
Replication data for The contribution of the visual modality to vowel percept...
Data and scripts for the statistical analyses of correct identification (by English listeners) of vowels in Audio-only (A) and Audiovisual (AV) speech in noise for 2 French and... -
Corpus de les construccions comparatives intensificadores de la lletjor en ca...
Corpus de les construccions comparatives intensificadores en català, espanyol, anglès i francés. Les ocurrències que composen cadascun dels corpus han estat extretes a partir... -
Getuigen Verhalen, Geallieerde Bombardementen in Amsterdam Noord op de Fokker...
Mevrouw Klara Hoogendam-Woudstra is elf jaar als in Amsterdam Noord de Fokkerfabriek door de geallieerden wordt gebombardeerd, juli 1943. Ze komt uit een Gereformeerd gezin van... -
Replication Data for: On the role of ecological validity in language and spee...
Dataset abstract This dataset contains the results from 40 language and speech researchers, who completed a survey. In the first part of the survey, respondents were asked to... -
Background data for: Regression and random forests: Synergies for variationis...
This dataset contains tabular files recording occurrences of the verb REGRET complemented by a that- or -ing-complement clause (CC) in the GloWbE corpus. Tokens were retrieved... -
Dataset for "Scrabble yourself to success: Methods in teaching transcription."
This dataset is from a quasi-experimental study that evaluated two methods for teaching phonemic transcription to university students of English: (i) the transcription of... -
Concessive constructions in varieties of English: Corpus data
The data were used in a corpus-based study that investigates the variation of concessive constructions across nine varieties of English. Concessive constructions are here taken... -
Method in the madnessless: Exploring factors that impact the processing of tw...
Dataset abstract The data collected includes lexical decision data and reaction time data from 56 participants. Three sets of 30 two-suffixed pseudowords were created, each... -
Genre-sensitive Neural Situation Entity classifier (DE, EN)
This is a Classifier for situation entity types as described in Becker et al., 2017. These clause types depend on a combination of syntactic-semantic and contextual features. We... -
The superlative alternation in present-day English: Questionnaire data
This dataset contains elicitation data collected through a questionnaire on superlative strategy choice in English (X-est vs. most X). Native speakers (n = 675) were asked to... -
Replication Data for: The acquisition of the English dative alternation by Ru...
Dataset abstract The dataset contains the ratings for a 100-split task performed by Russian learners of English. 272 Russian learners were subdivided into two groups. One... -
WiKNN Text Classifier
WiKNN is an online text classifier service for Polish and English texts. It supports hierarchical labelled classification of user-submitted texts with Wikipedia categories.... -
Wittgenstein Archives at the University of Bergen (WAB): WiTTLex - The WiTTFi...
WiTTLex - The WiTTFind Lexicon of Wittgenstein’s Philosophical Nachlass, with Frequency Lists and Indication of the Words’ Sources in the Nachlass WiTTLex is an electronic... -
UHR's Termbase for Norwegian higher education institutions UHRs termbase for...
This is a collection of 2000 administrative terms with English - Norwegian bokmål/Norwegian bokmål - English and English - Norwegian nynorsk/Norwegian nynorsk - English... -
Europarl – svenska-engelska (2013-11-17) Europarl – Swedish-English (2013-11...
Part of European Parliament Proceedings Parallel Corpus Del av European Parliament Proceedings Parallel Corpus -
LSI (2020-08-25)
Linguistic Survey of India -
OpenEDGeS (2021-05-24)
The public license subset of the EDGeS Diachronic Bible Corpus, a diachronically and synchronically parallel corpus of Bible translations in Dutch,English, German and Swedish,... -
The English-Swedish Parallel Corpus (ESPC) (2022-11-15)
This dataset has no description