Background data for: Regression and random forests: Synergies for variationist corpus research

This dataset contains tabular files recording occurrences of the verb REGRET complemented by a that- or -ing-complement clause (CC) in the Corpus of Global Web-based English (GloWbE). Tokens were retrieved through the corpus's online interface (https://www.english-corpora.org/glowbe/) and manually annotated for several syntactic and semantic variables: variety, text type, finiteness, meaning of the verb regret, voice of the CC, words in the CC, coreferentiality, intervening material, negation in the CC, and temporal relation. See the ReadMe file for more details. Related publication: Sönning, Lukas, Jason Grafmiller & Raquel P. Romasanta. 2024. Regression and random forests: Synergies for variationist corpus research. ICAME 45, University of Vigo, 18–22 June 2024.

Identifier
DOI https://doi.org/10.18710/MHGXDH
Metadata Access https://dataverse.no/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.18710/MHGXDH
Provenance
Creator Romasanta, Raquel P.
Publisher DataverseNO
Contributor Romasanta, Raquel P.; University of Vigo; The Tromsø Repository of Language and Linguistics (TROLLing)
Publication Year 2024
Funding Reference The Spanish Ministry of Economy and Competitiveness FFI2017-82162-P; The Spanish Ministry of Economy and Competitiveness PRE2018-083249; The Spanish Ministry of Science and Innovation funded by MCIN/AEI/10.13039/501100011033 PID2020-117030GB-I00; The Recovery, Transformation, and Resilience Plan of the European Union “NextGenerationEU”, University of Vigo 585507
Rights CC0 1.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/publicdomain/zero/1.0
OpenAccess true
Contact Romasanta, Raquel P. (University of Santiago de Compostela)
Representation
Resource Type Annotated corpus data; Dataset
Format text/plain
Size 5854; 425482
Version 1.0
Discipline Humanities; Linguistics