Replication Data for: Zooming in on the semantics of French ingressives: a collostructional analysis

Dataset abstract: The dataset includes an annotated corpus sample of N = 2000 French sentences with se mettre à or commencer à (1000 observations of each verb). The sample was drawn from the literary corpus Frantext and the journalistic corpus Le Monde (1000 observations from each corpus). The sample is balanced for verb as well as corpus, so there are 500 observations for each Verb-Corpus combination. The data is annotated for 3 variables: Source (corpus), Verb, collexeme.
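The balanced design described above (500 observations per Verb-Corpus cell) can be checked with a short script. This is a minimal sketch, not part of the dataset: the column names Source, Verb and collexeme are assumed from the variable list above and may differ in the actual files, and the toy rows are invented for illustration only.

```python
from collections import Counter


def cell_counts(rows):
    """Count observations per (Verb, Source) combination."""
    return Counter((row["Verb"], row["Source"]) for row in rows)


def is_balanced(rows, expected_per_cell=500, expected_cells=4):
    """True if every Verb-Source cell holds exactly expected_per_cell rows."""
    counts = cell_counts(rows)
    return (
        len(counts) == expected_cells
        and all(n == expected_per_cell for n in counts.values())
    )


# Invented toy rows (one per cell); the real sample has 500 per cell.
toy_rows = [
    {"Source": "Frantext", "Verb": "se mettre à", "collexeme": "rire"},
    {"Source": "Frantext", "Verb": "commencer à", "collexeme": "comprendre"},
    {"Source": "Le Monde", "Verb": "se mettre à", "collexeme": "pleurer"},
    {"Source": "Le Monde", "Verb": "commencer à", "collexeme": "voir"},
]

print(is_balanced(toy_rows, expected_per_cell=1))
```

With the real data, the rows could be read with csv.DictReader from the dataset's CSV file and checked with the default expected_per_cell=500.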

Article abstract: This paper examines the semantic value of the infinitive in the ingressive constructions se mettre à (SMA) and commencer à (COMA) using a distinctive collexeme analysis. We find that the collexemes significant for the construction SMA are fairly homogeneous across the different corpora and can be grouped into the general category of expressive collexemes. The collexemes significant for COMA are more heterogeneous and belong to the category of cognitive collexemes and to semantic fields of sensory and creative acts. The results are compatible with the hypothesis put forward by Verroens and De Cuypere (2023) stating that the overall meaning of the SMA construction is intrinsically punctual. The punctual value of SMA is not only compatible with expressive collexemes, but, moreover, emphasizes their unforeseen and unintentional meaning. Conversely, the incremental value of COMA is consistent with the gradual onset of cognitive and sensory collexemes.
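For readers unfamiliar with the method, a distinctive collexeme analysis compares how often each infinitive occurs in one construction versus the other, typically with a Fisher exact test on a 2x2 table. The following is a minimal sketch of that general idea in pure Python. It is not the PerlClx implementation used for this dataset; the function names and the toy frequencies are assumptions made for illustration.

```python
from math import comb


def fisher_one_sided(a, b, c, d):
    """One-sided Fisher exact p-value, P(X >= a), for the table [[a, b], [c, d]].

    a: target collexeme in construction 1, b: target collexeme in construction 2,
    c: other collexemes in construction 1, d: other collexemes in construction 2.
    """
    row1 = a + b          # all tokens of the target collexeme
    col1 = a + c          # all tokens of construction 1
    n = a + b + c + d
    upper = min(row1, col1)
    tail = sum(
        comb(row1, k) * comb(n - row1, col1 - k) for k in range(a, upper + 1)
    )
    return tail / comb(n, col1)


def distinctive_collexemes(freq_1, freq_2):
    """Map each collexeme to (preferred construction, p-value).

    freq_1 and freq_2 are dicts of collexeme frequencies per construction.
    """
    total_1 = sum(freq_1.values())
    total_2 = sum(freq_2.values())
    results = {}
    for lex in sorted(set(freq_1) | set(freq_2)):
        a = freq_1.get(lex, 0)
        b = freq_2.get(lex, 0)
        expected_1 = (a + b) * total_1 / (total_1 + total_2)
        if a >= expected_1:
            # Attracted to construction 1: test the upper tail for a.
            results[lex] = (1, fisher_one_sided(a, b, total_1 - a, total_2 - b))
        else:
            # Attracted to construction 2: same test with the columns swapped.
            results[lex] = (2, fisher_one_sided(b, a, total_2 - b, total_1 - a))
    return results


# Invented toy frequencies, not taken from the dataset.
toy_sma = {"rire": 8, "lire": 2}
toy_coma = {"rire": 2, "lire": 8}
for lex, (construction, p) in distinctive_collexemes(toy_sma, toy_coma).items():
    print(lex, construction, round(p, 4))
```

Here construction 1 would correspond to SMA and construction 2 to COMA. Collostructional tools often report the strength of attraction as the negative base-10 logarithm of such a p-value rather than the p-value itself.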

Verroens, F., & De Cuypere, L. (2023). French ingressives and (phasal) aspect: A frame-semantic corpus-based analysis. Canadian Journal of Linguistics/Revue Canadienne de Linguistique, 68(3), 435-461. doi:10.1017/cnj.2023.19

PerlClx, 1.0b

MS Excel, Microsoft Office Professional Plus 2016

Identifier
DOI https://doi.org/10.18710/SZZDLI
Related Identifier https://doi.org/10.1017/cnj.2023.19
Metadata Access https://dataverse.no/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.18710/SZZDLI
Provenance
Creator Verroens, Filip
Publisher DataverseNO
Contributor Verroens, Filip; Ghent University; Verroens Filip; The Tromsø Repository of Language and Linguistics (TROLLing)
Publication Year 2024
Rights info:eu-repo/semantics/openAccess
OpenAccess true
Contact Verroens, Filip (Ghent University)
Representation
Resource Type annotated corpus data; Dataset
Format text/plain; text/comma-separated-values; application/zip
Size 8995; 90752; 53559; 54636; 66966; 5721; 642; 9490
Version 1.0
Discipline Humanities
Spatial Coverage Belgium, Flanders