PANACEA English V-SUBCAT gold-standard for LAB domain

DOI

This is a domain-specific gold-standard for English subcategorization frames, in the case, for labour (LAB) domain. This gold-standard was manually developed, choosing a set of 29 verbs and 200 senteces for each verb. For each sentence, the SCFs present for the studied verb were manually annotated. The sentences were selected from crawled Web pages that were automatically detected to be in the English language and were automatically classified as relevant to the LAB domain. Data collection took place in the summer of 2011. This gold-standard was created in the context of PANACEA http://www.panacea-lr.eu), an EU-FP7 Funded Project under Grant Agreement 248064.

Identifier
DOI https://doi.org/10.34810/data370
Metadata Access https://dataverse.csuc.cat/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.34810/data370
Provenance
Creator University of Cambridge. Department of Theoretical and Applied Linguistics
Publisher CORA.Repositori de Dades de Recerca
Contributor Institut Universitari de Lingüística Aplicada (IULA)
Publication Year 2023
Funding Reference European Commission 248064
Rights Custom Dataset Terms; info:eu-repo/semantics/openAccess; https://dataverse.csuc.cat/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.34810/data370
OpenAccess true
Representation
Resource Type Textual data; Dataset
Format application/pdf; text/xml; text/plain; text/html
Size 194024; 171884; 231; 9767; 1958
Version 1.0
Discipline Agriculture, Forestry, Horticulture, Aquaculture; Agriculture, Forestry, Horticulture, Aquaculture and Veterinary Medicine; Humanities; Life Sciences; Social Sciences; Social and Behavioural Sciences; Soil Sciences