PANACEA Annotated Dependency Spanish Environment Corpus Version 2

DOI

-

PANACEA Annotated Spanish Environment Corpus Version 2 consists of Spanish texts in the Environment (ENV) domain that were collected and automatically annotated in the framework of PANACEA (http://www.panacea-lr.eu), an EU-FP7 Funded Project under Grant Agreement 248064. The texts were crawled web pages that were automatically detected to be in the Spanish language and were automatically classified as relevant to the ENV domain. Data collection took place in the summer of 2011. The automatically assigned annotations deal with sentence and token segmentation, POS and lemma, dependency relations and named entities

Identifier
DOI https://doi.org/10.34810/data342
Metadata Access https://dataverse.csuc.cat/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.34810/data342
Provenance
Creator Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
Publisher CORA.Repositori de Dades de Recerca
Publication Year 2023
Funding Reference European Commission 248064
Rights Custom Dataset Terms; info:eu-repo/semantics/openAccess; https://dataverse.csuc.cat/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.34810/data342
OpenAccess true
Representation
Resource Type Textual data; Dataset
Format application/pdf; text/plain; text/html; application/zip
Size 194024; 566; 250; 9208; 693742726; 769070151; 769069974; 569545592; 924833860; 3937
Version 1.0
Discipline Agriculture, Forestry, Horticulture, Aquaculture; Agriculture, Forestry, Horticulture, Aquaculture and Veterinary Medicine; Humanities; Life Sciences; Social Sciences; Social and Behavioural Sciences; Soil Sciences