GrAF version of the Basque Dependency Treebank

This is the stand-off GrAF version of the Basque Dependency Treebank (BDT). The BDT is the Reference Corpus for the Processing of Basque (EPEC) annotated at the syntactic level. EPEC is a 300,000-word corpus of standard written journal texts intended as a training corpus for the development and improvement of several Natural Language Processing tools. It has been manually tagged at different levels: morphology, partial syntax and semantics.

Identifier
DOI https://doi.org/10.34810/data281
Metadata Access https://dataverse.csuc.cat/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.34810/data281
Provenance
Creator Aldezabal, Izaskun; Aranzabe, Maxux; Arriola, Jose Mari; Atutxa, Aitziber; Díaz de Ilarraza, Arantza; Estarrona, Ainara; Fernandez, Kike; Iruskieta, Mikel; Uria, Larraitz; Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA); Euskal Herriko Unibertsitatea. IXA Taldea
Publisher CORA. Repositori de Dades de Recerca
Publication Year 2023
Rights Custom Dataset Terms; info:eu-repo/semantics/openAccess; https://dataverse.csuc.cat/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.34810/data281
OpenAccess true
Representation
Resource Type Textual data; Dataset
Format application/zip; application/pdf; text/plain; text/xml; text/html
Size 2512996; 183630; 1713; 13867; 503; 19222
Version 1.0
Discipline Other
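
The Metadata Access URL listed under Identifier has the shape of an OAI-PMH GetRecord request. As a minimal sketch, the snippet below rebuilds that URL from the DOI. The endpoint and query parameters are copied from this record. The names `OAI_ENDPOINT`, `build_get_record_url` and `fetch_record` are hypothetical, chosen only for illustration. The fetch helper is defined but not called, since it needs network access and the response layout is not described in this record.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# Base URL taken from the Metadata Access line of this record.
OAI_ENDPOINT = "https://dataverse.csuc.cat/oai"


def build_get_record_url(doi: str, metadata_prefix: str = "oai_datacite") -> str:
    """Build the GetRecord URL for a DOI (hypothetical helper)."""
    params = {
        "verb": "GetRecord",
        "metadataPrefix": metadata_prefix,
        "identifier": f"doi:{doi}",
    }
    # safe=":/" leaves the colon and slash of the DOI unescaped,
    # so the result matches the URL as printed in the record.
    return f"{OAI_ENDPOINT}?{urlencode(params, safe=':/')}"


def fetch_record(doi: str) -> bytes:
    """Download the raw metadata response (requires network access)."""
    with urlopen(build_get_record_url(doi)) as response:
        return response.read()


url = build_get_record_url("10.34810/data281")
print(url)
```

Passing a different `metadata_prefix` is possible in principle, but which prefixes the repository accepts is not stated here, so only `oai_datacite` is grounded in this record.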