IULA Penn Treebank

DOI

This treebank consists of a number of Spanish and English sentences that has been manually annotated with syntactical information. The sentences have been choosed from the Penn TreeBank corpus, a resource containing texts from Wall Street Journal and originally compiled by the University of Pennsylvania./nIt contains 805 sentences that have been human translated to Spanish. The original English and the translated Spanish sentences share the same identification number. Sentences in both languages have been processed using the DELPH-IN environment (http://www.delph-in.net/).

Identifier
DOI https://doi.org/10.34810/data265
Metadata Access https://dataverse.csuc.cat/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.34810/data265
Provenance
Creator Universitat Pompeu Fabra. Institut Universitari de Lingüística Aplicada (IULA)
Publisher CORA.Repositori de Dades de Recerca
Publication Year 2023
Rights Custom Dataset Terms; info:eu-repo/semantics/openAccess; https://dataverse.csuc.cat/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.34810/data265
OpenAccess true
Representation
Resource Type Textual data; Dataset
Format application/pdf; application/zip; application/vnd.openxmlformats-officedocument.wordprocessingml.document; text/xml; text/plain; text/html
Size 172184; 16437571; 58602; 12678; 232; 12821; 1446
Version 1.0
Discipline Agriculture, Forestry, Horticulture, Aquaculture; Agriculture, Forestry, Horticulture, Aquaculture and Veterinary Medicine; Humanities; Life Sciences; Social Sciences; Social and Behavioural Sciences; Soil Sciences