WMT18 APE Shared Task: En-DE NMT Train and Dev Data

Training and development data for the WMT 2018 Automatic Post-Editing (APE) shared task. The data consist of English-German triplets (source, target, and post-edit) from the information technology domain and are already tokenized. The training set contains 13,442 triplets and the development set contains 1,000. The target segments were generated by a neural machine translation system. All data are provided by the EU project QT21 (http://www.qt21.eu/).
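As an illustration of the triplet structure described above, the following is a minimal Python sketch for loading the data. It assumes, and this record does not confirm it, that each split is distributed as three line-aligned UTF-8 text files with one segment per line. The file names in the usage comment, the `Triplet` type, and the `read_triplets` helper are hypothetical and introduced only for this example.

```python
from typing import List, NamedTuple


class Triplet(NamedTuple):
    source: str     # English source segment
    target: str     # German machine-translated segment
    post_edit: str  # human post-edit of the target


def read_lines(path: str) -> List[str]:
    """Read a UTF-8 text file and return its lines without trailing newlines."""
    with open(path, encoding="utf-8") as handle:
        return [line.rstrip("\n") for line in handle]


def read_triplets(src_path: str, mt_path: str, pe_path: str) -> List[Triplet]:
    """Combine three line-aligned files (assumed layout) into triplets."""
    columns = [read_lines(p) for p in (src_path, mt_path, pe_path)]
    lengths = [len(column) for column in columns]
    if len(set(lengths)) != 1:
        # Misaligned files would silently pair the wrong segments.
        raise ValueError(f"files are not line-aligned: {lengths}")
    return [Triplet(*row) for row in zip(*columns)]


# Hypothetical usage; substitute the actual file names from the archive:
# train = read_triplets("train.src", "train.mt", "train.pe")
```

If the files match this layout, the training split should yield 13,442 triplets and the development split 1,000, which gives a quick sanity check after download.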

Identifier
PID http://hdl.handle.net/11372/LRT-2613
Related Identifier http://www.statmt.org/wmt18/ape-task.html
Metadata Access http://lindat.mff.cuni.cz/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:lindat.mff.cuni.cz:11372/LRT-2613
Provenance
Creator Turchi, Marco; Negri, Matteo; Chatterjee, Rajen
Publisher Fondazione Bruno Kessler, Trento, Italy
Publication Year 2018
Funding Reference info:eu-repo/grantAgreement/EC/H2020/645452
Rights AGREEMENT ON THE USE OF DATA IN QT21 APE Task; https://lindat.mff.cuni.cz/repository/xmlui/page/licence-TAUS_QT21; PUB
OpenAccess true
Contact lindat-help(at)ufal.mff.cuni.cz
Representation
Language English; German
Resource Type corpus
Format text/plain; charset=utf-8; application/x-gzip; downloadable_files_count: 1
Discipline Linguistics