Evaluating MT for LIWC

Data belonging to the article 'Machine-translated texts as an alternative to translated dictionaries for LIWC'.

The data are the result of computations described in the article. Each zip file contains the results of a single run. The name of each zip file consists of the run number, the language code, the corpus used, the machine translation (MT) engine used and the LIWC version.

The zip files contain:
- the LIWC output for the working language, the English gold translation and the English machine translation (.csv files)
- computed statistics, including correlations and effect sizes for each of the three language pairs (.csv files)
- graphs giving the correlations and effect sizes for each LIWC category as well as the relevant averages (.png files)
- a report showing a number of texts where there are sizable differences between the LIWC output in the different languages (.html files)
- the scripts used in performing the computations
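As a quick way to check what a downloaded run contains, the members of a zip file can be grouped by file extension. The following is a minimal sketch using only the Python standard library. The function names `inventory_by_extension` and `build_demo_archive` and all member names in the demo archive are hypothetical and chosen for illustration; the actual file names inside the dataset's zip files are not documented here.

```python
import io
import zipfile
from collections import defaultdict
from pathlib import PurePosixPath


def inventory_by_extension(archive):
    """Group the member names of a zip archive by lower-cased file extension.

    `archive` may be a path to a zip file or a binary file-like object.
    """
    groups = defaultdict(list)
    with zipfile.ZipFile(archive) as zf:
        for name in zf.namelist():
            if name.endswith("/"):  # skip directory entries
                continue
            ext = PurePosixPath(name).suffix.lower().lstrip(".")
            groups[ext].append(name)
    return {ext: sorted(names) for ext, names in groups.items()}


def build_demo_archive():
    """Build a tiny in-memory zip with made-up member names (illustration only)."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as zf:
        for name in (
            "liwc_output.csv",   # hypothetical name
            "statistics.csv",    # hypothetical name
            "correlations.png",  # hypothetical name
            "report.html",       # hypothetical name
            "compute.py",        # hypothetical name
        ):
            zf.writestr(name, "")
    buffer.seek(0)
    return buffer


if __name__ == "__main__":
    inventory = inventory_by_extension(build_demo_archive())
    for ext, names in sorted(inventory.items()):
        print(ext, names)
```

To inspect a real run, pass the path of a downloaded zip file to `inventory_by_extension` instead of the demo archive; the groups should correspond to the csv, png, html and py formats listed for this dataset.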

Identifier
DOI https://doi.org/10.17026/dans-xgp-nyp7
PID https://nbn-resolving.org/urn:nbn:nl:ui:13-1y-jn9e
Metadata Access https://easy.dans.knaw.nl/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:easy.dans.knaw.nl:easy-dataset:197027
Provenance
Creator Boot, P
Publisher Data Archiving and Networked Services (DANS)
Contributor Boot, P; dr P Boot (Huygens ING)
Publication Year 2021
Rights info:eu-repo/semantics/openAccess; License: http://creativecommons.org/licenses/by/4.0
OpenAccess true
Representation
Language English
Resource Type Dataset
Format html; csv; png; py
Discipline Humanities