Evaluating MT for LIWC

Dataset

Data belonging to the article 'Machine-translated texts as an alternative to translated dictionaries for LIWC'.

The data are the result of computations described in the article. Each zip file contains the results of a single run. The name of the zip file consists of the run number, the language code, the corpus used, the MT engine used and the LIWC version.

The zip files contain:
- the LIWC output for the working language, the English gold translation and the English machine translation (.csv files)
- computed statistics, including correlations and effect sizes for each of the three language pairs (.csv files)
- graphs giving the correlations and effect sizes for each LIWC category as well as the relevant averages (.png files)
- a report showing a number of texts where there are sizable differences between the LIWC output in the different languages (.html files)
- the scripts used in performing the computations
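
To check a downloaded run against the contents listed above, the following minimal sketch groups the members of one zip file by file extension. It uses only the Python standard library. The function name `inventory_by_extension` is an assumption made for illustration and is not part of the dataset's own scripts; the internal layout of the archives is not described in this record, so the sketch makes no assumptions about folder names.

```python
import zipfile
from collections import defaultdict
from pathlib import PurePosixPath


def inventory_by_extension(zip_path):
    """Group the member files of one run archive by lowercase extension.

    Hypothetical helper for illustration; not taken from the dataset's scripts.
    """
    groups = defaultdict(list)
    with zipfile.ZipFile(zip_path) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue  # skip directory entries, keep files only
            # Zip member names always use forward slashes.
            suffix = PurePosixPath(info.filename).suffix.lower()
            groups[suffix].append(info.filename)
    return {ext: sorted(names) for ext, names in groups.items()}
```

Calling `inventory_by_extension` with the path to one downloaded zip file returns a dictionary keyed by extension. Going by the description above, one would expect entries for `.csv`, `.png`, `.html` and `.py`.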

Identifier
DOI	https://doi.org/10.17026/dans-xgp-nyp7
PID	https://nbn-resolving.org/urn:nbn:nl:ui:13-1y-jn9e
Metadata Access	https://easy.dans.knaw.nl/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:easy.dans.knaw.nl:easy-dataset:197027

Provenance
Creator	Boot, P
Publisher	Data Archiving and Networked Services (DANS)
Contributor	Boot, P; dr P Boot (Huygens ING)
Publication Year	2021
Rights	info:eu-repo/semantics/openAccess; License: http://creativecommons.org/licenses/by/4.0
OpenAccess	true

Representation
Language	English
Resource Type	Dataset
Format	html; csv; png; py
Discipline	Humanities