Qidian-Webnovel Corpus 110

The corpus was created by manually searching Webnovel.com for translated novels in the NOVEL category, targeting completed works only. Each translated novel identified this way was then mapped to its original counterpart on Qidian.com.

The search yielded 120 novels. For 10 of them the copyright on Qidian.com had expired, so no Qidian data could be collected for those novels. The final corpus therefore consists of 110 novels together with all reader comments and replies to them (timestamp 01/09/2024). Comments and replies are categorized as book-level, chapter-level, or paragraph-level and are stored per novel.

We also collected the user profiles of readers who have left comments or replies on the novels. We collected only the personal data necessary for the purposes of the scientific research, and we strictly abide by the GDPR.

Identifier
DOI https://doi.org/10.34894/GQXX3K
Metadata Access https://dataverse.nl/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.34894/GQXX3K
Provenance
Creator Yu, Ze; Pianzola, Federico (ORCID: 0000-0001-6634-121X); Tatar, Emin
Publisher DataverseNL
Contributor Groningen Digital Competence Centre; University of Groningen; Ze Yu; Emin Tatar; Federico Pianzola; DataverseNL Network
Publication Year 2024
Funding Reference European Commission 101040938
Rights info:eu-repo/semantics/restrictedAccess
OpenAccess false
Contact Groningen Digital Competence Centre (University of Groningen)
Representation
Resource Type machine-readable text; Dataset
Format text/csv; application/zip; text/plain
Size 16392; 148397; 82344; 1253056; 141589869; 270934845; 1719; 118084; 164626; 2008239; 6384512; 19374543; 14366424; 41535351; 52937107; 26360006
Version 1.0
Discipline Agriculture, Forestry, Horticulture, Aquaculture; Agriculture, Forestry, Horticulture, Aquaculture and Veterinary Medicine; Humanities; Life Sciences; Social Sciences; Social and Behavioural Sciences; Soil Sciences
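
The Metadata Access link listed under Identifier is an OAI-PMH GetRecord request. As a minimal sketch of how that URL is put together from the DOI, the following Python snippet rebuilds it from its parts. The function name build_getrecord_url and the constants are illustrative assumptions and are not part of the record. The snippet only builds and inspects the URL; it does not contact the server.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

# Values copied from the record above.
BASE_URL = "https://dataverse.nl/oai"
DOI = "10.34894/GQXX3K"


def build_getrecord_url(doi, metadata_prefix="oai_datacite", base_url=BASE_URL):
    """Build an OAI-PMH GetRecord URL for a DOI (hypothetical helper)."""
    params = {
        "verb": "GetRecord",
        "metadataPrefix": metadata_prefix,
        "identifier": f"doi:{doi}",
    }
    # safe=":/" leaves the colon and slash of the DOI unescaped,
    # matching the form of the URL shown in the record.
    return f"{base_url}?{urlencode(params, safe=':/')}"


url = build_getrecord_url(DOI)

# Parse the query string back into its parameters to check the round trip.
query = parse_qs(urlsplit(url).query)
print(url)
print(query["identifier"][0])
```

Changing metadata_prefix would request a different metadata format, provided the repository supports it; only oai_datacite is confirmed by the record itself.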