Qidian-Webnovel Corpus 110

Dataset

The corpus was created by manually searching Webnovel.com for translated novels in the NOVEL category, targeting only completed works. Each translated novel identified was then mapped to its original counterpart on Qidian.com.

This search yielded 120 novels. For 10 of them, the copyright on Qidian.com had expired, so no Qidian data are available for those novels. The final corpus therefore consists of 110 novels, together with all reader comments and replies to them (timestamp 01/09/2024). Comments and replies are categorized as book-level, chapter-level, or paragraph-level, and are stored per novel.

We also collected the user profiles of readers who have left comments or replies on the novels. We collected only the personal data necessary for the purposes of the scientific research, and we strictly abide by the GDPR.

Identifier
DOI	https://doi.org/10.34894/GQXX3K
Metadata Access	https://dataverse.nl/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.34894/GQXX3K
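
The Metadata Access URL above is an OAI-PMH GetRecord request. As a minimal sketch, the Python below rebuilds that URL from the DOI and pulls title elements out of the returned XML. The helper names (build_getrecord_url, extract_titles, fetch_titles) are hypothetical, and the assumption that the response contains elements named "title" has not been checked against this record. Only the metadata is retrieved this way; the data files themselves are under restricted access.

```python
from urllib.parse import urlencode
from urllib.request import urlopen
import xml.etree.ElementTree as ET

# Endpoint copied from the Metadata Access URL in this record.
OAI_ENDPOINT = "https://dataverse.nl/oai"


def build_getrecord_url(doi, metadata_prefix="oai_datacite"):
    """Build an OAI-PMH GetRecord URL for a DOI (hypothetical helper)."""
    query = {
        "verb": "GetRecord",
        "metadataPrefix": metadata_prefix,
        "identifier": f"doi:{doi}",
    }
    # Keep ':' and '/' unescaped so the URL matches the form shown in the record.
    return f"{OAI_ENDPOINT}?{urlencode(query, safe=':/')}"


def extract_titles(xml_text):
    """Return the text of every element whose local name is 'title'.

    Assumption: the metadata exposes the dataset title in such an element.
    Matching on the local name avoids hard-coding an XML namespace.
    """
    root = ET.fromstring(xml_text)
    return [
        el.text.strip()
        for el in root.iter()
        if el.tag.rsplit("}", 1)[-1] == "title" and el.text
    ]


def fetch_titles(doi):
    """Download the metadata record and extract its titles (needs network access)."""
    with urlopen(build_getrecord_url(doi), timeout=30) as resp:
        return extract_titles(resp.read().decode("utf-8"))
```

Calling fetch_titles("10.34894/GQXX3K") would request the metadata record for this dataset; what it returns depends on the live response.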

Provenance
Creator	Yu, Ze ; Pianzola, Federico (ORCID: 0000-0001-6634-121X); Tatar, Emin
Publisher	DataverseNL
Contributor	Groningen Digital Competence Centre; University of Groningen; Ze Yu; Emin Tatar; Federico Pianzola; DataverseNL Network
Publication Year	2024
Funding Reference	European Commission 101040938
Rights	info:eu-repo/semantics/restrictedAccess
OpenAccess	false
Contact	Groningen Digital Competence Centre (University of Groningen)

Representation
Resource Type	machine-readable text; Dataset
Format	text/csv; application/zip; text/plain
Size	16392; 148397; 82344; 1253056; 141589869; 270934845; 1719; 118084; 164626; 2008239; 6384512; 19374543; 14366424; 41535351; 52937107; 26360006
Version	1.0
Discipline	Agriculture, Forestry, Horticulture, Aquaculture; Agriculture, Forestry, Horticulture, Aquaculture and Veterinary Medicine; Humanities; Life Sciences; Social Sciences; Social and Behavioural Sciences; Soil Sciences