Categorical and Numeric Relations Dataset

Collaboratively constructed knowledge bases play an important role in information systems but are essentially always incomplete. A large number of models have therefore been developed for Knowledge Base Completion, the task of predicting new attributes of entities given partial descriptions of these entities. Virtually all of these models concentrate either on numeric attributes (what is Italy’s GDP?) or on categorical attributes (Tim Cook is the chairman of Apple). This dataset was created as part of a research experiment to develop a model for the joint prediction of numeric and categorical attributes, based on embeddings learned from textual occurrences of the entities in question. It consists of numeric and categorical relation tuples spanning 7 different domains, such as 'animal', 'country', and 'people'. These tuples were used to train and test a neural network framework for the above-mentioned task. All data in this dataset was scraped from Freebase.

*FORTHCOMING PUBLICATION: the paper corresponding to this dataset will be available soon*

Identifier
DOI https://doi.org/10.17026/dans-zxp-t7tf
Metadata Access https://phys-techsciences.datastations.nl/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.17026/dans-zxp-t7tf
Provenance
Creator T. Venkatesh
Publisher DANS Data Station Phys-Tech Sciences
Contributor Thejas Venkatesh; S. Pado (Universität Stuttgart); A. Gupta (Universität Stuttgart)
Publication Year 2019
Rights CC0 1.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/publicdomain/zero/1.0
OpenAccess true
Contact Thejas Venkatesh
Representation
Resource Type Dataset
Format text/plain; application/zip
Size 42996; 447987; 88133; 2825; 221752; 11025; 31811; 97574; 1352892; 34299; 270592; 8844; 133060; 668417; 32211; 44399; 90259; 222922; 452177; 2920; 11320; 33359; 45387; 14491; 11285; 374; 28726; 115030; 105099; 34521; 347616; 139827; 42704; 90066; 324009; 1125; 499; 115713; 115513; 14704; 30219; 11397; 46689; 474
Version 1.0
Discipline Other
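
The Metadata Access URL listed under Identifier follows the OAI-PMH GetRecord pattern, so it can be rebuilt from the DOI alone. The Python sketch below shows one way to do that. The function name `build_getrecord_url` and the constant names are hypothetical and chosen only for illustration; the endpoint, metadata prefix, and DOI are copied from this record. It only builds the URL and does not contact the server.

```python
from urllib.parse import urlencode

# Endpoint and DOI are copied from this record's "Metadata Access" and "DOI" lines.
OAI_ENDPOINT = "https://phys-techsciences.datastations.nl/oai"
DATASET_DOI = "10.17026/dans-zxp-t7tf"


def build_getrecord_url(doi, metadata_prefix="oai_datacite", endpoint=OAI_ENDPOINT):
    """Return an OAI-PMH GetRecord URL for a dataset identified by its DOI.

    Hypothetical helper: it only assembles the query string.
    """
    params = {
        "verb": "GetRecord",
        "metadataPrefix": metadata_prefix,
        "identifier": f"doi:{doi}",
    }
    # safe=":/" leaves the colon and slash of the DOI unescaped,
    # which matches the form of the URL shown in this record.
    return f"{endpoint}?{urlencode(params, safe=':/')}"


if __name__ == "__main__":
    print(build_getrecord_url(DATASET_DOI))
```

Passing the resulting URL to any HTTP client should return the DataCite-formatted metadata as XML, assuming the endpoint is still served at that address.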