Categorical and Numeric Relations Dataset

Dataset

DOI

Collaboratively constructed knowledge bases play an important role in information systems, but are essentially always incomplete. Thus, a large number of models has been developed for Knowledge Base Completion, the task of predicting new attributes of entities given partial descriptions of these entities. Virtually all of these models either concentrate on numeric attributes (what is Italy’s GDP?) or they concentrate on categorical attributes (Tim Cook is the chairman of Apple). This dataset was created as a part of a research experiment to develop a model for the joint prediction of numeric and categorical attributes based on embeddings learned from textual occurrences of the entities in question. This dataset consists of numeric and categorical relation tuples spanning from 7 different domains such as 'animal', 'country', 'people', etc. The tuples presented in this dataset have been used to train and test a neural network framework to perform the above mentioned task. All data presented in this dataset has been scraped from FreeBase.*FORTHCOMING PUBLICATION: the paper corresponding to this dataset will be available soon*

Identifier
DOI	https://doi.org/10.17026/dans-zxp-t7tf
Metadata Access	https://phys-techsciences.datastations.nl/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.17026/dans-zxp-t7tf

Provenance
Creator	T. Venkatesh
Publisher	DANS Data Station Phys-Tech Sciences
Contributor	Thejas Venkatesh; S. Pado (Universitat Stuttgart); A. Gupta (Universitat Stuttgart)
Publication Year	2019
Rights	CC0 1.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/publicdomain/zero/1.0
OpenAccess	true
Contact	Thejas Venkatesh

Representation
Resource Type	Dataset
Format	text/plain; application/zip
Size	42996; 447987; 88133; 2825; 221752; 11025; 31811; 97574; 1352892; 34299; 270592; 8844; 133060; 668417; 32211; 44399; 90259; 222922; 452177; 2920; 11320; 33359; 45387; 14491; 11285; 374; 28726; 115030; 105099; 34521; 347616; 139827; 42704; 90066; 324009; 1125; 499; 115713; 115513; 14704; 30219; 11397; 46689; 474
Version	1.0
Discipline	Other