The CLEF-IP 2013 Test Collection

DOI

CLEF-IP: Cross-Language Evaluation Forum - Intellectual Property The CLEF-IP track ran from 2009 to 2013 and aimed to investigate IR techniques for patent retrieval.The track utilizes a collection of more than 1.3M patent documents (~2.6 million files) derived from EPO (European Patent Office) sources and EuroPCT Applications (more than 400K documents) published by WIPO (World Intelectual Property Organization). The collection contains documents in English, French and German with at least 150,000 documents in each language, all published before 2001. There was one task in 2013: The first one was to find patent documents that are candidates to constitute prior art for a given claim taken from a patent document.  Files

Document CollectionThe corpus consists of two parts. The first one is a set of XML files representing a total of over 1.3 million patent documents - this collection is to be used for the first task.NOTE: the document collection is the same as the one published for CLEF-IP 2011, excluding images. Topics and AnswersBoth the training and the test topic sets contain also the relevance assessments for the topics.

Identifier
DOI https://doi.org/10.48436/nw2xc-41j75
Related Identifier IsDescribedBy https://doi.org/10.1007/978-3-642-40802-1_25
Related Identifier IsVersionOf https://doi.org/10.48436/2xs1a-sd524
Metadata Access https://researchdata.tuwien.ac.at/oai2d?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:researchdata.tuwien.ac.at:nw2xc-41j75
Provenance
Creator Piroi, Florina ORCID logo; Hanbury, Allan ORCID logo; Lupu, Mihai ORCID logo
Publisher TU Wien
Publication Year 2021
Rights Creative Commons Attribution Non Commercial Share Alike 3.0 Unported; https://creativecommons.org/licenses/by-nc-sa/3.0/legalcode
OpenAccess true
Contact tudata(at)tuwien.ac.at
Representation
Language English
Resource Type Dataset
Version 1.0.0
Discipline Other