Exploratory Patent Search with Faceted Search and Configurable Entity Mining

Searching for patents is usually a recall-oriented problem and depending on the patent search type, quite often a problem which is characterized by uncertainty and evolution or change of the information need. We propose an exploratory strategy for patent search that exploits the metadata already available in patents in addition to the results of clustering and entity mining that are performed at query time. The results (metadata, clusters and entities grouped in categories) can complement the ranked list of patents produced from the core search engine with useful information for the user (e.g. providing a concise overview of the search results) which are further exploited in a faceted and sessionbased interaction scheme that allows the users to focus their searches gradually and to change between search methods as their information need is better defined and their understanding of the topic evolves in response to found information. In addition, we propose the exploitation of Linked Data for specifying the entities of interest and for providing further information about the identified entities. The proposed system offers a dynamic, entity-based integration of patent documents, patents metadata and other external (semantic) resources.

[1]  Tomek Strzalkowski,et al.  Evaluating document retrieval in patent database: a preliminary report , 1997, CIKM '97.

[2]  Jungi Kim,et al.  Cluster-based patent retrieval , 2007, Inf. Process. Manag..

[3]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[4]  Leah S. Larkey,et al.  A patent search and classification system , 1999, DL '99.

[5]  Gerhard Weikum,et al.  WWW 2007 / Track: Semantic Web Session: Ontologies ABSTRACT YAGO: A Core of Semantic Knowledge , 2022 .

[6]  W. Bruce Croft,et al.  Transforming patents into prior-art queries , 2009, SIGIR.

[7]  Giovanni Maria Sacco,et al.  Dynamic Taxonomies and Faceted Search: Theory, Practice, and Experience , 2009, The Information Retrieval Series.

[8]  Jungi Kim,et al.  Cluster-Based Patent Retrieval Using International Patent Classification System , 2006, ICCPOL.

[9]  Makoto Iwayama,et al.  Proposal of two-stage patent retrieval method considering the claim structure , 2005, TALIP.

[10]  Yannis Tzitzikas,et al.  Exploiting Available Memory and Disk for Scalable Instant Overview Search , 2011, WISE.

[11]  Wim Vanderbauwhede,et al.  A survey of patent users: an analysis of tasks, behavior, search functionality and system requirements , 2010, IIiX.

[12]  Bo Gao,et al.  PatentMiner: topic-driven patent analysis and mining , 2012, KDD.

[13]  Yuen-Hsien Tseng,et al.  Text mining techniques for patent analysis , 2007, Inf. Process. Manag..

[14]  Allan Hanbury,et al.  1st international workshop on advances in patent information retrieval (AsPIRe'10) , 2010, SIGF.

[15]  W. Pratt,et al.  The usefulness of dynamically categorizing search results. , 2000, Journal of the American Medical Informatics Association : JAMIA.

[16]  Oren Etzioni,et al.  Web document clustering: a feasibility demonstration , 1998, SIGIR '98.

[17]  Mika Käki,et al.  Findex: improving search result use through automatic filtering categories , 2005, Interact. Comput..

[18]  Allan Hanbury,et al.  4th international workshop on patent information retrieval (PaIR'11) , 2011, CIKM '11.

[19]  Mika Käki,et al.  Findex: search result categories help users when document ranking fails , 2005, CHI.

[20]  Fulvio Corno,et al.  Review of the state-of-the-art in patent information and forthcoming evolutions in intelligent patent informatics , 2010 .

[21]  Yannis Tzitzikas,et al.  Exploratory Web Searching with Dynamic Taxonomies and Results Clustering , 2009, ECDL.

[22]  Yannis Tzitzikas,et al.  STC+ and NM-STC: Two Novel Online Results Clustering Methods for Web Searching , 2009, WISE.

[23]  Andreas Rauber,et al.  Improving Retrievability of Patents in Prior-Art Search , 2010, ECIR.

[24]  Steven Foster,et al.  On the role of classification in patent invalidity searches , 2009 .

[25]  Yannis Tzitzikas,et al.  Web Searching with Entity Mining at Query Time , 2012, IRFC.

[26]  Yannis Tzitzikas,et al.  On exploiting static and dynamically mined metadata for exploratory web searching , 2011, Knowledge and Information Systems.

[27]  Mika Käki,et al.  Information search and re-access strategies of experienced web users , 2005, WWW '05.

[28]  Barry Bishop,et al.  FactForge: A fast track to the Web of data , 2011, Semantic Web.