Large-Scale Logical Retrieval: Technology for Semantic Modelling of Patent Search

Patent retrieval has emerged as an important application of information retrieval (IR). It is considered to be a complex search task because patent search requires an extended chain of reasoning beyond basic document retrieval. As logic-based IR is capable of modelling both document retrieval and decision-making, it can be seen as a suitable framework for modelling patent data and search strategies. In particular, we demonstrate logic-based modelling for semantic data in patent documents and retrieval strategies which are tailored to patent search and exploit more than just the text in the documents. Given the expressiveness of logic-based IR, however, there is an attendant compromise on issues of scalability and quality. To address these trade-offs we suggest how a parallelised architecture can ensure that logical IR scales in spite of its expressiveness.

[1]  Norbert Fuhr Optimum Database Selection in Networked IR , 1996, Networked Information Retrieval.

[2]  Mounia Lalmas,et al.  Information Retrieval: Uncertainty and Logics: Advanced Models for the Representation and Retrieval of Information , 1998 .

[3]  Peter Ingwersen,et al.  Developing a Test Collection for the Evaluation of Integrated Search , 2010, ECIR.

[4]  Norbert Fuhr,et al.  Information Retrieval with Probabilistic Datalog , 1998 .

[5]  Hans-Jörg Schek,et al.  A Relational Object Model , 1990, ICDT.

[6]  Norbert Fuhr,et al.  Proceedings of the SIGIR'96 Workshop on Networked Information Retrieval, ETH Zurch, Switzerland, August 22, 1996 , 1996, Networked Information Retrieval.

[7]  David Poole,et al.  Probabilistic Horn Abduction and Bayesian Networks , 1993, Artif. Intell..

[8]  Matthew Richardson,et al.  Markov logic networks , 2006, Machine Learning.

[9]  Hany Azzam,et al.  Logic-Based Retrieval: Technology for Content-Oriented and Analytical Querying of Patent Data , 2010, IRFC.

[10]  Hany Azzam,et al.  Modelling retrieval models in a probabilistic relational algebra with a new operator: the relational Bayes , 2007, The VLDB Journal.

[11]  Michael Stonebraker,et al.  Object-Relational DBMSs: Tracking the Next Great Wave , 1998 .

[12]  Gabriella Kazai,et al.  Modelling Anchor Text Retrieval in Book Search based on Back-of-Book Index , 2008 .

[13]  Hany Azzam,et al.  A case for probabilistic logic for scalable patent retrieval , 2009, PaIR@CIKM.

[14]  Yoshitaka Kameya,et al.  Parameter Learning of Logic Programs for Symbolic-Statistical Modeling , 2001, J. Artif. Intell. Res..

[15]  Norbert Fuhr,et al.  Probabilistic Datalog—a logic for powerful retrieval methods , 1995, SIGIR '95.

[16]  Jamie Callan,et al.  DISTRIBUTED INFORMATION RETRIEVAL , 2002 .