A hybrid keyword and patent class methodology for selecting relevant sets of patents for a technological field

This paper presents a relatively simple, objective and repeatable method for selecting sets of patents that are representative of a specific technological domain. The methodology consists of using search terms to locate the most representative international and US patent classes and determines the overlap of those classes to arrive at the final set of patents. Five different technological fields (computed tomography, solar photovoltaics, wind turbines, electric capacitors, electrochemical batteries) are used to test and demonstrate the proposed method. Comparison against traditional keyword searches and individual patent class searches shows that the method presented in this paper can find a set of patents with more relevance and completeness and no more effort than the other two methods. Follow on procedures to potentially improve the relevancy and completeness for specific domains are also defined and demonstrated. The method is compared to an expertly selected set of patents for an economic domain, and is shown to not be a suitable replacement for that particular use case. The paper also considers potential uses for this methodology and the underlying techniques as well as limitations of the methodology.

[1]  Charles Oppenheim,et al.  Patent citation analysis , 1997, Scientometrics.

[2]  Jacques Michel,et al.  Patent citation analysis.A closer look at the basic input data from patent search reports , 2001, Scientometrics.

[3]  Andrei Popescu-Belis,et al.  Automatic content linking: speech-based just-in-time retrieval for multimedia archives , 2010, SIGIR '10.

[4]  Paola Criscuolo,et al.  The 'home advantage' effect and patent families. A comparison of OECD triadic patents, the USPTO and the EPO , 2006, Scientometrics.

[5]  Sumio Fujita Revisiting Document Length Hypotheses: NTCIR-4 CLIR and Patent Experiments at Patolis , 2004, NTCIR.

[6]  Christopher L. Magee,et al.  A Framework for Analyzing the Underlying Inventions That Drive Technical Improvements in a Specific Technological Field , 2012 .

[7]  Tetsuya Ishikawa,et al.  Associative document retrieval by query subtopic analysis and its application to invalidity patent search , 2004, CIKM '04.

[8]  Allan Hanbury,et al.  Multidisciplinary Information Retrieval , 2011, Lecture Notes in Computer Science.

[9]  Shyh-Jen Wang,et al.  The state of art patent search with an example of human vaccines , 2011, Human vaccines.

[10]  Mostafa Keikha,et al.  Building Queries for Prior-Art Search , 2011, IRFC.

[11]  Allan Hanbury,et al.  Advances in Multidisciplinary Retrieval, First Information Retrieval Facility Conference, IRFC 2010, Vienna, Austria, May 31, 2010. Proceedings , 2010, IRFC.

[12]  Walid Magdy,et al.  PRES: a score metric for evaluating recall-oriented information retrieval applications , 2010, SIGIR.

[13]  Wim Vanderbauwhede,et al.  A survey of patent users: an analysis of tasks, behavior, search functionality and system requirements , 2010, IIiX.

[14]  M. Trajtenberg A Penny for Your Quotes : Patent Citations and the Value of Innovations , 1990 .

[15]  R. S. Campbell,et al.  Patent trends as a technological forecasting tool , 1983 .

[16]  Eva D'hondt,et al.  Lexical issues of a syntactic approach to interactive patent retrieval , 2009 .

[17]  Leah S. Larkey,et al.  A patent search and classification system , 1999, DL '99.

[18]  Kristine H. Atkinson Toward a more rational patent search paradigm , 2008, PaIR '08.

[19]  Nicholas J. Belkin,et al.  Proceedings of the third symposium on Information interaction in context , 2010, IIiX 2010.

[20]  C. J. van Rijsbergen,et al.  Knowledge Modeling in Prior Art Search , 2010, IRFC.

[21]  Martin G. Moehrle,et al.  A new instrument for technology monitoring: novelty in patents measured by semantic patent analysis , 2012, Scientometrics.

[22]  Doreen Alberts,et al.  Introduction to Patent Searching , 2011, Current Challenges in Patent Information Retrieval.

[23]  W. Bruce Croft,et al.  Automatic query generation for patent search , 2009, CIKM.

[24]  Mead McKim,et al.  Boston Public Library , 1895 .

[25]  Edward A. Fox,et al.  Proceedings of the Fourth ACM conference on Digital Libraries, August 11-14, 1999, Berkeley, CA, USA , 1999 .

[26]  Atsushi Fujii Enhancing patent retrieval by citation analysis , 2007, SIGIR.

[27]  Laurent Romary,et al.  Experiments with Citation Mining and Key-Term Extraction for Prior Art Search , 2010, CLEF.

[28]  魏屹东,et al.  Scientometrics , 2018, Encyclopedia of Big Data.