Using Classification Code Hierarchies for Patent Prior Art Searches

Searches in patent collections to determine if a given patent application has related prior art patents is non-trivial and often requires extensive manpower. When time is constrained, an automatically generated, ranked list of prior art patents associated with a given patent application decreases search costs and improves search efficiency. One may view the discovery of this prior art patent set as a problem of finding patents ‘related’ to the patent application. To accomplish this, we examine whether semantic relations between patent classification codes can aid in the recognition of related prior art patents. We explore similarity measures for hierarchically ordered patent classes and subclasses for this purpose. Next, we examine various patent feature-weighting schemes to achieve the best similarities between our patent applications and related prior art patents. Finally, we provide a method and demonstrate that patent prior art searches can successfully be used as an aid in patent ranking.

[1]  Steven Foster,et al.  On the role of classification in patent invalidity searches , 2009 .

[2]  Kazuya Konishi Query Terms Extraction from Patent Document for Invalidity Search , 2005, NTCIR.

[3]  Piotr Indyk,et al.  Enhanced hypertext categorization using hyperlinks , 1998, SIGMOD '98.

[4]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[5]  Makoto Iwayama,et al.  Overview of the Patent Mining Task at the NTCIR-7 Workshop , 2008, NTCIR.

[6]  Graeme Hirst,et al.  Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures , 2004 .

[7]  Patrick Ruch,et al.  Simple Pre and Post Processing Strategies for Patent Searching in CLEF Intellectual Property Track 2009 , 2009, CLEF.

[8]  Padmini Srinivasan,et al.  TREC Blog and TREC Chem: A View from the Corn Fields , 2009, TREC.

[9]  Falk Scholer,et al.  User performance versus precision measures for simple search tasks , 2006, SIGIR.

[10]  Luo Si,et al.  Strategies for Effective Chemical Information Retrieval , 2009, TREC.

[11]  Edward A. Fox,et al.  Proceedings of the Fourth ACM conference on Digital Libraries, August 11-14, 1999, Berkeley, CA, USA , 1999 .

[12]  Christiane Fellbaum,et al.  Combining Local Context and Wordnet Similarity for Word Sense Identification , 1998 .

[13]  Patrick Ruch,et al.  Report on the TREC 2009 Experiments: Chemical IR Track , 2009, TREC.

[14]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[15]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[16]  Luo Si,et al.  A language modeling framework for resource selection and results merging , 2002, CIKM '02.

[17]  Louis Falasco United States patent classification: system organization , 2002 .

[18]  Makoto Iwayama,et al.  The patent mining task in the seventh NTCIR workshop , 2008, PaIR '08.

[19]  Padmini Srinivasan,et al.  Comparison of IPC and USPC classification systems in patent prior art searches , 2010, PaIR '10.

[20]  A. Törcsvári,et al.  Automated categorization in the international patent classification , 2003, SIGF.

[21]  Noriko Kando,et al.  Overview of the Patent Retrieval Task at the NTCIR-6 Workshop , 2007, NTCIR.

[22]  Patrick Ruch,et al.  Exploring a Wide Range of Simple Pre and Post Processing Strategies for Patent Searching in CLEF IP 2009 , 2009, CLEF.

[23]  Radford M. Neal,et al.  Improving Classification When a Class Hierarchy is Available Using a Hierarchy-Based Prior , 2005, math/0510449.

[24]  Sudha Ram,et al.  Proceedings of the 1997 ACM SIGMOD international conference on Management of data , 1997, ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems.

[25]  Leah S. Larkey,et al.  A patent search and classification system , 1999, DL '99.

[26]  Roy Rada,et al.  Development and application of a metric on semantic nets , 1989, IEEE Trans. Syst. Man Cybern..