Measures for textual patent similarities: a guided way to select appropriate approaches

Measuring textual similarity between patents is crucial for important tasks in patent management, such as prior art analysis, infringement analysis, and patent mapping. In this paper, the general theory of similarity measurement is applied to the field of patents, using solitary concepts as the basic textual elements of patents. After unfolding the term ‘similarity’ on a content-oriented and a formal level and presenting a basic model of understanding, the paper outlines a segmented approach covering the measurement of the underlying variables, the similarity coefficients, and the criteria-related profiles of their combinations. The result is a guided way to apply textual patent similarity measures, of interest to both theory and practice.
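
As a minimal illustration of how a similarity coefficient is built from underlying variables, the following Python sketch treats each patent as a set of concepts and computes two widely used coefficients, the Jaccard index and the set-based cosine. The choice of these two coefficients, the function names, and the example concept sets are assumptions made for illustration; the abstract does not state which coefficients the paper recommends.

```python
from math import sqrt


def jaccard(a: set[str], b: set[str]) -> float:
    """Jaccard index: shared concepts divided by all distinct concepts."""
    union = a | b
    if not union:
        return 0.0  # convention chosen here for two empty concept sets
    return len(a & b) / len(union)


def cosine(a: set[str], b: set[str]) -> float:
    """Set-based cosine: shared concepts over the geometric mean of set sizes."""
    if not a or not b:
        return 0.0  # convention chosen here when either set is empty
    return len(a & b) / sqrt(len(a) * len(b))


# Hypothetical concept sets extracted from two patents (illustrative only).
patent_a = {"dna chip", "hybridization", "probe array", "fluorescent label"}
patent_b = {"dna chip", "probe array", "microfluidic channel"}

print(f"Jaccard: {jaccard(patent_a, patent_b):.3f}")
print(f"Cosine:  {cosine(patent_a, patent_b):.3f}")
```

Both coefficients rest on the same underlying variables (the number of shared concepts and the sizes of the two concept sets) but normalize them differently, which is why the choice of coefficient can change how a pair of patents is ranked.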