Similarity Analysis of Patent Claims Using Natural Language Processing Techniques

Claims typically found at the end of a patent document are one of the key elements of a patent and define the boundaries or scope of protection conferred by a patent. Claims of related patents also need to be read and reviewed carefully by an inventor or a patent attorney at the time of drafting a patent application. We present a method and a tool to do a claim similarity analysis between two different patents based on natural language processing techniques. The technique proposed in this paper relies on computing similarity between two claims based on syntactic and semantic matching of the natural language text describing the claims. We present results of experiments performed on patent claim data obtained from patents published on Google patents website. The motivation behind the research presented in this paper is to build patent processing tools to increase the overall productivity of a patent analyst or a patent attorney while doing claims infringement, validity and quality analysis.

[1]  Julie Beth Lovins,et al.  Development of a stemming algorithm , 1968, Mech. Transl. Comput. Linguistics.

[2]  Makoto Iwayama,et al.  Patent Claim Processing for Readability - Structure Analysis and Term Explanation , 2003, ACL 2003.

[3]  John Murphy,et al.  Using WordNet as a Knowledge Base for Measuring Semantic Similarity between Words , 1994 .

[4]  Yuen-Hsien Tseng,et al.  Text mining techniques for patent analysis , 2007, Inf. Process. Manag..

[5]  D. Levicky,et al.  Digital Watermarking in Wavelet Transform Domain , 2001 .

[6]  Ingemar J. Cox,et al.  Watermarking as communications with side information , 1999, Proc. IEEE.

[7]  David J. Fleet,et al.  Embedding invisible information in color images , 1997, Proceedings of International Conference on Image Processing.

[8]  Christopher D. Manning,et al.  Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger , 2000, EMNLP.

[9]  Akihiro Yamamoto,et al.  A digital watermark based on the wavelet transform and its robustness on image compression , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[10]  Svetlana Sheremetyeva Natural Language Analysis of Patent Claims , 2003, ACL 2003.

[11]  Saraju P. Mohanty,et al.  Watermarking of Digital Images , 1999 .

[12]  Yiwei Wang,et al.  A wavelet-based watermarking algorithm for ownership verification of digital images , 2002, IEEE Trans. Image Process..

[13]  Noriko Kando,et al.  Introduction to the special issue on patent processing , 2007, Inf. Process. Manag..

[14]  Dan Klein,et al.  Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.