A two-level parser for patent claim parsing

The adaptability of a conventional parser to patent claim parsing is improved.Six claim syntax peculiarities that increase the difficulty of claim parsing are highlighted.New dependency syntax is proposed to handle Long Distance Dependencies.The efficiency of parsing is obviously improved with a segmentation strategy.The improvement on effectiveness of parsing is also promising in a model extraction problem. Patent claim parsing can contribute in many patent-related applications, such as patent search, information extraction, machine translation and summarization. However, patent claim parsing is difficult due to the special structure of patent claims. To overcome this difficulty, the challenges facing the patent claim parsing were first investigated and the peculiarities of claim syntax that obstruct dependency parsing were highlighted. To handle these peculiarities, this study proposes a new two-level parser, in which a conventional parser is imbedded. A patent claim is pre-processed in order to remove peculiarities before passed to the conventional parser. The process is based on a new dependency-based syntax called Independent Claim Segment Dependency Syntax (ICSDS). This two-lever parser has demonstrated promising improvement for patent claim parsing on both effectiveness and efficiency over the conventional parser.

[1]  Von-Wun Soo,et al.  Comparing the Conceptual Graphs Extracted from Patent Claims , 2008, 2008 IEEE International Conference on Sensor Networks, Ubiquitous, and Trustworthy Computing (sutc 2008).

[2]  Davide Russo,et al.  Searching in Cooperative Patent Classification: Comparison between keyword and concept-based search , 2013, Adv. Eng. Informatics.

[3]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[4]  Gabriela Ferraro,et al.  Towards the derivation of verbal content relations from patent claims using deep syntactic structures , 2011, Knowl. Based Syst..

[5]  Alon Lavie,et al.  Parser Combination by Reparsing , 2006, NAACL.

[6]  Von-Wun Soo,et al.  Extract conceptual graphs from plain texts in patent claims , 2012, Eng. Appl. Artif. Intell..

[7]  Samee U. Khan,et al.  A literature review on the state-of-the-art in patent analysis , 2014 .

[8]  Dale Schuurmans,et al.  Simple Training of Dependency Parsers via Structured Boosting , 2007, IJCAI.

[9]  Yongtae Park,et al.  How to assess patent infringement risks: a semantic patent claim analysis using dependency relationships , 2013, Technol. Anal. Strateg. Manag..

[10]  Joakim Nivre,et al.  Deterministic Dependency Parsing of English Text , 2004, COLING.

[11]  Rna Rudi Bekkers,et al.  An Empirical Study on the Determinants of Essential Patent Claims in Compatibility Standards , 2011 .

[12]  Michael Dittenbach,et al.  Patent Claim Decomposition for Improved Information Extraction , 2011, Current Challenges in Patent Information Retrieval.

[13]  Jason Eisner,et al.  Three New Probabilistic Models for Dependency Parsing: An Exploration , 1996, COLING.

[14]  Svetlana Sheremetyeva Natural Language Analysis of Patent Claims , 2003, ACL 2003.

[15]  Yuji Matsumoto,et al.  Statistical Dependency Analysis with Support Vector Machines , 2003, IWPT.

[16]  Suzan Verberne,et al.  Quantifying the Challenges in Parsing Patent Claims , 2010, PaIR 2010.

[17]  Mark Steedman,et al.  Proceedings of the 2003 conference on Empirical methods in natural language processing , 2003 .

[18]  D. Giles,et al.  A survival analysis of the approval of US patent applications , 2011 .

[19]  Riccardo Apreda,et al.  Automatic extraction of function-behaviour-state information from patents , 2013, Adv. Eng. Informatics.

[20]  Gabriela Ferraro,et al.  Segmentation of patent claims for improving their readability , 2014, PITR@EACL.

[21]  Allan Hanbury,et al.  1st international workshop on advances in patent information retrieval (AsPIRe'10) , 2010, SIGF.

[22]  Joakim Nivre,et al.  Dependency Grammar and Dependency Parsing , 2005 .

[23]  Joan Codina,et al.  The challenge of syntactic dependency parsing adaptation for the patent domain , 2013 .

[24]  Joakim Nivre,et al.  Integrating Graph-Based and Transition-Based Dependency Parsers , 2008, ACL.

[25]  Stephen Clark,et al.  A Tale of Two Parsers: Investigating and Combining Graph-based and Transition-based Dependency Parsing , 2008, EMNLP.