Tuple Refinement Method Based on Relationship Keyword Extension

Entity relation extraction is mainly focused on researching extraction approaches and improving precision of the extraction results. Although many efforts have been made on this field, there still exist some problems. In order to improve the performance of extracting entity relation, we propose a tuple refinement method based on relationship keyword extension. Firstly, we utilize the diversity of relationships to extend relationship keywords, and then, use the redundancy of network information to extract the second entity based on the principle of proximity and the predefined entity type. Under open web environment, we take four relationships in the experiments and adopt bootstrapping algorithm to acquire the initial tuple set. Three tuple refinement methods are compared: refinement method with threshold set, refinement method with relation extension and refinement method without relation extension. The average F-scores of the experimental results show the proposed method can effectively improve the performance of entity relation extraction.

[1]  Ralph Grishman,et al.  Extracting Relations with Integrated Information Using Kernel Methods , 2005, ACL.

[2]  Luis Gravano,et al.  Snowball: extracting relations from large plain-text collections , 2000, DL '00.

[3]  Ch Chen,et al.  Pattern recognition and artificial intelligence , 1976 .

[4]  Gerhard Paass,et al.  Semantic relation extraction with kernels over typed dependency trees , 2010, KDD.

[5]  Luis Gravano,et al.  Extracting Relations from Large Plain-Text Collections , 1999 .

[6]  Danushka Bollegala,et al.  Relational duality: unsupervised extraction of semantic relations between entities on the web , 2010, WWW '10.

[7]  Eugene Agichtein Confidence Estimation Methods for Partially Supervised Information Extraction , 2006, SDM.

[8]  Andrew McCallum,et al.  Confidence Estimation for Information Extraction , 2004, NAACL.

[9]  Li Sheng Automated Entity Relation Tuple Extraction Using Web Mining , 2007 .

[10]  Eugene Agichtein Confidence Estimation Methods for Partially Supervised Relation Extraction , 2006 .

[11]  Sergey Brin,et al.  Extracting Patterns and Relations from the World Wide Web , 1998, WebDB.

[12]  Ralph Grishman,et al.  Information Extraction: Techniques and Challenges , 1997, SCIE.

[13]  Gottfried Vossen,et al.  The World Wide Web and Databases , 2001, Lecture Notes in Computer Science.

[14]  Wendy G. Lehnert,et al.  Information extraction , 1996, CACM.

[15]  Neel Sundaresan,et al.  Mining the Web for acronyms using the duality of patterns and relations , 1999, WIDM '99.

[16]  Kevin Chen-Chuan Chang,et al.  Searching patterns for relation extraction over the web: rediscovering the pattern-relation duality , 2011, WSDM '11.