Expanded Semantic Graph Representation for Matching Related Information of Interest across Free Text Documents

This research proposes an expanded semantic graph definition that serves as a basis for an expanded semantic graph representation and graph matching approach. This representation separates the content and context and adds a number of semantic structures that encapsulate inferred information. The expanded semantic graph approach facilitates finding additional matches, identifying and eliminating poor matches, and prioritizing matches based on how much new knowledge is provided. By focusing on information of interest, doing pre-processing, and reducing processing requirements, the approach is applicable to problems where related information of interest is sought across a massive body of free text documents. Key aspects of the approach include (1) expanding the nodes and edges through inference using DL-Safe rules, abductive hypotheses, and syntactic patterns, (2) separating semantic content into nodes and semantic context into edges, and (3) applying relatedness measures on a node, edge, and sub graph basis. Results from tests using a ground-truthed subset of a large dataset of law enforcement investigator emails illustrate the benefits of these approaches.

[1]  Laura Zager,et al.  Graph similarity and matching , 2005 .

[2]  Edmond Chow,et al.  Knowledge Representation Issues in Semantic Graphs for Relationship Detection , 2005, AAAI Spring Symposium: AI Technologies for Homeland Security.

[3]  Brian Gallagher,et al.  Matching Structure and Semantics: A Survey on Graph-Based Pattern Matching , 2006, AAAI Fall Symposium: Capturing and Using Patterns for Evidence Detection.

[4]  L. Hébert Dispositifs pour l'analyse des textes et des images : introduction à la sémiotique appliquée , 2007 .

[5]  Endika Bengoetxea,et al.  Inexact Graph Matching Using Estimation of Distribution Algorithms , 2002 .

[6]  Bhavani M. Thuraisingham,et al.  Identification of related information of interest across free text documents , 2011, Proceedings of 2011 IEEE International Conference on Intelligence and Security Informatics.

[7]  Marko Grobelnik,et al.  Extracting Summary Sentences Based on the Document Semantic Graph , 2005 .

[8]  W. Campbell,et al.  THE UNIVERSITY OF TEXAS AT DALLAS , 2004 .

[9]  Latifur Khan,et al.  Law Enforcement Ontology for Identification of Related Information of Interest Across Free Text Dcouments , 2011, 2011 European Intelligence and Security Informatics Conference.

[10]  Bhavani M. Thuraisingham,et al.  Extraction of expanded entity phrases , 2011, Proceedings of 2011 IEEE International Conference on Intelligence and Security Informatics.

[11]  Bhavani M. Thuraisingham,et al.  Measuring Relatedness and Augmentation of Information of Interest within Free Text Law Enforcement Documents , 2012, 2012 European Intelligence and Security Informatics Conference.

[12]  Rakesh Nagi,et al.  Incremental graph matching for Situation Awareness , 2009, 2009 12th International Conference on Information Fusion.