Summarizing conceptual descriptions using knowledge representations

Summarizing a body of information is a complex task which mainly depends on the ability to distinguish important information and to condense notions through abstraction. Considering a knowledge representation partially ordering concepts into a directed acyclic graph, this study focuses on the problem of summarizing several human descriptions expressed through sets of concepts. We formally define the problem of summarization in this context and we propose a model mimicking a Human-like Intelligence for scoring alternatives with regard to a specific objective. Several interesting theoretical results related to this problem (e.g. for optimization) are also given. Finally, the evaluation of the proposed approach performed in the domain of odor analysis highlights the benefits of our proposal and shows how it could be used to automatize time-consuming expert summarizing processes. Source code implementing the proposed approach as well as datasets are made available to the community.1

[1]  Thomas L. Griffiths,et al.  Hierarchical Topic Models and the Nested Chinese Restaurant Process , 2003, NIPS.

[2]  Abdelhak Imoussaten,et al.  On the consideration of a bring-to-mind model for computing the Information Content of concepts defined into ontologies , 2015, 2015 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[3]  Claudio Carpineto,et al.  Full-Subtopic Retrieval with Keyphrase-Based Search Results Clustering , 2009, 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology.

[4]  Jean-Daniel Zucker,et al.  Abstraction in Artificial Intelligence and Complex Systems , 2013, Springer New York.

[5]  Heiner Stuckenschmidt,et al.  Handbook on Ontologies , 2004, Künstliche Intell..

[6]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[7]  Frank D. Wood,et al.  Hierarchically Supervised Latent Dirichlet Allocation , 2011, NIPS.

[8]  Nicolas Fiorini Semantic similarities at the core of generic indexing and clustering approaches. (Les similarités sémantiques au cœur d'approches génériques d'indexation et de catégorisation) , 2015 .

[9]  Mohamed Nadif,et al.  Beyond cluster labeling: Semantic interpretation of clusters' contents using a graph representation , 2014, Knowl. Based Syst..

[10]  Sylvie Ranwez,et al.  Semantic Similarity from Natural Language and Ontology Analysis , 2015, Synthesis Lectures on Human Language Technologies.

[11]  Tony Veale,et al.  An Intrinsic Information Content Metric for Semantic Similarity in WordNet , 2004, ECAI.

[12]  John D. Lafferty,et al.  Correlated Topic Models , 2005, NIPS.

[13]  Sylvie Ranwez,et al.  Subontology Extraction Using Hyponym and Hypernym Closure on is-a Directed Acyclic Graphs , 2012, IEEE Transactions on Knowledge and Data Engineering.

[14]  Rishabh K. Iyer,et al.  Summarization of Multi-Document Topic Hierarchies using Submodular Mixtures , 2015, ACL.

[15]  Jean-Daniel Zucker,et al.  A grounded theory of abstraction in artificial intelligence. , 2003, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.