Query Specific Semantic Matcher and Summarization

Creating trained models that semantically represent the corpus and summarizing relevant portions according to the user query remains a challenging task. We propose a semantic matcher based approach for identifying the relevant sentences in the corpus pertaining to the prominent entities in the corpus and their relationships in the user query. Trained models are created from the corpus by leveraging word-embeddings and are semantically searched to retrieve top results of the user query. Top matched sentences are analyzed for coherence based on semantic chains leveraging Semantic Role Labeler and are summarized, where entity relationships are exploited. The approach is applied and tested on user query based Ticketing System where policy documents in IT industry are used as corpus and specific summaries in the form of steps to be followed are created according to the user query.

[1]  Isabelle Augenstein,et al.  A Supervised Approach to Extractive Summarisation of Scientific Papers , 2017, CoNLL.

[2]  Rui Zhang,et al.  Graph-based Neural Multi-Document Summarization , 2017, CoNLL.

[3]  Jagadeesh Jagarlamudi,et al.  Summarization Approaches Based on Document Probability Distributions , 2009, PACLIC.

[4]  Noah A. Smith,et al.  Toward Abstractive Summarization Using Semantic Representations , 2018, NAACL.

[5]  Duy Duc An Bui,et al.  Extractive text summarization system to aid data extraction from full text in systematic review development , 2016, J. Biomed. Informatics.

[6]  Balaji Jagan,et al.  A Question Answering Model Based on Semantic Matcher for Support Ticketing System , 2018 .

[7]  Claire Cardie,et al.  A Sentence Compression Based Framework to Query-Focused Multi-Document Summarization , 2013, ACL.

[8]  Nathan Schneider,et al.  The NLTK FrameNet API: Designing for Discoverability with a Rich Linguistic Resource , 2017, EMNLP.

[9]  Krys J. Kochut,et al.  Text Summarization Techniques: A Brief Survey , 2017, International Journal of Advanced Computer Science and Applications.

[10]  Piji Li,et al.  Abstractive Multi-Document Summarization via Phrase Selection and Merging , 2015, ACL.

[11]  Zongda Wu,et al.  A topic modeling based approach to novel document automatic summarization , 2017, Expert Syst. Appl..

[12]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[13]  Inderjit S. Dhillon,et al.  Kernel k-means: spectral clustering and normalized cuts , 2004, KDD.

[14]  Oren Etzioni,et al.  Towards Coherent Multi-Document Summarization , 2013, NAACL.

[15]  Mirella Lapata,et al.  Ranking Sentences for Extractive Summarization with Reinforcement Learning , 2018, NAACL.