AutoSUM: Automating Feature Extraction and Multi-user Preference Simulation for Entity Summarization

With the growth of knowledge graphs, entity descriptions are becoming extremely lengthy. Entity summarization task, aiming to generate diverse, comprehensive and representative summaries for entities, has received an increasing interest recently. In most previous methods, features are usually extracted by the hand-crafted templates. Then the feature selection and multi-user preference simulation take place, depending too much on human expertise. In this paper, a novel integration method called AutoSUM is proposed for automatic feature extraction and multi-user preference simulation to overcome the drawbacks of previous methods. There are two modules in AutoSUM: extractor and simulator. The extractor module operates automatic feature extraction based on a BiLSTM with a combined input representation including word embeddings and graph embeddings. Meanwhile, the simulator module automates multi-user preference simulation based on a well-designed two-phase attention mechanism (i.e., entity-phase attention and user-phase attention). Experimental results demonstrate that AutoSUM produces the state-of-the-art performance on two widely used datasets (i.e., DBpedia and LinkedMDB) in both F-measure and MAP.

[1]  Yuzhong Qu,et al.  CD at ENSEC 2016: Generating Characteristic and Diverse Entity Summaries , 2016, SumPre@ESWC.

[2]  W. Bruce Croft,et al.  A Deep Relevance Matching Model for Ad-hoc Retrieval , 2016, CIKM.

[3]  Hamid R. Arabnia,et al.  Combining Word Embedding and Knowledge-Based Topic Modeling for Entity Summarization , 2018, 2018 IEEE 12th International Conference on Semantic Computing (ICSC).

[4]  Mariano P. Consens Managing Linked Data on the Web: The LinkedMDB Showcase , 2008, 2008 Latin American Web Conference.

[5]  Min Yang,et al.  A Novel Top-N Recommendation Approach Based on Conditional Variational Auto-Encoder , 2019, PAKDD.

[6]  Andreas Harth,et al.  FusE , 2019, ACM Trans. Web.

[7]  Alex Graves,et al.  Generating Sequences With Recurrent Neural Networks , 2013, ArXiv.

[8]  Marcin Sydow,et al.  The notion of diversity in graphical entity summarisation on semantic knowledge graphs , 2013, Journal of Intelligent Information Systems.

[9]  Chandan K. Reddy,et al.  An Attentive Spatio-Temporal Neural Model for Successive Point of Interest Recommendation , 2019, PAKDD.

[10]  Zhibing Liu,et al.  MPSUM: Entity Summarization with Predicate-based Matching , 2020, ArXiv.

[11]  Nelia Lasierra,et al.  LinkSUM: Using Link Analysis to Summarize Entity Data , 2016, ICWE.

[12]  Hao Liu,et al.  User Preference-Aware Review Generation , 2019, PAKDD.

[13]  Jason Weston,et al.  Translating Embeddings for Modeling Multi-relational Data , 2013, NIPS.

[14]  Hamid R. Arabnia,et al.  ES-LDA: Entity Summarization using Knowledge-based Topic Modeling , 2017, IJCNLP.

[15]  Yuzhong Qu,et al.  RELIN: Relatedness and Informativeness-Based Centrality for Entity Summarization , 2011, International Semantic Web Conference.

[16]  Jens Lehmann,et al.  DBpedia - A crystallization point for the Web of Data , 2009, J. Web Semant..

[17]  Guillaume Lample,et al.  Neural Architectures for Named Entity Recognition , 2016, NAACL.

[18]  Chang Liu,et al.  CNN‐based reference comparison method for classifying bare PCB defects , 2018, The Journal of Engineering.

[19]  Amit P. Sheth,et al.  Gleaning Types for Literals in RDF Triples with Application to Entity Summarization , 2016, ESWC.

[20]  Gong Cheng,et al.  Entity Summarization: State of the Art and Future Challenges , 2019, ArXiv.

[21]  Christopher D. Manning,et al.  Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.

[22]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[23]  George Hripcsak,et al.  Technical Brief: Agreement, the F-Measure, and Reliability in Information Retrieval , 2005, J. Am. Medical Informatics Assoc..

[24]  Marcin Sydow,et al.  DIVERSUM: Towards diversified summarisation of entities in knowledge graphs , 2010, 2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW 2010).

[25]  Achim Rettinger,et al.  Browsing DBpedia Entities with Summaries , 2014, ESWC.

[26]  Yaxin Liu,et al.  ESA: Entity Summarization with Attention , 2019, ArXiv.

[27]  Mohiuddin Ahmed Data summarization: a survey , 2018, Knowledge and Information Systems.

[28]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[29]  Danai Koutra,et al.  Graph Summarization Methods and Applications , 2016, ACM Comput. Surv..

[30]  Amit P. Sheth,et al.  FACES: Diversity-Aware Entity Summarization Using Incremental Hierarchical Conceptual Clustering , 2015, AAAI.