A multi-level text representation model within background knowledge based on human cognitive process for big data analysis

Text representation is part of the most fundamental work in text comprehension, processing, and search. Various kinds of work has been proposed to mine the semantics in texts and then to represent them. However, most of them only focus on how to mine semantics from the text itself, while few of them take the background knowledge into consideration, which is very important to text understanding. In this paper, on the basis of human cognitive process, we propose a multi-level text representation model within background knowledge, called TRMBK. It is composed of three levels, which are machine surface code, machine text base and machine situational model. All of them are able to be constructed automatically to acquire semantics both inside and outside of the texts. Simultaneously, we also propose a method to establish background knowledge automatically and offer supports for the current text comprehension. Finally, experiments and comparisons have been presented to show the better performance of TRMBK.

[1]  Andrew McCallum,et al.  Topic and Role Discovery in Social Networks with Experiments on Enron and Academic Email , 2007, J. Artif. Intell. Res..

[2]  G. A. Miller THE PSYCHOLOGICAL REVIEW THE MAGICAL NUMBER SEVEN, PLUS OR MINUS TWO: SOME LIMITS ON OUR CAPACITY FOR PROCESSING INFORMATION 1 , 1956 .

[3]  Xiangfeng Luo,et al.  Text knowledge representation model based on human concept learning , 2010, 9th IEEE International Conference on Cognitive Informatics (ICCI'10).

[4]  Shunxiang Zhang,et al.  Mining temporal explicit and implicit semantic relations between entities using web search engines , 2014, Future Gener. Comput. Syst..

[5]  Jun Zhang,et al.  Guided Game-Based Learning Using Fuzzy Cognitive Maps , 2010, IEEE Transactions on Learning Technologies.

[6]  Lan Chen,et al.  Knowle: A semantic link network based system for organizing large scale online news events , 2015, Future Gener. Comput. Syst..

[7]  Richard C. Atkinson,et al.  Human Memory: A Proposed System and its Control Processes , 1968, Psychology of Learning and Motivation.

[8]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[9]  Jun Zhang,et al.  ExNa: An Efficient Search Pattern for Search Engines , 2014, WAIM.

[10]  A. Baddeley Working memory: looking back and looking forward , 2003, Nature Reviews Neuroscience.

[11]  Jia Wang,et al.  User comments for news recommendation in forum-based social media , 2010, Inf. Sci..

[12]  John R Anderson,et al.  An integrated theory of the mind. , 2004, Psychological review.

[13]  Peter A. Tucker,et al.  Primary Memory , 1965, Encyclopedia of Database Systems.

[14]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[15]  Freddy Lécué,et al.  Seeking Quality of Web Service Composition in a Semantic Dimension , 2011, IEEE Transactions on Knowledge and Data Engineering.

[16]  Walter Kintsch,et al.  Toward a model of text comprehension and production. , 1978 .

[17]  Xue Chen,et al.  Building Association Link Network for Semantic Link on Web Resources , 2011, IEEE Transactions on Automation Science and Engineering.

[18]  Xiangfeng Luo,et al.  Measuring the semantic discrimination capability of association relations , 2014, Concurr. Comput. Pract. Exp..

[19]  Daniel Dajun Zeng,et al.  ExNa: an efficient search pattern for semantic search engines , 2016, Concurr. Comput. Pract. Exp..

[20]  Javier Snaider,et al.  The LIDA Framework as a General Tool for AGI , 2011, AGI.

[21]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Indexing , 1999, SIGIR Forum.

[22]  Lan Chen,et al.  Semantic based representing and organizing surveillance big data using video structural description technology , 2015, J. Syst. Softw..

[23]  Xiaotao Huang,et al.  A Relation-Based Search Engine in Semantic Web , 2007, IEEE Transactions on Knowledge and Data Engineering.

[24]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[25]  John A. Hartigan,et al.  Clustering Algorithms , 1975 .

[26]  Jan-Ming Ho,et al.  Using Web-Mining for Academic Measurement and Scholar Recommendation in Expert Finding System , 2011, 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[27]  Rakesh Agarwal,et al.  Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[28]  Qing Li,et al.  Research of E-Learning Intelligent Affective Model Based on BDI Agent with Learning Materials , 2011, CSISE.

[29]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[30]  Sung-Ho Kim,et al.  Emergency situation monitoring service using context motion tracking of chronic disease patients , 2015, Cluster Computing.