Personalized Text Content Summarizer for Mobile Learning: An Automatic Text Summarization System with Relevance Based Language Model

Although millions of text contents and multimedia published on the Web have potential to be shared as the learning contents for mobile learning, effectively extracting useful information from them is an extremely difficult problem. Oft-decried information overloading is the main issue to impede this potential. Many approaches have been proposed to revise and reinforce content to provide the appropriate delivery for mobile learning. However, approaches of manually converting content to suit the mobile learning require a huge effort on the part of the teachers and the instructional designers. Automatic text summarization can reduce this cost significantly, but it may have negative impact on the understanding of the meaning conveyed, as well as the risk of producing a standard summary for all learners without reflecting their interests and preferences. In this paper, a personalized text-based content summarizer is introduced to address an approach to help mobile learners to retrieve and process information more quickly, based on their interests and preferences. In this work, probabilistic language modeling techniques are adapted to build a user model and an extractive text summarization system to generate the personalized and automatic summary for mobile learning. Experimental results have indicated that the proposed solution provides a proper and efficient approach to help mobile learners by summarizing important content quickly and adaptively.

[1]  Lucy Vanderwende,et al.  Exploring Content Models for Multi-Document Summarization , 2009, NAACL.

[2]  W. Bruce Croft,et al.  Relevance-Based Language Models , 2001, SIGIR '01.

[3]  Vibhu O. Mittal,et al.  Ultra-Summarization: A Statistical Approach to Generating Highly Condensed Non-Extractive Summaries (poster abstract). , 1998, SIGIR 1999.

[4]  Ralph Weischedel,et al.  PERFORMANCE MEASURES FOR INFORMATION EXTRACTION , 2007 .

[5]  ChengXiang Zhai Risk Minimization and Language Modeling in Text Retrieval – Thesis Summary , 2002 .

[6]  Daniel Marcu,et al.  Bayesian Query-Focused Summarization , 2006, ACL.

[7]  Ani Nenkova,et al.  The Impact of Frequency on Summarization , 2005 .

[8]  Regina Barzilay,et al.  Catching the Drift: Probabilistic Content Models, with Applications to Generation and Summarization , 2004, NAACL.

[9]  Adwait Ratnaparkhi,et al.  A Maximum Entropy Approach to Identifying Sentence Boundaries , 1997, ANLP.

[10]  Jia Zhang,et al.  A Unit of Information-Based Content Adaptation Method for Improving Web Content Accessibility in the Mobile Internet , 2007 .

[11]  Masao Mukaidono,et al.  Probabilistic Inference and Bayesian Theorem on Rough Sets , 2000, Rough Sets and Current Trends in Computing.

[12]  John D. Lafferty,et al.  A study of smoothing methods for language models applied to Ad Hoc information retrieval , 2001, SIGIR '01.

[13]  D. Losada Language modeling for sentence retrieval : A comparison between Multiple-Bernoulli models and Multinomial models , 2005 .

[14]  John Makhoul Information Extraction from speech , 2006, SLT.

[15]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[16]  W. Bruce Croft,et al.  Formal multiple-bernoulli models for language modeling , 2004, SIGIR '04.

[17]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[18]  Regina Barzilay,et al.  Bayesian Unsupervised Topic Segmentation , 2008, EMNLP.

[19]  Inderjeet Mani,et al.  SUMMAC: a text summarization evaluation , 2002, Natural Language Engineering.

[20]  Yue-Shi Lee,et al.  Language Model Passage Retrieval for Question-Oriented Multi Document Summarization , 2007 .

[21]  Jia Zhang,et al.  Applying Web Page Adaptation Technique to the Augmentation of Mobile Learning , 2008, Res. Pract. Technol. Enhanc. Learn..

[22]  W. Bruce Croft,et al.  A language modeling approach to information retrieval , 1998, SIGIR '98.

[23]  Suet-Peng Yong,et al.  A Neural-based Text Summarization System , 2006 .

[24]  Gwm Matthias Rauterberg,et al.  A SURVEY ON USER PROFILE MODELING FOR PERSONALIZED SERVICE DELIVERY SYSTEMS , 2009 .

[25]  Diana G. Oblinger,et al.  Educating the Net Generation , 2005 .

[26]  David J. C. MacKay,et al.  A hierarchical Dirichlet language model , 1995, Natural Language Engineering.

[27]  Vasudeva Varma,et al.  Generating Personalized Summaries Using Publicly Available Web Documents , 2008, 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[28]  Ani Nenkova,et al.  Beyond SumBasic: Task-focused summarization with sentence simplification and lexical expansion , 2007, Information Processing & Management.

[29]  Jianfeng Gao,et al.  An Information-Theoretic Approach to Automatic Evaluation of Summaries , 2006, NAACL.

[30]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[31]  Vibhu O. Mittal,et al.  Ultra-summarization (poster abstract): a statistical approach to generating highly condensed non-extractive summaries , 1999, SIGIR '99.

[32]  James P. Callan,et al.  Passage-level evidence in document retrieval , 1994, SIGIR '94.

[33]  Andrew McCallum,et al.  A comparison of event models for naive bayes text classification , 1998, AAAI 1998.

[34]  Alex Alves Freitas,et al.  Automatic Text Summarization Using a Machine Learning Approach , 2002, SBIA.

[35]  Chong Wang,et al.  Collaborative topic modeling for recommending scientific articles , 2011, KDD.