A Review on Continuous Summarization and Timeline Generation for Topic Evolutionary Tweet Streams

: Tweets are being made short instant message and shared for both clients and information examiners. Twitter which gets more than 400 million tweets for every day has developed as a precious wellspring of news, web journals, suppositions and the sky is the limit from there. Our proposed work comprises three parts tweet stream grouping to bunch tweet utilizing k-model bunch algorithm(In existing base paper, k-implies bunching calculation used to make the underlying bunches. With worldwide bunch, it didn't function admirably. So in our proposed work, we utilize k-model bunching produce more tightly groups than k-implies bunching, particularly if the bunches are globular) and second tweet bunch vector strategy to create rank outline utilizing ravenous calculation, thusly requires usefulness which fundamentally vary from customary synopsis. When all is said in done, tweet rundown and third to identify and screens the synopsis - based and volume based variety to create timetable naturally from tweet stream. Executing constant tweet stream diminishing a content record is however not a basic undertaking, since countless are useless, random and rambunctious in nature, because of the social way of tweeting. Facilitate, tweets are emphatically associated with their presented case and up-on the-moment tweets have a tendency to land at a quick rate. Effectiveness — tweet streams are constantly huge in level, thus the synopsis calculation ought to be significantly fit; Flexibility - it ought to give tweet outlines of irregular minute lengths. Theme development - it ought to routinely recognize sub - point changes and the minutes that they happen.

[1]  Yiming Yang,et al.  A Probabilistic Model for Online Document Clustering with Application to Novelty Detection , 2004, NIPS.

[2]  Yong Yu,et al.  SVDFeature: a toolkit for feature-based collaborative filtering , 2012, J. Mach. Learn. Res..

[3]  M. Sheelagh T. Carpendale,et al.  A Visual Backchannel for Large-Scale Events , 2010, IEEE Transactions on Visualization and Computer Graphics.

[4]  Alexander J. Smola,et al.  Discovering geographical topics in the twitter stream , 2012, WWW.

[5]  Dragomir R. Radev,et al.  LexRank: Graph-based Lexical Centrality as Salience in Text Summarization , 2004, J. Artif. Intell. Res..

[6]  Shi Zhong,et al.  Efficient streaming text clustering , 2005, Neural Networks.

[7]  W. Beyer CRC Standard Mathematical Tables and Formulae , 1991 .

[8]  Michael R. Lyu,et al.  Improving Recommender Systems by Incorporating Social Contextual Information , 2011, TOIS.

[9]  Chun Chen,et al.  Document Summarization Based on Data Reconstruction , 2012, AAAI.

[10]  Deepayan Chakrabarti,et al.  Event Summarization Using Tweets , 2011, ICWSM.

[11]  Philip S. Yu,et al.  A Framework for Clustering Evolving Data Streams , 2003, VLDB.

[12]  Brian D. Davison,et al.  Co-factorization machines: modeling user interests and predicting individual decisions in Twitter , 2013, WSDM.

[13]  Tao Li,et al.  A Participant-based Approach for Event Summarization Using Twitter Streams , 2013, NAACL.

[14]  Hiroya Takamura,et al.  Generating Live Sports Updates from Twitter by Finding Good Reporters , 2013, 2013 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT).

[15]  Chris H. Q. Ding,et al.  Multi-document summarization via sentence-level semantic analysis and symmetric matrix factorization , 2008, SIGIR '08.

[16]  Julio Gonzalo,et al.  Towards real-time summarization of scheduled events from twitter streams , 2012, HT '12.

[17]  Claire Cardie,et al.  39. Opinion mining and sentiment analysis , 2014 .

[18]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[19]  Jie Xu,et al.  Efficient summarization framework for multi-attribute uncertain data , 2014, SIGMOD Conference.

[20]  Philip S. Yu,et al.  Under Consideration for Publication in Knowledge and Information Systems on Clustering Massive Text and Categorical Data Streams , 2022 .

[21]  Yan Zhang,et al.  Evolutionary timeline summarization: a balanced optimization framework via iterative substitution , 2011, SIGIR.

[22]  Michael S. Bernstein,et al.  Twitinfo: aggregating and visualizing microblogs for event exploration , 2011, CHI.

[23]  Jianhua Lin,et al.  Divergence measures based on the Shannon entropy , 1991, IEEE Trans. Inf. Theory.

[24]  Tian Zhang,et al.  BIRCH: an efficient data clustering method for very large databases , 1996, SIGMOD '96.

[25]  Beng Chin Ooi,et al.  TI: an efficient indexing mechanism for real-time search on tweets , 2011, SIGMOD '11.

[26]  Eduard H. Hovy,et al.  Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics , 2003, NAACL.

[27]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[28]  Qi He,et al.  Bursty Feature Representation for Clustering Text Streams , 2007, SDM.

[29]  Jade Goldstein-Stewart,et al.  The use of MMR, diversity-based reranking for reordering documents and producing summaries , 1998, SIGIR '98.

[30]  Jianping Zeng,et al.  Text stream clustering algorithm based on adaptive feature selection , 2011, Expert Syst. Appl..

[31]  Jugal K. Kalita,et al.  Comparing Twitter Summarization Algorithms for Multiple Post Summaries , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[32]  Paul S. Bradley,et al.  Scaling Clustering Algorithms to Large Databases , 1998, KDD.

[33]  Srinivasan Parthasarathy,et al.  A framework for summarizing and analyzing twitter feeds , 2012, KDD.

[34]  Michael Giering,et al.  Retail sales prediction and item recommendations using customer demographics at store level , 2008, SKDD.

[35]  Benjamin Van Durme Streaming Analysis of Discourse Participants , 2012, EMNLP-CoNLL.

[36]  Chen Lin,et al.  Generating event storylines from microblogs , 2012, CIKM.

[37]  Jeffrey Nichols,et al.  Summarizing sporting events using twitter , 2012, IUI '12.

[38]  Martin Ester,et al.  TrustWalker: a random walk model for combining trust-based and item-based recommendation , 2009, KDD.

[39]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[40]  Regina Barzilay,et al.  Using Lexical Chains for Text Summarization , 1997 .

[41]  Hiroya Takamura,et al.  Summarizing a Document Stream , 2011, ECIR.

[42]  Sanda M. Harabagiu,et al.  Relevance Modeling for Microblog Summarization , 2011, ICWSM.

[43]  David A. Shamma,et al.  Characterizing debate performance via aggregated twitter sentiment , 2010, CHI.

[44]  Xiaohui Yu,et al.  ARSA: a sentiment-aware model for predicting sales performance using blogs , 2007, SIGIR.

[45]  Jugal K. Kalita,et al.  Summarizing Microblogs Automatically , 2010, NAACL.