Enriching user search experience by mining social streams with heuristic stones and associative ripples

Recently, social networking sites such as Facebook and Twitter are becoming increasingly popular. The high accessibility of these sites has allowed the so-called social streams being spread across the Internet more quickly and widely, as more and more of the populations are being engaged into this vortex of the social networking revolution. Information seeking never means simply typing a few keywords into a search engine in this stream world. In this study, we try to find a way to utilize these diversified social streams to assist the search process without relying solely on the inputted keywords. We propose a method to analyze and extract meaningful information in accordance with users’ current needs and interests from social streams using two developed algorithms, and go further to integrate these organized stream data which are described as associative ripples into the search system, in order to improve the relevance of the results obtained from the search engine and feedback users with a new perspective of the sought issues to guide the further information seeking process, which can benefit both search experience enrichment and search process facilitation.

[1]  Sudipto Guha,et al.  Clustering Data Streams , 2000, FOCS.

[2]  Alexandre Passant,et al.  Microblogging: A Semantic Web and Distributed Approach , 2008 .

[3]  Jennifer Widom,et al.  Models and issues in data stream systems , 2002, PODS.

[4]  Reynol Junco,et al.  The effect of Twitter on college student engagement and grades , 2011, J. Comput. Assist. Learn..

[5]  John G. Breslin,et al.  An Overview of SMOB 2: Open, Semantic and Distributed Microblogging , 2010, ICWSM.

[6]  Martin Ebner,et al.  Microblogs in Higher Education - A chance to facilitate informal and process-oriented learning? , 2010, Comput. Educ..

[7]  Martin Ebner,et al.  Introducing Live Microblogging: How Single Presentations Can Be Enhanced by the Mass , 2009 .

[8]  Abraham Silberschatz,et al.  View maintenance issues for the chronicle data model (extended abstract) , 1995, PODS.

[9]  W. Reinhard,et al.  How people are using Twitter during conferences , 2009 .

[10]  Yunming Ye,et al.  A comparative study of feature weighting methods for document co-clustering , 2011, Int. J. Inf. Technol. Commun. Convergence.

[11]  Philip S. Yu,et al.  A Framework for Clustering Evolving Data Streams , 2003, VLDB.

[12]  Hong Chen,et al.  A Framework of Organic Streams: Integrating Dynamically Diversified Contents into Ubiquitous Personal Study , 2010, 2010 7th International Conference on Ubiquitous Intelligence & Computing and 7th International Conference on Autonomic & Trusted Computing.

[13]  Olivia R. Liu Sheng,et al.  LinkSelector: A Web mining approach to hyperlink selection for Web portals , 2004, TOIT.

[14]  Davide Modolo,et al.  Explorative visualization and analysis of a social network for arts: the case of deviantART , 2011 .

[15]  Ed H. Chi,et al.  Information Seeking Can Be Social , 2009, Computer.

[16]  Ryen W. White,et al.  Mining the search trails of surfing crowds: identifying relevant websites from user activity , 2008, WWW.

[17]  Jaideep Srivastava,et al.  Web usage mining: discovery and applications of usage patterns from Web data , 2000, SKDD.

[18]  Andreas Hotho,et al.  Semantic Web Mining: State of the art and future directions , 2006, J. Web Semant..

[19]  John G. Breslin,et al.  Using the Semantic Web for linking and reusing data across Web 2.0 communities , 2008, J. Web Semant..

[20]  Vitaly Klyuev,et al.  Semantic retrieval: an approach to representing, searching and summarising text documents , 2011, Int. J. Inf. Technol. Commun. Convergence.

[21]  Ricardo Baeza-Yates,et al.  Query-sets: using implicit feedback and query patterns to organize web documents , 2008, WWW.

[22]  Shonali Krishnaswamy,et al.  Mining data streams: a review , 2005, SGMD.

[23]  Sudipto Guha,et al.  Clustering Data Streams: Theory and Practice , 2003, IEEE Trans. Knowl. Data Eng..

[24]  Alberto Maria Segre,et al.  The Use of Twitter to Track Levels of Disease Activity and Public Concern in the U.S. during the Influenza A H1N1 Pandemic , 2011, PloS one.

[25]  Carlos Ordonez,et al.  Clustering binary data streams with K-means , 2003, DMKD '03.

[26]  Dawid Weiss,et al.  A survey of Web clustering engines , 2009, CSUR.

[27]  Kirsten A. Johnson The effect of Twitter posts on students’ perceptions of instructor credibility , 2011 .

[28]  Evgeny Pyshkin,et al.  How to improve the search quality for various types of information , 2011 .

[29]  Hong Chen,et al.  Generating associative ripples of relevant information from a variety of data streams by throwing a heuristic stone , 2011, ICUIMC '11.