An Incremental Approach to Efficiently Retrieving Representative Information for Mobile Search on Web

Mobile search is one of the emerging and promising fields for researchers and practitioners. Nowadays, in consideration of screen size and navigability, current PC web search engines and information retrieval approaches may hardly be transplanted onto mobile search platforms directly. This paper presents an efficient approach to retrieving a compact set of differentiated documents which is information equivalent to the set of all documents satisfying query criteria. In doing so, based upon the idea [7] that extracts a representative document from each class derived from the transitive closure of a documents’ similarity matrix, the paper proposes an incremental strategy to deal with the computational complexity in generating the transitive closure and respective classes, which becomes crucial in massive data and mobile search applications. Theoretical analysis and data experiments show the advantage of the proposed approach.

[1]  H. Potoczny On similarity relations in fuzzy relational databases , 1984 .

[2]  Guoqing Chen,et al.  Introducing Relation Compactness for Generating a Flexible Size of Search Results in Fuzzy Queries , 2009, IFSA/EUSFLAT Conf..

[3]  Gerard Salton,et al.  The SMART Retrieval System—Experiments in Automatic Document Processing , 1971 .

[4]  R. K. Shyamasundar,et al.  Introduction to algorithms , 1996 .

[5]  Amy Nicole Langville,et al.  Google's PageRank and beyond - the science of search engine rankings , 2006 .

[6]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[7]  Amanda Spink,et al.  Web Search: Public Searching of the Web , 2011, Information Science and Knowledge Management.

[8]  Christopher J. Merz,et al.  UCI Repository of Machine Learning Databases , 1996 .

[9]  Hsuan-Shih Lee An optimal algorithm for computing the max-min transitive closure of a fuzzy similarity matrix , 2001, Fuzzy Sets Syst..

[10]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[11]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[12]  George Karypis,et al.  Item-based top-N recommendation algorithms , 2004, TOIS.

[13]  Fu Guoyao,et al.  An algorithm for computing the transitive closure of a fuzzy similarity matrix , 1992 .

[14]  J. C. Dunn,et al.  A Graph Theoretic Analysis of Pattern Classification via Tamura's Fuzzy Relation , 1974, IEEE Trans. Syst. Man Cybern..

[15]  Bing Liu,et al.  Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data , 2006, Data-Centric Systems and Applications.

[16]  Guoqing Chen Fuzzy logic in data modeling: semantics, constraints, and database design , 1998 .