The e-recall environment for cloud based mobile rich media data management

With the pervasiveness of mobile and wireless devices and rapid growth of the Internet, the access availability of rich media continues to accelerate in amount, variety, complexity and scale. This has been exemplified by various online media service portals including Flickr, Facebook and Last.fm. Unfortunately, lack of proper data processing techniques has now become major obstacle for effectively personal data management, especially in a mobile environment. While the related technical developments have been attracted a lot of research attentions recently, many open problems still remain unsolved. Among them, two major challenges need to be addressed. The first one is how to construct comprehensive way to describe 1) queries - model user information needs and 2) contents of rich media data. Moreover, size of media data collected by modern personal digital device (PDA) could be huge and will continue to grow. Consequently, fast data processing and associated scalability issues are becoming more important than ever before, and yet, little serious research has been conducted in this field. In this paper, we report ongoing efforts to develop E-Recall system - a novel platform for cloud based mobile rich media data management. It aims to provide an intelligent and comprehensive infrastructure for (1) scalable media data processing, (2) flexible media content sharing and publishing and (3) personalized media content integration under mobile environment.

[1]  Edward A. Fox,et al.  Combination of Multiple Searches , 1993, TREC.

[2]  Martin Hepp,et al.  Harvesting Wiki Consensus: Using Wikipedia Entries as Vocabulary for Knowledge Management , 2007, IEEE Internet Computing.

[3]  Martin Hepp,et al.  Ontologies: State of the Art, Business Potential, and Grand Challenges , 2008, Ontology Management.

[4]  Hua Li,et al.  Mobile Search With Multimodal Queries , 2008, Proceedings of the IEEE.

[5]  Jialie Shen,et al.  Large Scale Rich Media Information Search: Challenges and Opportunities , 2010, PCM.

[6]  Jialie Shen,et al.  Large scale rich media information search , 2010, PCM 2010.

[7]  Randy H. Katz,et al.  Above the Clouds: A Berkeley View of Cloud Computing , 2009 .

[8]  Fabio Crestani,et al.  “Is this document relevant?…probably”: a survey of probabilistic models in information retrieval , 1998, CSUR.

[9]  Rong Yan,et al.  Learning query-class dependent weights in automatic video retrieval , 2004, MULTIMEDIA '04.

[10]  Beng Chin Ooi,et al.  Indexing multi-dimensional data in a cloud system , 2010, SIGMOD Conference.

[11]  Shih-Fu Chang,et al.  Query-Adaptive Fusion for Multimodal Search , 2008, Proceedings of the IEEE.

[12]  Martin Hepp,et al.  Possible Ontologies: How Reality Constrains the Development of Relevant Ontologies , 2007, IEEE Internet Computing.

[13]  Sanjay Ghemawat,et al.  MapReduce: a flexible data processing tool , 2010, CACM.

[14]  JUSTIN ZOBEL,et al.  Inverted files for text search engines , 2006, CSUR.

[15]  Alexandr Andoni,et al.  Near-Optimal Hashing Algorithms for Approximate Nearest Neighbor in High Dimensions , 2006, 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).

[16]  Mark Handley,et al.  A scalable content-addressable network , 2001, SIGCOMM '01.

[17]  Shih-Fu Chang,et al.  Automatic discovery of query-class-dependent models for multimodal search , 2005, MULTIMEDIA '05.

[18]  King-Lup Liu,et al.  Building efficient and effective metasearch engines , 2002, CSUR.