MultiMedia Modeling

This paper aims to present a framework for collaborative design environments during the manipulation of multimedia content. The proposed desktop system can be assumed as an imitation of physical desktop including three additional properties: a tangible timeline layer, a projection of 2D plan and 3D perspective views which can be controlled through specified augmented reality markers. The potentials of this desktop application, such as unfoldable experience of time via a timeline, multi-touch manipulation and organisation of 2D and 3D visual data, were explored for the further augmented reality studies.

[1]  Hema Raghavan,et al.  Discovering users' specific geo intention in web search , 2009, WWW '09.

[2]  Ramesh C. Jain,et al.  GPSView: A scenic driving route planner , 2013, TOMCCAP.

[3]  Ali Farhadi,et al.  A latent model of discriminative aspect , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[4]  Hailong Sun,et al.  gTravel: a global social travel system , 2012, ACM Multimedia.

[5]  Bo Yu,et al.  A query-aware document ranking method for geographic information retrieval , 2007, GIR '07.

[6]  Anni Cai,et al.  The Application of Spatio-temporal Feature and Multi-Sensor in Home Medical Devices , 2010, J. Digit. Content Technol. its Appl..

[7]  Jessica K. Hodgins,et al.  Hierarchical Aligned Cluster Analysis for Temporal Clustering of Human Motion , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Mohan S. Kankanhalli,et al.  Creating audio keywords for event detection in soccer video , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[9]  Bu-Sung Lee,et al.  Event Detection in Twitter , 2011, ICWSM.

[10]  Wen Gao,et al.  Learning to Distribute Vocabulary Indexing for Scalable Visual Search , 2013, IEEE Transactions on Multimedia.

[11]  Michal Irani,et al.  Aligning Sequences and Actions by Maximizing Space-Time Correlations , 2006, ECCV.

[12]  Alireza Sahami Shirazi,et al.  Real-time nonverbal opinion sharing through mobile phones during sports events , 2011, CHI.

[13]  Patrick Pérez,et al.  Periodic motion detection and segmentation via approximate sequence alignment , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[14]  C. J. van Rijsbergen,et al.  Information Retrieval , 1979, Encyclopedia of GIS.

[15]  Jia Chen,et al.  DLMSearch: diversified landmark search by photo , 2012, ACM Multimedia.

[16]  Ronald Poppe,et al.  A survey on vision-based human action recognition , 2010, Image Vis. Comput..

[17]  Alexander G. Hauptmann,et al.  The co-attention model for tiny activity analysis , 2013, Neurocomputing.

[18]  Xing Xie,et al.  Learning travel recommendations from user-generated GPS traces , 2011, TIST.

[19]  Qi Tian,et al.  Less is More: Efficient 3-D Object Retrieval With Query View Selection , 2011, IEEE Transactions on Multimedia.

[20]  Yue Gao,et al.  3-D Object Retrieval and Recognition With Hypergraph Analysis , 2012, IEEE Transactions on Image Processing.

[21]  Laks V. S. Lakshmanan,et al.  Breaking out of the box of recommendations: from items to packages , 2010, RecSys '10.

[22]  Sheng Tang,et al.  Personalized multimedia web summarizer for tourist , 2008, WWW.

[23]  Dong Xu,et al.  Video Event Recognition Using Kernel Methods with Multilevel Temporal Alignment , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Fernando De la Torre,et al.  Generalized time warping for multi-modal alignment of human motion , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  Rémi Ronfard,et al.  A survey of vision-based methods for action representation, segmentation and recognition , 2011, Comput. Vis. Image Underst..

[26]  Juan Carlos Niebles,et al.  Modeling Temporal Structure of Decomposable Motion Segments for Activity Classification , 2010, ECCV.

[27]  Lior Wolf,et al.  Wide Baseline Matching between Unsynchronized Video Sequences , 2006, International Journal of Computer Vision.

[28]  Kari Pulli,et al.  Style translation for human motion , 2005, SIGGRAPH 2005.

[29]  Patrick Pérez,et al.  View-Independent Action Recognition from Temporal Self-Similarities , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Patrick Pérez,et al.  Cross-View Action Recognition from Temporal Self-similarities , 2008, ECCV.

[31]  Peter Kovesi,et al.  Using Space-Time Interest Points for Video Sequence Synchronization , 2007, MVA.

[32]  Rong Yan,et al.  Multimedia Search with Pseudo-relevance Feedback , 2003, CIVR.

[33]  Yue Gao,et al.  Camera Constraint-Free View-Based 3-D Object Retrieval , 2012, IEEE Transactions on Image Processing.

[34]  Xindong Wu,et al.  3-D Object Retrieval With Hausdorff Distance Learning , 2014, IEEE Transactions on Industrial Electronics.

[35]  Takayuki Okatani,et al.  Video Synchronization Based on Co-occurrence of Appearance Changes in Video Sequences , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[36]  Alexander G. Hauptmann,et al.  MoSIFT: Recognizing Human Actions in Surveillance Videos , 2009 .

[37]  Xing Xie,et al.  Collaborative Filtering Meets Mobile Recommendation: A User-Centered Approach , 2010, AAAI.

[38]  Changsheng Xu,et al.  Using Webcast Text for Semantic Event Detection in Broadcast Sports Video , 2008, IEEE Transactions on Multimedia.

[39]  Mohamed A. Sharaf,et al.  Location-Based Emerging Event Detection in Social Networks , 2013, APWeb.