论文信息 - Towards robust and reliable multimedia analysis through semantic integration of services

Towards robust and reliable multimedia analysis through semantic integration of services

Thanks to ubiquitous Web connectivity and portable multimedia devices, it has never been so easy to produce and distribute new multimedia resources such as videos, photos, and audio. This ever-increasing production leads to an information overload for consumers, which calls for efficient multimedia retrieval techniques. Multimedia resources can be efficiently retrieved using their metadata, but the multimedia analysis methods that can automatically generate this metadata are currently not reliable enough for highly diverse multimedia content. A reliable and automatic method for analyzing general multimedia content is needed. We introduce a domain-agnostic framework that annotates multimedia resources using currently available multimedia analysis methods. By using a three-step reasoning cycle, this framework can assess and improve the quality of multimedia analysis results, by consecutively (1) combining analysis results effectively, (2) predicting which results might need improvement, and (3) invoking compatible analysis methods to retrieve new results. By using semantic descriptions for the Web services that wrap the multimedia analysis methods, compatible services can be automatically selected. By using additional semantic reasoning on these semantic descriptions, the different services can be repurposed across different use cases. We evaluated this problem-agnostic framework in the context of video face detection, and showed that it is capable of providing the best analysis results regardless of the input video. The proposed methodology can serve as a basis to build a generic multimedia annotation platform, which returns reliable results for diverse multimedia analysis problems. This allows for better metadata generation, and improves the efficient retrieval of multimedia resources.

[1] Yo-Ping Huang,et al. Query-by-Humming/Singing of MIDI and Audio Files by Fuzzy Inference System , 2012, 2012 Third FTRA International Conference on Mobile, Ubiquitous, and Intelligent Computing.

[2] Gero Mühl,et al. QoS aggregation for Web service composition using workflow patterns , 2004 .

[3] David D. Denison,et al. Nonlinear estimation and classification , 2003 .

[4] Daniel A. Menascé,et al. Composing Web Services: A QoS View , 2004, IEEE Internet Comput..

[5] Elijah Blessing Rajsingh,et al. A novel fault tolerant service selection framework for pervasive computing , 2012, Human-centric Computing and Information Sciences.

[6] R. Doyle. The American terrorist. , 2001, Scientific American.

[7] Christian Gütl,et al. Hydra: A Vocabulary for Hypermedia-Driven Web APIs , 2013, LDOW.

[8] Mita Nasipuri,et al. Machine Learning Based Keyphrase Extraction: Comparing Decision Trees, Naïve Bayes, and Artificial Neural Networks , 2012, J. Inf. Process. Syst..

[9] Qun Jin,et al. An adaptively emerging mechanism for context-aware service selections regulated by feedback distributions , 2012, Human-centric Computing and Information Sciences.

[10] Peretz Shoval,et al. Information Filtering: Overview of Issues, Research and Systems , 2001, User Modeling and User-Adapted Interaction.

[11] Sören Auer,et al. Proceedings of the WWW2013 Workshop on Linked Data on the Web, Rio de Janeiro, Brazil, 14 May, 2013 , 2013, LDOW.

[12] Paul A. Viola,et al. Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[13] Lotfi A. Zadeh,et al. Fuzzy Logic , 2009, Encyclopedia of Complexity and Systems Science.

[14] Huajun Chen,et al. The Semantic Web , 2011, Lecture Notes in Computer Science.

[15] Dieter Fensel. Semantic Web Enabled Web Services , 2002, KI.

[16] Douglas R. Smith,et al. The Design of Divide and Conquer Algorithms , 1985, Sci. Comput. Program..

[17] Steffen Staab,et al. From Manual to Semi-Automatic Semantic Annotation: About Ontology-Based Text Annotation Tools , 2000, SAIC@COLING.

[18] Gajanan K. Kharate,et al. Face Recognition Based on PCA on Wavelet Subband of Average-Half-Face , 2012, J. Inf. Process. Syst..

[19] Wolfgang Pree,et al. Design Patterns for Object-Oriented Software Development , 1994, Proceedings of the (19th) International Conference on Software Engineering.

[20] Ming Ma,et al. Online Recognition of Handwritten Korean and English Characters , 2012, J. Inf. Process. Syst..

[21] Robert E. Schapire,et al. The Boosting Approach to Machine Learning An Overview , 2003 .

[22] Mohan S. Kankanhalli,et al. Multimodal fusion for multimedia analysis: a survey , 2010, Multimedia Systems.

[23] Anne H. H. Ngu,et al. QoS-aware middleware for Web services composition , 2004, IEEE Transactions on Software Engineering.

[24] Alexander G. Hauptmann. Lessons for the Future from a Decade of Informedia Video Analysis Research , 2005, CIVR.

[25] Rik Van de Walle,et al. Enabling context-aware multimedia annotation by a novel generic semantic problem-solving platform , 2012, Multimedia Tools and Applications.

[26] Erik Hjelmås,et al. Face Detection: A Survey , 2001, Comput. Vis. Image Underst..

[27] Tomoya Enokido,et al. Quorums-based Replication of Multimedia Objects in Distributed Systems , 2011, 2011 14th International Conference on Network-Based Information Systems.

[28] John R. Smith,et al. Metadata standards roundup , 2006, IEEE Multimedia.

[29] Rik Van de Walle,et al. Capturing the functionality of Web services with functional descriptions , 2012, Multimedia Tools and Applications.

[30] Rens Bod,et al. Including the power of interpretation through a simulation of Peirce's process of inquiry , 2013, Lit. Linguistic Comput..