Discovery of Environmental Web Resources Based on the Combination of Multimedia Evidence

This work proposes a framework for the discovery of environmental Web resources providing air quality measurements and forecasts. Motivated by the frequent occurrence of heatmaps in such Web resources, it exploits multimedia evidence at different stages of the discovery process. Domain-specific queries generated using empirical information and machine learning driven query expansion are submitted both to the Web and Image search services of a general-purpose search engine. Post-retrieval filtering is performed by combining textual and visual (heatmap-related) evidence in a supervised machine learning framework. Our experimental results indicate improvements in the effectiveness when performing heatmap recognition based on SURF and SIFT descriptors using VLAD encoding and when combining multimedia evidence in the discovery process.

[1]  Andrew McCallum,et al.  Automating the Construction of Internet Portals with Machine Learning , 2000, Information Retrieval.

[2]  Chew Lim Tan,et al.  Text/Graphics Separation in Maps , 2001, GREC.

[3]  Paul Over,et al.  TRECVID 2007--Overview , 2007, TRECVID.

[4]  David Hawking,et al.  Focused crawling for both topical relevance and quality of medical information , 2005, CIKM '05.

[5]  Toru Ishida,et al.  Domain-specific Web search with keyword spices , 2004, IEEE Transactions on Knowledge and Data Engineering.

[6]  Christopher Hunt,et al.  Notes on the OpenSURF Library , 2009 .

[7]  Anastasios Bassoukos,et al.  A method for the inverse reconstruction of environmental data applicable at the Chemical Weather portal , 2010 .

[8]  Yiannis Kompatsiaris,et al.  Focussed crawling of environmental web resources: A pilot study on the combination of multimedia evidence , 2014, EMR@ICMR.

[9]  Allan Hanbury,et al.  Domain Specific Search , 2014, Professional Search in the Modern World.

[10]  Ioannis Patras,et al.  A Study on the Use of a Binary Local Descriptor and Color Extensions of Local Descriptors for Video Concept Detection , 2015, MMM.

[11]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[12]  Yiannis Kompatsiaris,et al.  Discovery of Environmental Nodes in the Web , 2012, IRFC.

[13]  Cordelia Schmid,et al.  Aggregating local descriptors into a compact image representation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[14]  Shih-Fu Chang,et al.  Overview of the MPEG-7 standard , 2001, IEEE Trans. Circuits Syst. Video Technol..

[15]  Yiannis Kompatsiaris,et al.  Discovery of environmental resources based on heatmap recognition , 2013, 2013 IEEE International Conference on Image Processing.

[16]  Qiang Wang,et al.  Ontology-Based Focused Crawling , 2009, 2009 International Conference on Information, Process, and Knowledge Management.

[17]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[18]  Edward A. Fox,et al.  Combination of Multiple Searches , 1993, TREC.

[19]  Koji Iwanuma,et al.  Rapid Synthesis of Domain-Specific Web Search Engines Based on Semi-Automatic Training-Example Generation , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[20]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[21]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[22]  Kostas Karatzas URBAN AIR QUALITY MANAGEMENT AND INFORMATION SYSTEMS IN EUROPE: LEGAL FRAMEWORK AND INFORMATION ACCESS , 2000 .

[23]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[24]  Paul Over,et al.  TRECVID 2007 workshop participants notebook papers, Gaithersburg, MD, USA, November 2007 , 2007, TRECVID.

[25]  Thomas C. Henderson,et al.  Raster Map Image Analysis , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[26]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[27]  Andrew Zisserman,et al.  Return of the Devil in the Details: Delving Deep into Convolutional Nets , 2014, BMVC.