A Framework for the Discovery, Analysis, and Retrieval of Multimedia Homemade Explosives Information on the Web

This work proposes a novel framework that integrates diverse state-of-the-art technologies for the discovery, analysis, retrieval, and recommendation of heterogeneous Web resources containing multimedia information about homemade explosives (HMEs), with particular focus on HME recipe information. The framework corresponds to a knowledge management platform that enables the interaction with HME information, and consists of three major components: (i) a discovery component that allows for the identification of HME resources on the Web, (ii) a content-based multimedia analysis component that detects HME-related concepts in multimedia content, and (iii) an indexing, retrieval, and recommendation component that processes the available HME information to enable its (semantic) search and provision of similar information. The proposed framework is being developed in a user-driven manner, based on the requirements of law enforcement and security agencies personnel, as well as HME domain experts. In addition, its development is guided by the characteristics of HME Web resources, as these have been observed in an empirical study conducted by HME domain experts. Overall, this framework is envisaged to increase the operational effectiveness and efficiency of law enforcement and security agencies in their quest to keep the citizen safe.

[1]  Hsinchun Chen Dark Web: Exploring and Mining the Dark Side of the Web , 2012, ICFCA.

[2]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[3]  Koen E. A. van de Sande,et al.  Evaluating Color Descriptors for Object and Scene Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Kyoungro Yoon,et al.  Index structures for structured documents , 1996, DL '96.

[5]  Carlo Aliprandi,et al.  CAPER: Crawling and analysing Facebook for intelligence purposes , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[6]  Oren Etzioni,et al.  Dynamic Reference Sifting: A Case Study in the Homepage Domain , 1997, Comput. Networks.

[7]  Ricardo Baeza-Yates,et al.  Modern Information Retrieval - the concepts and technology behind search, Second edition , 2011 .

[8]  Yiannis Kompatsiaris,et al.  Focussed crawling of environmental web resources: A pilot study on the combination of multimedia evidence , 2014, EMR@ICMR.

[9]  Christian Bizer,et al.  DBpedia spotlight: shedding light on the web of documents , 2011, I-Semantics '11.

[10]  SteinBenno,et al.  Plagiarism analysis, authorship identification, and near-duplicate detection PAN'07 , 2007 .

[11]  Allan Hanbury,et al.  Domain Specific Search , 2014, Professional Search in the Modern World.

[12]  Anne Stenersen,et al.  The Internet: A Virtual Training Camp? , 2008 .

[13]  Marc Najork,et al.  Web Crawling , 2010, Found. Trends Inf. Retr..

[14]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[15]  Ioannis Patras,et al.  A Study on the Use of a Binary Local Descriptor and Color Extensions of Local Descriptors for Video Concept Detection , 2015, MMM.

[16]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[17]  Cordelia Schmid,et al.  Aggregating Local Image Descriptors into Compact Codes , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Iraklis Paraskakis,et al.  MORMED: Towards a multilingual social networking platform facilitating Medicine 2.0 , 2010 .

[19]  Padmini Srinivasan,et al.  Learning to crawl: Comparing classification schemes , 2005, TOIS.

[20]  A. D. D. Groot,et al.  Methodology: Foundations of inference and research in the behavioral sciences , 1969 .

[21]  Peter Fankhauser,et al.  Boilerplate detection using shallow text features , 2010, WSDM '10.

[22]  Toru Ishida,et al.  Domain-specific Web search with keyword spices , 2004, IEEE Transactions on Knowledge and Data Engineering.

[23]  Kevin Chen-Chuan Chang,et al.  Towards rich query interpretation: walking back and forth for mining query templates , 2010, WWW '10.

[24]  Vasileios Mezaris,et al.  Fast shot segmentation combining global and local visual descriptors , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[25]  Linda Cloete,et al.  Dark Web: Exploring and Data Mining the Dark Side of the Web , 2012 .

[26]  Nick Mathewson,et al.  Tor: The Second-Generation Onion Router , 2004, USENIX Security Symposium.

[27]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[28]  Gary R. Bradski,et al.  ORB: An efficient alternative to SIFT or SURF , 2011, 2011 International Conference on Computer Vision.

[29]  Marc Najork,et al.  On the evolution of clusters of near-duplicate Web pages , 2003, Proceedings of the IEEE/LEOS 3rd International Conference on Numerical Simulation of Semiconductor Optoelectronic Devices (IEEE Cat. No.03EX726).