A Multimodal Approach to Relevance and Pertinence of Documents

Automated document classification extracts information through a systematic analysis of document content. It is an active research field of growing importance, owing to the large number of electronic documents produced on the World Wide Web and made readily available by widespread technologies, including mobile ones. Several application areas benefit from automated document classification, including document archiving, invoice processing in business environments, press releases, and search engines. Current tools classify or “tag” text and images separately. In this paper we show how linking image-based and text-based content improves fundamental document management tasks such as retrieving information from a database or automatically routing documents. We present formal definitions of the concepts of pertinence and relevance, which apply to the document types we call “multimodal”. These definitions are based on a model of conceptual spaces that we believe is necessary for investigating complex documents using joint information from their text and images.
