Automatic cataloguing of advertisement in magazines

Advertising activities are one of the main aspects of companies. Its global evaluation is an important index, not only within economic sectors, but also for public economic policies. The evaluation of the adverts is carried out by specialised companies, which deliver advertisement expenditure information. Their work consist in an exhaustive collection of adverts (e.g., all adverts from any media), and deliver information (integrated from every media) relevant for their clients. The delivered information can range from statistical data to bear witness of an advertisement campaign. Important quality indicators for the delivered information are the exhaustiveness, the delay of delivering and a correct identification of different versions of an advert. In this article, we present an advanced computer-aided system for collecting and delivering advertising information from magazines and daily press. The system provides computer-aided data validation and exploitation resources. Using computerised document image analysis and image database indexing and retrieval, the system is able to locate an advert in a page, extract relevant quality indicators and search the advert (or similar ones) in a database. This tool is configured as an intranet and offers resources for image data acquisition, storage/retrieval and advert quality indicators extraction, however, the key of the system is the underlying idea of incorporating computer-aided visual information management.

[1]  Amara Lynn Graps,et al.  An introduction to wavelets , 1995 .

[2]  Robert M. Haralick,et al.  Document image understanding: geometric and logical layout , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Dana H. Ballard,et al.  Computer Vision , 1982 .

[4]  Robert M. Haralick,et al.  Document layout structure extraction using bounding boxes of different entitles , 1996, Proceedings Third IEEE Workshop on Applications of Computer Vision. WACV'96.

[5]  James Ze Wang,et al.  Content-based image indexing and searching using Daubechies' wavelets , 1998, International Journal on Digital Libraries.

[6]  Shih-Fu Chang,et al.  The holy grail of content-based media analysis , 2002 .

[7]  Shih-Fu Chang,et al.  Image Retrieval: Current Techniques, Promising Directions, and Open Issues , 1999, J. Vis. Commun. Image Represent..

[8]  Stefano Messelodi,et al.  Geometric Layout Analysis Techniques for Document Image Understanding: a Review , 2008 .

[9]  Robert M. Haralick,et al.  Textural features for image database retrieval , 1998, Proceedings. IEEE Workshop on Content-Based Access of Image and Video Libraries (Cat. No.98EX173).

[10]  Michael J. Swain,et al.  Color indexing , 1991, International Journal of Computer Vision.

[11]  Beng Chin Ooi,et al.  Fast signature-based color-spatial image retrieval , 1997, Proceedings of IEEE International Conference on Multimedia Computing and Systems.

[12]  Gio Wiederhold,et al.  Digital libraries, value, and productivity , 1995, CACM.

[13]  Thomas S. Huang,et al.  Supporting content-based queries over images in MARS , 1997, Proceedings of IEEE International Conference on Multimedia Computing and Systems.

[14]  Stavros J. Perantonis,et al.  Automatic page analysis for the creation of a digital library from newspaper archives , 2000, International Journal on Digital Libraries.

[15]  Yuan Yan Tang,et al.  Document analysis and recognition by computers , 1999 .

[16]  C.-C. Jay Kuo,et al.  Wavelet descriptor of planar curves: theory and applications , 1996, IEEE Trans. Image Process..

[17]  Thomas S. Huang,et al.  Unifying Keywords and Visual Contents in Image Retrieval , 2002, IEEE Multim..

[18]  Roberto Fontana,et al.  Efficient and flexible text extraction from document pages , 1999, International Journal on Document Analysis and Recognition.

[19]  Dragutin Petkovic,et al.  Query by Image and Video Content: The QBIC System , 1995, Computer.

[20]  Anil K. Jain,et al.  Document Representation and Its Application to Page Decomposition , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..