Towards a Formal Theory for Complex Objects and Content-Based Image Retrieval

Advanced services in digital libraries (DLs) have been developed and are widely used to address the required capabilities of an assortment of systems as DLs expand into diverse application domains.  In order to reuse, integrate, unify, manage, and support these heterogeneous resources, the notion of complex objects (COs) has emerged as a means to facilitate aggregation of content and to help developers to manage heterogeneous information resources, and their internal components. In particular, complex image objects (along with the most used service - Content-Based Image Retrieval) have the potential to play a key role in information systems, due to the large availability of images and the need to integrate them with other datasets (and metadata), and image manipulation software. However, the lack of consensus on precise theoretical definitions for these concepts usually leads to ad hoc implementation, duplication of efforts, and interoperability problems.   In  this article  we exploit the 5S Framework to propose  a  formal  description for Complex Objects and Content-Based Image Retrieval, defining their fundamental concepts and relationships from a digital library (DL) perspective.  These formalized concepts can be used to classify, compare, and highlight the differences among components, technologies, and applications, impacting digital library researchers, designers, and developers.  The theoretical extensions of digital library functionality presented here cover complex image objects, within a practical case study, to exemplify the integrative use of services, thus balancing theory and practice.

[1]  Robert B. Allen,et al.  Supporting Student Collaboration for Image Indexing , 2007, ICADL.

[2]  Edward A. Fox,et al.  Streams, structures, spaces, scenarios, societies (5s): A formal model for digital libraries , 2004, TOIS.

[3]  Edward A. Fox,et al.  Analyzing Compound Object Technologies from the 5 S Perspective , 2011 .

[4]  Edward A. Fox,et al.  ETANA-DL: managing complex information applications - an archaeology digital library , 2004, JCDL.

[5]  Edward A. Fox,et al.  Exploring digital libraries: integrating browsing, searching, and visualization , 2006, Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '06).

[6]  Edward A. Fox,et al.  SIERRA - A Superimposed Application for Enhanced Image Description and Retrieval , 2006, ECDL.

[7]  Herbert Van de Sompel,et al.  IJDL special issue on complex digital objects: Guest editors' introduction , 2005, International Journal on Digital Libraries.

[8]  Michael J. Swain,et al.  Color indexing , 1991, International Journal of Computer Vision.

[9]  Marvin Minsky,et al.  A framework for representing knowledge , 1974 .

[10]  Edward A. Fox,et al.  Superimposed Image Description and Retrieval for Fish Species Identification , 2009, ECDL.

[11]  Edward A. Fox,et al.  Automatic evaluation of digital libraries with 5SQual , 2009, J. Informetrics.

[12]  Nádia P. Kozievitch,et al.  Complex Objects in Digital Libraries , 2009, Bull. IEEE Tech. Comm. Digit. Libr..

[13]  Michael L. Nelson,et al.  A Survey of Complex Object Technologies for Digital Libraries , 2001 .

[14]  Ricardo da S. Torres,et al.  Exploring CBIR concepts in the CTRnet Project , 2010 .

[15]  James Ze Wang,et al.  Scalable integrated region-based image retrieval using IRM and statistical clustering , 2001, JCDL '01.

[16]  Chris Awre Legal issues of compound ETDs , 2009 .

[17]  Edward A. Fox,et al.  International Journal on Digital Libraries manuscript No. (will be inserted by the editor) A Digital Library Framework for Biodiversity Information Systems , 2022 .

[18]  Herng-Yow Chen,et al.  A digital museum of Taiwanese butterflies , 2000, DL '00.

[19]  James C. French,et al.  An application of multiple viewpoints to content-based image retrieval , 2003, 2003 Joint Conference on Digital Libraries, 2003. Proceedings..

[20]  Edward A. Fox,et al.  5SGraph demo: a graphical modeling tool for digital libraries , 2003, 2003 Joint Conference on Digital Libraries, 2003. Proceedings..

[21]  Ricardo da Silva Torres,et al.  Describing OAI-ORE from the 5S Framework Perspective , 2010, ICADL.

[22]  R. Schank,et al.  Tell Me a Story: Narrative and Intelligence , 1991 .

[23]  Rik Van de Walle,et al.  The MPEG-21 Book , 2006 .

[24]  Fillia Makedon,et al.  Generating fuzzy semantic metadata describing spatial relations from images using the R-histogram , 2004, JCDL.

[25]  A. Lynn Abbott,et al.  Rethinking Fingerprint Evidence Through Integration of Very Large Digital Libraries , 2011 .

[26]  Edward A. Fox,et al.  CTRnet DL for disaster information services , 2011, JCDL '11.

[27]  Dean B. Krafft,et al.  Ncore: architecture and implementation of a flexible, collaborative digital library , 2008, JCDL '08.

[28]  Bin Zhu,et al.  Creating a large-scale content-based airphoto image digital library , 2000, IEEE Trans. Image Process..

[29]  Jane Hunter,et al.  SCOPE: A Scientific Compound Object Publishing and Editing System , 2008, Int. J. Digit. Curation.

[30]  Lionel Brunie,et al.  Integrating similarity-based queries in image DBMSs , 2004, SAC '04.

[31]  Edward A. Fox,et al.  Building digital libraries from simple building blocks , 2003, Online Inf. Rev..

[32]  Edward A. Fox,et al.  Extending the 5S Framework of Digital Libraries to support Complex Objects, Superimposed Information, and Content-Based Image Retrieval Services , 2010 .

[33]  Jim Melton,et al.  SQL multimedia and application packages (SQL/MM) , 2001, SGMD.

[34]  Herbert Van de Sompel,et al.  The OAI-ORE effort: progress, challenges, synergies , 2007, JCDL '07.

[35]  Ricardo da Silva Torres,et al.  User-Oriented Evaluation of Color Descriptors for Web Image Retrieval , 2010, ECDL.

[36]  Edward A. Fox,et al.  5SL: a language for declarative specification and generation of digital libraries , 2002, JCDL '02.

[37]  Luciano da Fontoura Costa,et al.  A graph-based approach for multiscale shape analysis , 2004, Pattern Recognit..

[38]  Chris Awre Managing compound objects within Fedora , 2009 .

[39]  John Lyons,et al.  A Photo-Based Computer System for Identifying Wisconsin Fishes , 2006 .

[40]  Herbert Van de Sompel,et al.  The open archives initiative: building a low-barrier interoperability framework , 2001, JCDL '01.

[41]  Martha L. Brogan A Survey of Digital Library Aggregation Services , 2003 .

[42]  André Santanchè,et al.  User-author centered multimedia building blocks , 2006, Multimedia Systems.

[43]  Fatos T. Yarman-Vural,et al.  SASI: a generic texture descriptor for image retrieval , 2003, Pattern Recognit..

[44]  Martin Slabbertje,et al.  Requirements for Management & Storage to support complex objects & ORE in DSpace , 2009 .

[45]  Edward A. Fox,et al.  A Teaching Tool for Parasitology: Enhancing Learning with Annotation and Image Retrieval , 2010, ECDL.

[46]  Ricardo da Silva Torres,et al.  Content-Based Image Retrieval: Theory and Applications , 2006, RITA.

[47]  André Santanchè,et al.  A Component Model and Infrastructure for a Fluid Web , 2007, IEEE Transactions on Knowledge and Data Engineering.

[48]  Mario A. Nascimento,et al.  A compact and efficient image retrieval approach based on border/interior pixel classification , 2002, CIKM '02.

[49]  Agma J. M. Traina,et al.  Image domain formalization for content-based image retrieval , 2005, SAC '05.

[50]  Lois M. L. Delcambre,et al.  Use of subimages in fish species identification: a qualitative study , 2011, JCDL '11.

[51]  Chung-Sheng Li,et al.  Progressive Content-Based Retrieval from Satellite Image Archives , 1997, D Lib Mag..