Multimedia in Cultural Heritage Manuscripts: Integrating Description, Transcription, and Image Content

Cultural heritage documents are commonly digitized as images, even when their content is textual. Collections of valuable documents therefore tend to combine descriptive information produced by the holding institutions with digitized images, transcriptions created by scholars, translations, and miscellaneous annotations. Offering faceted access to such a collection requires exploring these diverse materials, integrating them under a model that accounts for both metadata and content, and providing a comprehensive retrieval environment. In this work we applied the MetaMedia multimedia database framework to a collection of ancient documents, processed their descriptive, textual, and image content, and produced a browsing and searching system. The main challenges are the integrated management of metadata and content, the indexing of image content, and the design of a browsing and searching interface that keeps the various views on the data together.
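To make the idea of integrated metadata and content more concrete, the following is a minimal Python sketch of one way a single record could hold description, transcription, and an image descriptor, and be queried across all three. It is not the MetaMedia model or API: the `ManuscriptItem` class, the field names, the three-value descriptors, the Euclidean distance, and the sample records are all assumptions made up for illustration.

```python
from dataclasses import dataclass
from math import sqrt


@dataclass
class ManuscriptItem:
    """One document with its three kinds of content kept together (hypothetical model)."""
    item_id: str
    description: dict[str, str]    # institutional metadata; field names are assumed
    transcription: str             # scholarly transcription text
    image_descriptor: list[float]  # e.g. a tiny colour histogram; assumed format


def matches_facets(item, facets):
    """True when every requested facet value equals the item's metadata value."""
    return all(item.description.get(key) == value for key, value in facets.items())


def euclidean(a, b):
    """Euclidean distance between two descriptors of equal length."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def search(items, facets=None, term=None, like=None):
    """Filter by metadata facets and transcription text, then rank by image similarity."""
    hits = [
        item for item in items
        if matches_facets(item, facets or {})
        and (term is None or term.lower() in item.transcription.lower())
    ]
    if like is not None:
        # Closest descriptor first; a real system would use an index instead of a scan.
        hits.sort(key=lambda item: euclidean(item.image_descriptor, like))
    return [item.item_id for item in hits]


# Invented sample records, not taken from the collection described in the paper.
COLLECTION = [
    ManuscriptItem("ms-001", {"century": "12", "language": "Latin"},
                   "In nomine Domini amen", [0.7, 0.2, 0.1]),
    ManuscriptItem("ms-002", {"century": "12", "language": "Latin"},
                   "charter of donation to the monastery", [0.1, 0.3, 0.6]),
    ManuscriptItem("ms-003", {"century": "13", "language": "Portuguese"},
                   "royal charter confirming privileges", [0.6, 0.3, 0.1]),
]

by_facet_and_image = search(COLLECTION, facets={"century": "12"}, like=[0.1, 0.3, 0.6])
by_text_and_image = search(COLLECTION, term="charter", like=[0.7, 0.2, 0.1])
```

The point of the sketch is only that a single query can mix a metadata facet, a text term, and an example image descriptor, which is the kind of combined view on the data that the abstract refers to. The linear scan and exact-match facets are simplifications; the abstract names image indexing as one of the main challenges precisely because a scan of this kind does not scale.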
