On the Notion of Genre in Digital Preservation

In this paper, we discuss the notion of genre as a basis for addressing the problem of context representation in digital preservation. We outline several reference points for the notion of genre. This includes a review of diplomatic principles that can support and enhance the power of genre as a key to capture information about context relations. Further, we discuss the impact of open genre models and open topic models in information retrieval and finally present a list of research questions concerning future research in automation of digital preservation.

[1]  Yunhyong Kim,et al.  Genre Classification in Automated Ingest and Appraisal Metadata , 2006, ECDL.

[2]  Alexander Mehler,et al.  Genres on the Web: Computational Models and Empirical Studies , 2010 .

[3]  B. Barbiche Diplomatics of Modern Official Documents (Sixteenth - Eighteenth Centuries): Evaluation and Perspectives , 1996 .

[4]  T.L.J. Ferris,et al.  Tracing Genres Through Organizations: A Sociocultural Approach to Information , 2004, IEEE Transactions on Professional Communication.

[5]  Yunhyong Kim,et al.  Searching for Ground Truth: A Stepping Stone in Automating Genre Classification , 2007, DELOS.

[6]  Carsten S. Østerlund Genre Combinations: A Window into Dynamic Communication Practices , 2007, J. Manag. Inf. Syst..

[7]  Nenad Stojanovic Information-need driven query refinement , 2003, Proceedings IEEE/WIC International Conference on Web Intelligence (WI 2003).

[8]  Stefan M. Rüger,et al.  High-dimensional visual vocabularies for image retrieval , 2007, SIGIR.

[9]  Mark Baillie,et al.  Relevance in Technicolor , 2010, ECDL.

[10]  Carolyn R. Miller Genre as social action , 1984 .

[11]  K. C. Fraser Diplomatics: : New Uses for an Old Science , 1999 .

[12]  C. Morris Foundations of the theory of signs , 1938 .

[13]  P. Garvin,et al.  Prolegomena to a Theory of Language , 1953 .

[14]  Alexander Mehler A Quantitative Graph Model of Social Ontologies by Example of Wikipedia , 2011 .

[15]  N. Cocchiarella,et al.  Situations and Attitudes. , 1986 .

[16]  Tony Hammond,et al.  Social Bookmarking Tools (I): A General Overview , 2005, D Lib Mag..

[17]  Yunhyong Kim,et al.  Detecting Family Resemblance: Automated Genre Classification , 2007, Data Sci. J..

[18]  Lars Littig,et al.  Classification of Web Sites at Super-genre Level , 2011, Genres on the Web.

[19]  L. Wittgenstein Philosophical investigations = Philosophische Untersuchungen , 1958 .

[20]  Yunhyong Kim,et al.  Building a document genre corpus: a profile of the KRYS I corpus , 2008 .

[21]  Olivier Guyotjeannin The Expansion of Diplomatics as a Discipline , 2009 .

[22]  Luciana Duranti,et al.  Diplomatics: New Uses for an Old Science , 1998 .

[23]  Yunhyong Kim,et al.  "The Naming of Cats": Automated Genre Classification , 2008, Int. J. Digit. Curation.

[24]  Andreas Hotho,et al.  BibSonomy: a social bookmark and publication sharing system , 2006 .

[25]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[26]  Georg Rehm,et al.  Towards automatic Web genre identification: a corpus-based approach in the domain of academia by example of the Academic's Personal Homepage , 2002, Proceedings of the 35th Annual Hawaii International Conference on System Sciences.

[27]  Duen-Ren Liu,et al.  Toward incorporating a task-stage identification technique into the long-term document support process , 2008, Inf. Process. Manag..

[28]  W. Orlikowski,et al.  Genre Systems: Structuring Interaction through Communicative Norms , 2002 .

[29]  JoAnne Yates,et al.  Genre taxonomy: A knowledge repository of communicative actions , 2001, TOIS.

[30]  G. Bateson,et al.  STEPS TO AN ECOLOGY OF MIND COLLECTED ESSAYS IN ANTHROPOLOGY, PSYCHIATRY, EVOLUTION, AND EPISTEMOLOGY , 2006 .

[31]  Werner Ulrich The naming of cats , 2009 .

[32]  Serge Sharoff,et al.  In the Garden and in the Jungle Comparing Genres in the BNC and Internet , 2010 .

[33]  Alexander Mehler,et al.  Riding the Rough Waves of Genre on the Web , 2011, Genres on the Web.

[34]  Alexander Mehler,et al.  Integrating Content and Structure Learning: A Model of Hypertext Zoning and Sounding , 2012, Modeling, Learning, and Processing of Text Technological Data Structures.

[35]  Serge Sharoff In the Garden and in the Jungle , 2011, Genres on the Web.

[36]  W. Orlikowski,et al.  Genres of Organizational Communication: A Structurational Approach to Studying Communication and Media , 1992 .

[37]  Clay Spinuzzi,et al.  Tracing Genres through Organizations: A Sociocultural Approach to Information Design , 2003 .

[38]  Alexander Mehler,et al.  Enhancing document modeling by means of open topic models: Crossing the frontier of classification schemes in digital libraries by example of the DDC , 2009, Libr. Hi Tech.

[39]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[40]  Alexander Mehler,et al.  Towards a Reference Corpus of Web Genres for the Evaluation of Genre Identification Systems , 2008, LREC.

[41]  Heather MacNeil Contemporary Archival Diplomatics as a Method of Inquiry: Lessons Learned from Two Research Projects , 2004 .

[42]  Christopher A. Lee,et al.  Taking Context Seriously: A Framework for Contextual Information in Digital Collections , 2007 .

[43]  Marina Santini Cross-Testing a Genre Classification Model for the Web , 2011, Genres on the Web.

[44]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[45]  Alexander Mehler Modeling, Learning, and Processing of Text Technological Data Structures , 2012, Studies in Computational Intelligence.