Data Mining and Serial Documents

This paper is concerned with the investigation of the relevance and suitability of the data mining approach to serial documents. Conceptually the paper is divided into three parts. The first part presents the salient features of data mining and its symbiotic relationship to data warehousing. In the second part of the paper, historical serial documents are introduced, and the Ottoman Tax Registers (Defters) are taken as a case study. Their conformance to the data mining approach is established in terms of structure, analysis and results. A high-level conceptual model for the Defters is also presented. The final part concludes with a brief consideration of the implication of data mining for historical research.

[1]  Daniel I. Greenstein A Historian's Guide to Computing , 1994 .

[2]  Rachid Anane,et al.  HiSQL: A Front-end Query System for Historical Relational Databases , 1997, Comput. Humanit..

[3]  Evangelos Simoudis,et al.  Reality Check for Data Mining , 1996, IEEE Expert.

[4]  Evangelos Simoudis,et al.  Mining business databases , 1996, CACM.

[5]  Richard D. Hackathorn,et al.  Using the Data Warehouse , 1994 .

[6]  A. Sauvy Économie et population , 1952 .

[7]  Ken Bartley Classifying the Past: Discriminant Analysis and its Application to Medieval Farming Systems , 1996, Hist. Comput..

[8]  Evangelia Balta L'Eubée à la fin du XVe siècle : économie et population, les registres de l'année 1474 , 1989 .

[9]  Peter Denley Models, Sources and Users: Historical Database Design in the 1990s , 1994, Hist. Comput..

[10]  Wolf-Dieter Hutteroth,et al.  Historical geography of Palestine, Transjordan and Southern Syria in the late 16th century , 1977 .

[11]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[12]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery in Databases , 1996, AI Mag..

[13]  Roger Middleton,et al.  Review: P. Denley, S. Fogelvik and C.E. Harvey, (eds) (1989) History and computing II. Manchester: Manchester University Press, 1989. , 1990 .

[14]  Kevin Schürer,et al.  A guide to historical datafiles held in machine-readable form. , 1992 .

[15]  W. H. Inmon,et al.  The data warehouse and data mining , 1996, CACM.

[16]  Ramasamy Uthurusamy,et al.  Data mining and knowledge discovery in databases , 1996, CACM.