Analysis and Validation of Information Access Through Mono, Multidimensional and Dynamic Taxonomies

Access to complex information bases through multidimensional, dynamic taxonomies (also improperly known as faceted classification systems) is rapidly becoming pervasive in industry, especially in e-commerce. In this paper, the major shortcomings of conventional, monodimensional taxonomic approaches, such as the independence of different branches of the taxonomy and insufficient scalability, are discussed. The dynamic taxonomy approach, the first and most complete model for multidimensional taxonomic access to date, is reviewed and compared to conventional taxonomies. We analyze the reducing power of dynamic taxonomies and conventional taxonomies and report experimental results on real data, which confirm that monodimensional taxonomies are not useful for browsing/retrieval on large databases, whereas dynamic taxonomies can effectively manage very large databases and exhibit a very fast convergence.

[1]  George Kingsley Zipf,et al.  Human behavior and the principle of least effort , 1949 .

[2]  Surajit Chaudhuri,et al.  An overview of data warehousing and OLAP technology , 1997, SGMD.

[3]  M. Tamer Özsu,et al.  Queries and query processing in object-oriented database systems , 1990, TOIS.

[4]  Dennis Wollersheim,et al.  Methodology for creating a sample subset of dynamic taxonomy to use in navigating medical text databases , 2002, Proceedings International Database Engineering and Applications Symposium.

[5]  Shiyali Ramamrita Ranganathan,et al.  The colon classification , 1965 .

[6]  Giovanni Maria Sacco No (e-)Democracy Without (e-)Knowledge , 2005, TCGOV.

[7]  S. B. Yao,et al.  Approximating block accesses in database organizations , 1977, CACM.

[8]  Giovanni M. Sacco The Intelligent E-Sales Clerk: the Basic Ideas , 2004 .

[9]  Giovanni Maria Sacco Systematic Browsing for Multimedia Infobases , 2003, CISST.

[10]  Giovanni Maria Sacco Uniform access to multimedia information bases through dynamic taxonomies , 2004, IEEE Sixth International Symposium on Multimedia Software Engineering.

[11]  Kevin Li,et al.  Faceted metadata for image search and browsing , 2003, CHI '03.

[12]  Marti A. Hearst,et al.  Finding the flow in web site search , 2002, CACM.

[13]  Giovanni Maria Sacco,et al.  Dynamic Taxonomies: A Model for Large Information Bases , 2000, IEEE Trans. Knowl. Data Eng..

[14]  William P. Heising,et al.  Note on Random Addressing Techniques , 1963, IBM Syst. J..

[15]  Alfonso F. Cardenas Analysis and performance of inverted data base structures , 1975, CACM.