Superimposing codes representing hierarchical information in web directories

In this article we describe how superimposed coding can be used to represent hierarchical information, which is especially useful in categorized information retrieval systems (for example, Web directories). Superimposed coding have been widely used in signature files in a rigid manner, but our approach is more flexible and powerful. The categorization is based on a directed acyclic graph and each document is assigned to one or more nodes, using superimposed coding we represent the categorization information of each document in a signature. In this paper we explain the superimposed coding theory and how this coding technique can be applied to more flexible environments. Furthermore, we realize an exhaustive analysis of the important factors that have repercussions on the performance of the system. Finally we expose the conclusions obtained from this article.

[1]  FaloutsosChristos,et al.  Description and performance analysis of signature file methods for office filing , 1987 .

[2]  Prabhakar Raghavan,et al.  Using Taxonomy, Discriminants, and Signatures for Navigating in Text Databases , 1997, VLDB.

[3]  C. Lee Giles,et al.  Accessibility of information on the web , 1999, Nature.

[4]  Christos Faloutsos,et al.  A survey of information retrieval and filtering methods , 1995 .

[5]  Dik Lun Lee,et al.  Efficient Signature File Methods for Text Retrieval , 1995, IEEE Trans. Knowl. Data Eng..

[6]  Simon Stiassny Mathematical analysis of various superimposed coding methods , 1960 .

[7]  Amanda Spink,et al.  Real life information retrieval: a study of user queries on the Web , 1998, SIGF.

[8]  LeeDik Lun,et al.  Efficient Signature File Methods for Text Retrieval , 1995 .

[9]  Christos Faloutsos,et al.  Access methods for text , 1985, CSUR.

[10]  Roger L. Haskin,et al.  Special-Purpose Processors for Text Retrieval. , 1981 .

[11]  C.S. Roberts,et al.  Partial-match retrieval via the method of superimposed codes , 1979, Proceedings of the IEEE.

[12]  L. R. Rasmussen,et al.  In information retrieval: data structures and algorithms , 1992 .

[13]  Balachander Krishnamurthy,et al.  Focusing search in hierarchical structures with directory sets , 1998, CIKM '98.

[14]  Christos Faloutsos,et al.  Signature files: an access method for documents and its analytical performance evaluation , 1984, TOIS.

[15]  Christos Faloutsos,et al.  Description and performance analysis of signature file methods for office filing , 1987, TOIS.

[16]  Calvin N. Mooers,et al.  Application of random codes to the gathering of statistical information , 1948 .

[17]  Dik Lun Lee,et al.  Partitioned signature files: design issues and performance evaluation , 1989, TOIS.