A neural cascade architecture for document retrieval

This paper describes a fuzzy neural approach adopted for information retrieval. After a thematic analysis of documents that produces two conceptual sets called themes and rhemes, a fuzzy representation is derived. The fuzzy representation reflects the hierarchical nature of texts and suggests the use of type-2 fuzzy sets. It is then translated into a cascade of two neural networks. The first level in this cascade is a fuzzy associative memory network (FAM) which maps rhemes to themes and the second level consists of a fuzzy adaptive resonance theory network (Fuzzy ART) which relates themes to document categories. The approach was experimentally evaluated.

[1]  F. Daneš Functional Sentence Perspective and the Organization of the Text , 1974 .

[2]  Tu Bao Ho,et al.  Information Retrieval Using Rough Sets , 1998 .

[3]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[4]  Weiguo Fan,et al.  Effective information retrieval using genetic algorithms based matching functions adaptation , 2000, Proceedings of the 33rd Annual Hawaii International Conference on System Sciences.

[5]  Gloria Bordogna,et al.  Controlling retrieval through a user-adaptive representation of documents , 1995, Int. J. Approx. Reason..

[6]  Norman Y. Foo,et al.  Semantic distance in conceptual graphs , 1992 .

[7]  C. J. van Rijsbergen,et al.  Towards an information logic , 1989, SIGIR '89.

[8]  Guszti Bartfai,et al.  An ART-based modular architecture for learning hierarchical clusterings , 1996, Neurocomputing.

[9]  Ross Wilkinson,et al.  Using the cosine measure in a neural network for document retrieval , 1991, SIGIR '91.

[10]  Dana Vrajitoru,et al.  Large Population or Many Generations for Genetic Algorithms? Implications in Information Retrieval , 2000 .

[11]  Mounia Lalmas,et al.  Dempster-Shafer's theory of evidence applied to structured documents: modelling uncertainty , 1997, SIGIR '97.

[12]  Michael D. Gordon User‐based document clustering by redescribing subject descriptions with a genetic algorithm , 1991 .

[13]  Vasant Honavar,et al.  Learn++: an incremental learning algorithm for supervised neural networks , 2001, IEEE Trans. Syst. Man Cybern. Part C.

[14]  W. Bruce Croft,et al.  Using Probabilistic Models of Document Retrieval without Relevance Information , 1979, J. Documentation.

[15]  Stephen Grossberg,et al.  Fuzzy ART: Fast stable learning and categorization of analog patterns by an adaptive resonance system , 1991, Neural Networks.

[16]  Timo Honkela,et al.  WEBSOM - Self-organizing maps of document collections , 1998, Neurocomputing.

[17]  Edward E. Smith,et al.  On the adequacy of prototype theory as a theory of concepts , 1981, Cognition.

[18]  Padmini Srinivasan,et al.  Intelligent information retrieval using rough set approximations , 1989, Inf. Process. Manag..