Self-organizing documentary maps for information retrieval

This paper presents a neural-network-based approach for information retrieval, an important issue in natural language processing. We first describe a method for creating self-organizing documentary maps - visible and continuous representations in which all queries and documents are mapped in topological order according to their similarities. We then show that documents related to queries can be retrieved by merely calculating the Euclidean distances between the positions at which queries and documents are placed and choosing the N closest documents in the ranking order for each query. Small-scale computer experiments have demonstrated that the proposed method is capable of high precision.

[1]  Masaki Murata,et al.  Japanese probabilistic information retrieval using location and category information , 2000, IRAL '00.

[2]  Claire Cardie,et al.  An Analysis of Statistical and Syntactic Phrases , 1997, RIAO.

[3]  ChengXiang Zhai,et al.  Noun-Phrase Analysis in Unrestricted Text for Information Retrieval , 1996, ACL.

[4]  Masaki Murata,et al.  Self-organizing semantic maps of Japanese nouns in terms of adnominal constituents , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[5]  SaltonGerard,et al.  Term-weighting approaches in automatic text retrieval , 1988 .

[6]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[7]  Teuvo Kohonen,et al.  Self-Organizing Maps, Second Edition , 1997, Springer Series in Information Sciences.

[8]  Takenobu Tokunaga,et al.  The Use of WordNet in Information Retrieval , 1998, WordNet@ACL/COLING.