An algorithmic approach to concept exploration in a large knowledge network (automatic thesaurus consultation): symbolic branch-and-bound search vs. connectionist Hopfield net activation

This paper presents a framework for knowledge discovery and concept exploration. To enhance the concept exploration capability of knowledge‐based systems and to alleviate the limitations of manual browsing, we have developed two spreading activation‐based algorithms for concept exploration in large, heterogeneous networks of concepts (e.g., multiple thesauri). The first algorithm, based on the symbolic AI paradigm, performs a conventional branch‐and‐bound search on a semantic net representation to identify other highly relevant concepts (a serial, optimal search process). The second algorithm, based on the neural network approach, executes the Hopfield net parallel relaxation and convergence process to identify “convergent” concepts for a set of initial query terms (a parallel, heuristic search process). Both algorithms can be adopted for automatic consultation of multiple thesauri. We tested the two algorithms on a large text‐based knowledge network of about 13,000 nodes (terms) and 80,000 directed links in the area of computing technologies. This knowledge network was created from two external thesauri and one automatically generated thesaurus. We conducted experiments to compare the behavior and performance of the two algorithms with a hypertext‐like manual browsing process. Our experiments revealed that manual browsing achieved higher term recall but lower term precision than the algorithmic systems. However, it was also a much more laborious and cognitively demanding process. In document retrieval, there were no statistically significant differences in document recall and precision between the algorithms and the manual browsing process. In light of the effort required by manual browsing, our proposed algorithmic approach presents a viable option for efficiently traversing large‐scale knowledge networks built from multiple thesauri. © 1995 John Wiley & Sons, Inc.
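
The two search styles described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the toy network, its link weights, the function names, the sigmoid parameters (`theta_0`, `theta_j`), the convergence threshold `epsilon`, and the choice to clamp the query terms at full activation are all assumptions made for illustration. The first function expands terms serially, best score first, in the spirit of branch‐and‐bound; the second updates all terms in parallel until the activations stop changing, in the spirit of Hopfield net relaxation.

```python
import heapq
import math

# Hypothetical toy network: directed, weighted links between terms.
# The weights (assumed to lie in (0, 1]) are invented for this sketch.
NETWORK = {
    "neural network": {"hopfield net": 0.9, "machine learning": 0.8},
    "hopfield net": {"neural network": 0.9, "parallel relaxation": 0.7},
    "machine learning": {"neural network": 0.8, "classification": 0.6},
    "parallel relaxation": {"hopfield net": 0.7},
    "classification": {"machine learning": 0.6},
}


def branch_and_bound(network, seeds, max_terms=4):
    """Serial search: always expand the highest-scoring unexpanded term."""
    best = {term: 1.0 for term in seeds}
    heap = [(-1.0, term) for term in seeds]  # max-heap via negated scores
    heapq.heapify(heap)
    expanded, order = set(), []
    while heap and len(order) < max_terms:
        neg_score, term = heapq.heappop(heap)
        if term in expanded:
            continue  # stale queue entry; term was already expanded
        expanded.add(term)
        order.append((term, -neg_score))
        for neighbor, weight in network.get(term, {}).items():
            score = -neg_score * weight  # path score = product of link weights
            if score > best.get(neighbor, 0.0):
                best[neighbor] = score
                heapq.heappush(heap, (-score, neighbor))
    return order


def hopfield_activation(network, seeds, theta_0=0.5, theta_j=0.1,
                        epsilon=1e-3, max_iter=50):
    """Parallel relaxation: update every term at once until convergence."""
    terms = sorted(network)
    mu = {t: (1.0 if t in seeds else 0.0) for t in terms}
    for _ in range(max_iter):
        new_mu = {}
        for j in terms:
            if j in seeds:
                new_mu[j] = 1.0  # assumption: query terms stay fully active
                continue
            net = sum(network[i].get(j, 0.0) * mu[i] for i in terms)
            # Sigmoid transfer function squashes the net input into (0, 1).
            new_mu[j] = 1.0 / (1.0 + math.exp(-(net - theta_0) / theta_j))
        change = sum(abs(new_mu[t] - mu[t]) for t in terms)
        mu = new_mu
        if change < epsilon:
            break  # activations have stabilized
    return mu


if __name__ == "__main__":
    query = {"neural network"}
    for term, score in branch_and_bound(NETWORK, query):
        print(f"branch-and-bound  {term}: {score:.2f}")
    for term, level in sorted(hopfield_activation(NETWORK, query).items()):
        print(f"hopfield          {term}: {level:.2f}")
```

In this sketch the serial search stops after a fixed number of expanded terms, while the parallel process stops when the total change in activation falls below `epsilon`; both stopping rules are illustrative choices rather than details taken from the paper.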
