Information Retrieval, Information Structure, and Information Agents

This paper presents a customizable architecture for software agents that capture and access information in large, heterogeneous, distributed electronic repositories. The key idea is to exploit underlying structure at various levels of granularity to build high-level indices with task-specific interpretations. Information agents construct such indices and are configured as a network of reusable modules called structure detectors and segmenters. We illustrate our architecture with the design and implementation of smart information filters in two contexts: retrieving stock market data from Internet newsgroups, and retrieving technical reports from Internet FTP sites.
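
As a rough illustration of the idea of composing an agent from reusable segmenters and structure detectors, the following minimal sketch may help. It is not the paper's implementation: the names `Agent`, `blank_line_segmenter`, and `looks_like_table`, the blank-line splitting rule, and the column-gap heuristic are all assumptions chosen for illustration only.

```python
import re
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical module types (assumptions, not taken from the paper):
# a segmenter splits raw text into pieces, a detector accepts or rejects a piece.
Segmenter = Callable[[str], List[str]]
Detector = Callable[[str], bool]


def blank_line_segmenter(text: str) -> List[str]:
    """Split text into blocks separated by one or more blank lines."""
    return [b.strip("\n") for b in re.split(r"\n\s*\n", text) if b.strip()]


def looks_like_table(block: str) -> bool:
    """Crude detector: two or more lines, each with at least two
    fields separated by a gap of two or more whitespace characters."""
    lines = block.splitlines()
    return len(lines) >= 2 and all(
        len(re.split(r"\s{2,}", ln.strip())) >= 2 for ln in lines
    )


@dataclass
class Agent:
    """A tiny 'network' of one segmenter feeding several labeled detectors."""
    segmenter: Segmenter
    detectors: Dict[str, Detector]

    def index(self, text: str) -> Dict[str, List[str]]:
        """Map each detector label to the segments that detector accepts."""
        result: Dict[str, List[str]] = {label: [] for label in self.detectors}
        for seg in self.segmenter(text):
            for label, detect in self.detectors.items():
                if detect(seg):
                    result[label].append(seg)
        return result


# Made-up newsgroup-style post used only to exercise the sketch.
post = "Closing prices today:\n\nIBM   101.5\nDEC   44.2\n\nSee you tomorrow."
agent = Agent(blank_line_segmenter, {"table": looks_like_table})
index = agent.index(post)
```

Here the index maps a task-specific label (`"table"`) to the segments that exhibit that structure. Swapping in a different segmenter or adding detectors changes the agent's behavior without touching the other modules, which is the kind of customization the abstract describes.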
