Structural Queries in Electronic Corpora

We present a methodology for automatically constructing structural hyperlinks in electronic technical corpora. A structural hyperlink connects components of a document that have specified structural properties with word-based content similarity. Our approach enables queries that may be posed in terms of keywords, as well as structural segments such as definitions, figures, etc.

[1]  James Allan,et al.  Automatic hypertext link typing , 1996 .

[2]  Bruce Randall Donald,et al.  Analyzing teams of cooperating mobile robots , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[3]  Michael Fuller,et al.  Structured answers for a large structured document collection , 1993, SIGIR.

[4]  James Allan,et al.  Selective text utilization and text traversal , 1995, Int. J. Hum. Comput. Stud..

[5]  Gerard Salton,et al.  Automatic Text Theme Generation and the Analysis of Text Structure , 1994 .

[6]  G Salton,et al.  Automatic Analysis, Theme Generation, and Summarization of Machine-Readable Texts , 1994, Science.

[7]  Heikki Mannila,et al.  Retrieval from hierarchical texts by partial patterns , 1993, SIGIR.

[8]  W. Bruce Croft,et al.  Inference networks for document retrieval , 1989, SIGIR '90.

[9]  Sargur N. Srihari,et al.  Classification of newspaper image using texture analysis , 1989 .

[10]  Daniel P. Huttenlocher,et al.  Comparing Images Using the Hausdorff Distance , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[12]  Anil K. Jain,et al.  Address block location on envelopes using Gabor filters , 1992, Pattern Recognit..

[13]  Devika Subramanian,et al.  Customizing information capture and access , 1997, TOIS.

[14]  Masaaki Mizuno,et al.  Document Recognition System with Layout Structure Generator , 1990, MVA.

[15]  Christian Plaunt,et al.  Subtopic structuring for full-length document access , 1993, SIGIR.

[16]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[17]  Yasuaki Nakano,et al.  Segmentation methods for character recognition: from segmentation to document structure analysis , 1992, Proc. IEEE.

[18]  Mahesh Viswanathan,et al.  A prototype document image analysis system for technical journals , 1992, Computer.

[19]  Devika Subramanian,et al.  Customizing multimedia information access , 1995, CSUR.

[20]  AutomatedDocument StructuringDaniela Rus Geometric Algorithms and Experiments for Automated Document Structuring , 1997 .

[21]  James Allan,et al.  Automatic Hypertext Construction , 1995 .

[22]  Sargur N. Srihari,et al.  Classification of newspaper image blocks using texture analysis , 1989, Comput. Vis. Graph. Image Process..

[23]  Haruo Asada,et al.  Major components of a complete text reading system , 1992 .

[24]  Gerard Salton,et al.  The smart document retrieval project , 1991, SIGIR '91.

[25]  Devika Subramanian,et al.  Information Retrieval, Information Structure, and Information Agents , 1997, Intelligent Hypertext.

[26]  Luis Gravano,et al.  The Efficacy of GlOSS for the Text Database Discovery Problem , 1993, SIGMOD 1993.

[27]  Daniela Rus,et al.  Using White Space for Automated Document Structuring , 1994 .