Automatic extraction of citations from the text of English-language patents - an example of template mining

Methods for automatically isolating and extracting biblio graphic references from the full texts of patents are described and evaluated; these include citations both to patents and to other bibliographic sources. Patents are unusual as citing documents in that citations occur princi pally in the text of the abstracts or description parts of the documents, rather than as footnotes or in separate sections. A template mining approach has been developed for this purpose, to relieve patent examiners of the chore of doing this manually.

[1]  Chris D. Paice,et al.  Constructing literature abstracts by computer: Techniques and prospects , 1990, Inf. Process. Manag..

[2]  C. D. Paice,et al.  A ‘Select and Generate’ Approach to Automatic Abstracting , 1993 .

[3]  Paul E. Blower,et al.  Extraction of chemical reaction information from primary journal text using computational linguistics techniques. 2. Semantic phase , 1984, J. Chem. Inf. Comput. Sci..

[4]  Gerrit Kateman,et al.  A systematic representation of analytical chemical actions , 1993, J. Chem. Inf. Comput. Sci..

[5]  Stephen Van Dulken Introduction to Patents Information , 1990 .

[6]  Francis Narin,et al.  Patent Citation Cycles , 1993, Libr. Trends.

[7]  Melissa Macpherson,et al.  Distilling information from text: the EDS TemplateFiller system , 1993 .

[8]  Gobinda G. Chowdhury,et al.  Automatic interpretation of the texts of chemical patent abstracts. 1. Lexical analysis and categorization , 1992, J. Chem. Inf. Comput. Sci..

[9]  G. J. Postma,et al.  Chapter 36 TICA: A Program for the Extraction of Analytical Chemical Information from Texts , 1990 .

[10]  Sam Coates-Stephens,et al.  The analysis and acquisition of proper names for robust text understanding , 1992 .

[11]  Gobinda G. Chowdhury,et al.  Automatic interpretation of the texts of chemical patent abstracts. 2. Processing and results , 1992, J. Chem. Inf. Comput. Sci..

[12]  G. J. Postma,et al.  TICA : a system for the extraction of data from analytical chemical text , 1990 .

[13]  Yorick Wilks,et al.  Evaluation of an Algorithm for the Recognition and Classification of Proper Names , 1996, COLING.

[14]  Maria T. Pazienza,et al.  Information Extraction , 2002, Lecture Notes in Computer Science.

[15]  Francis Narin,et al.  Status report: Linkage between technology and science , 1992 .

[16]  Wendy A. Warr,et al.  Chemical Information Management , 1992 .

[17]  Melissa Macpherson,et al.  Distilling Information from Text: The EDS TemplateFiller System , 1993, J. Am. Soc. Inf. Sci..