Engineering the Production of Meta-Information: The Abstracting Concern

In order to improve the automatic production of meta-information in the abstracting field, an essential starting point is the exposition of the current state of the art. At the level of content, three significantly different types of procedure stand out, depending on the document structure in question: extracting, rhetorical summarizing and cognitive summarizing. In addition, reticular and graphic models of information representation, much more appropriate to digital environments, offer a complementary method. In all cases, prior definition of the domain, with its specific documents and actors, is needed. However, the low quality of the product derived from full automation (extract and summaries), above all lacking in coherence, led us to the concept of partial automation, a hybrid man-machine methodology that, at least for the time being, seems to be the best solution for the abstract and abstracting problem.

[1]  W. Kintsch,et al.  Strategies of discourse comprehension , 1983 .

[2]  Gobinda G. Chowdhury,et al.  Automatic extraction of citations from the text of English-language patents - an example of template mining , 1996, J. Inf. Sci..

[3]  Betty Ann Mathis Techniques for the Evaluation and Improvement of Computer-Produced Abstracts. , 1972 .

[4]  Inderjeet Mani,et al.  Multi-Document Summarization by Graph Search and Matching , 1997, AAAI/IAAI.

[5]  Joost Kircz,et al.  Rhetorical Structure of Scientific Articles: the Case for Argumentational Analysis in Information Retrieval , 1991, J. Documentation.

[6]  F. W. Lancaster,et al.  Indexing and abstracting in theory and practice , 1991 .

[7]  Dragomir R. Radev,et al.  Generating summaries of multiple news articles , 1995, SIGIR '95.

[8]  Chris Armstrong,et al.  Metadata, recall, and abstracts: can abstracts ever be reliable indicators of document value? , 1997 .

[9]  Angela Goh,et al.  FIES: financial information extraction system , 1998 .

[10]  Giovanni Guida,et al.  Computational models of natural language processing , 1984 .

[11]  James F. Allen Natural language understanding , 1987, Bejnamin/Cummings series in computer science.

[12]  C. Benito Annual Review of Information Science and Technology (ARIST) , 2003 .

[13]  C. D. Paice,et al.  A ‘Select and Generate’ Approach to Automatic Abstracting , 1993 .

[14]  Chris D. Paice,et al.  Constructing literature abstracts by computer: Techniques and prospects , 1990, Inf. Process. Manag..

[15]  Lisa F. Rau,et al.  SCISOR: extracting information from on-line news , 1990, CACM.

[16]  Harold Borko,et al.  Abstracting Concepts and Methods , 1975 .

[17]  Wendy G. Lehnert,et al.  Strategies for Natural Language Processing , 1982 .

[18]  Mark T. Maybury,et al.  Advances in Automatic Text Summarization , 1999 .

[19]  Marie-Francine Moens,et al.  Automatic Indexing and Abstracting of Document Texts , 2000, Computational Linguistics.

[20]  Carolyn E. Lipscomb Indexing and Abstracting in Theory and Practice. , 1999 .

[21]  Francine Chen,et al.  A trainable document summarizer , 1995, SIGIR '95.

[22]  Simone Teufel,et al.  Sentence extraction as a classification task , 1997 .

[23]  J. C. Becker Advanced systems development , 1980 .

[24]  E. F. Skorochod'ko Adaptive Method of Automatic Abstracting and Indexing , 1971, IFIP Congress.

[25]  Gerard Salton,et al.  Research and Development in Information Retrieval , 1982, Lecture Notes in Computer Science.

[26]  Gerald Salton,et al.  Automatic text processing , 1988 .

[27]  William C. Mann,et al.  RHETORICAL STRUCTURE THEORY: A THEORY OF TEXT ORGANIZATION , 1987 .

[28]  G Salton,et al.  Automatic Analysis, Theme Generation, and Summarization of Machine-Readable Texts , 1994, Science.

[29]  Barbara J. Grosz,et al.  Natural-Language Processing , 1982, Artificial Intelligence.

[30]  John Feather,et al.  The management of digital data: A metadata approach , 1998 .

[31]  C. J. Armstrong,et al.  A SURVEY OF THE CONTENT AND CHARACTERISTICS OF ELECTRONIC ABSTRACTS , 1997 .

[32]  Regina Barzilay,et al.  Using Lexical Chains for Text Summarization , 1997 .

[33]  Seiji Miike,et al.  Abstract Generation Based on Rhetorical Structure Extraction , 1994, COLING.

[34]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[35]  Jian Qin,et al.  Web Indexing with Meta Fields: A Survey of Web Objects in Polymer Chemistry , 1998 .

[36]  Karen Spärck Jones Automatic summarising: factors and directions , 1998, ArXiv.

[37]  Jessica L. Milstead Thesauri in a Full-Text World , 1998 .

[38]  Pauline A. Cochrane,et al.  Visualizing Subject Access for 21st Century Information Resources , 1998 .

[39]  Gerrit Kateman,et al.  A systematic representation of analytical chemical actions , 1993, J. Chem. Inf. Comput. Sci..

[40]  Timothy C. Craven Abstracts produced using computer assistance , 2000, J. Am. Soc. Inf. Sci..

[41]  Candy Schwartz Subject and information analysis , 1987 .

[42]  Lorcan Dempsey,et al.  Metadata: a current view of practice and issues , 1998, J. Documentation.

[43]  Philip J. Hayes,et al.  Automatic Extraction of Facts from Press Releases to Generate News Stories , 1992, ANLP.

[44]  Giovanni Guida,et al.  A propositional language for text representation , 1984 .

[45]  Edward A. Fox,et al.  Representation and exchange of knowledge as a basis of information processes: H.J. Dietschmann. (Ed.), North-Holland, Amsterdam, New York and Oxford (1984), 433 pp. US$57.75/ Dfl. 150 , 1985 .

[46]  Chris D. Paice,et al.  The identification of important concepts in highly structured technical papers , 1993, SIGIR.

[47]  H. P. Edmundson,et al.  New Methods in Automatic Extracting , 1969, JACM.

[48]  Gobinda G. Chowdhury,et al.  Template Mining for Information Extraction from Digital Documents , 1999, Libr. Trends.

[49]  Helena Ahonen Knowledge Discovery in Documents by Extracting Frequent Word Sequences , 1999, Libr. Trends.

[50]  Gerard Salton,et al.  Automatic Text Structuring and Summarization , 1997, Inf. Process. Manag..

[51]  Lisa F. Rau,et al.  Knowledge organization and access in a conceptual information system , 1987, Inf. Process. Manag..

[52]  Lisa F. Rau,et al.  Automatic Condensation of Electronic Publications by Sentence Selection , 1995, Inf. Process. Manag..

[53]  James E. Rush,et al.  Automatic abstracting and indexing. II. Production of indicative abstracts by application of contextual inference and syntactic coherence criteria , 1971 .

[54]  Journal of Information Science , 1984 .

[55]  Daniel Marcu,et al.  From discourse structures to text summaries , 1997 .