论文信息 - Extracting conceptual relationships from specialized documents

Extracting conceptual relationships from specialized documents

Conceptual modeling has been fundamental to the management of structured data. However, its value is increasingly being recognized for knowledge management in general. In trying to develop suitable conceptual models for unstructured information, issues such as the level of representation and complexity of processing techniques arise. Here, we investigate the use of a conceptual model that is simple enough to allow efficient automatic extraction from two kinds of documents--scientific research papers and patents. Our model focused on the problem-solution relationship that is central to the analysis of scientific papers, while allowing supporting relationships such as methods and claims. We evaluated the utility of the approach by building a prototype system and carrying out experiments that assessed the accuracy level of the techniques used in building the model and the acceptability of the model through preliminary user studies. The feedback from these experiments shows promising results that support our choice in the tradeoffs between the granularity of the model and the processing techniques used. We discuss a variety of issues that arouse from this project and describe several directions for future work.

Eric S. K. Yu | Bowen Hui

[1] M. F. Porter,et al. An algorithm for suffix stripping , 1997 .

[2] William C. Mann,et al. RHETORICAL STRUCTURE THEORY: A THEORY OF TEXT ORGANIZATION , 1987 .

[3] Chris D. Paice,et al. The automatic generation of literature abstracts: an approach based on the identification of self-indicating phrases , 1980, SIGIR '80.

[4] Jakob Nielsen,et al. Usability engineering , 1997, The Computer Science and Engineering Handbook.

[5] Karen Sparck Jones,et al. Book Reviews: Evaluating Natural Language Processing Systems: An Analysis and Review , 1996, CL.

[6] Chris D. Paice,et al. Constructing literature abstracts by computer: Techniques and prospects , 1990, Inf. Process. Manag..

[7] Karen Spärck Jones. Towards Better NLP System Evaluation , 1994, HLT.

[8] H. P. Edmundson,et al. New Methods in Automatic Extracting , 1969, JACM.

[9] Lois L. Earl,et al. Experiments in automatic extracting and indexing , 1970, Inf. Storage Retr..

[10] Daniel Marcu,et al. From discourse structures to text summaries , 1997 .

[11] Mark T. Maybury,et al. Advances in Automatic Text Summarization , 1999 .