论文信息 - Automatic knowledge extraction from manufacturing research publications

Automatic knowledge extraction from manufacturing research publications

Knowledge mining is a young and rapidly growing discipline aiming at automatically identifying valuable knowledge in digital documents. This paper presents the results of a study of the application of document retrieval and text mining techniques to extract knowledge from CIRP research papers. The target is to find out if and how such tools can help researchers to find relevant publications in a cluster of papers and increase the citation indices their own papers. Two different approaches to automatic topic identification are investigated. One is based on Latent Dirichlet Allocation of a huge document set, the other uses Wikipedia to discover significant words in papers. The study uses a combination of both approaches to propose a new approach to efficient and intelligent knowledge mining.

[1] J. W. Uys. A framework for exploiting electronic documentation in support of innovation processes , 2010 .

[2] Rada Mihalcea,et al. Wikify!: linking documents to encyclopedic knowledge , 2007, CIKM '07.

[3] Moshe Shpitalni,et al. Virtual Research Lab: A New Way To Do Research , 2006 .

[4] Man Lung Yiu,et al. Group-by skyline query processing in relational engines , 2009, CIKM.

[5] Andreas Riel,et al. A Knowledge Mining Approach to Document Classification , 2009 .

[6] William E. Moen,et al. Using Encyclopedic Knowledge for Automatic Topic Identification , 2009, CoNLL.

[7] Alain Bernard,et al. Customised high-value document generation , 2012, ArXiv.