Rule-based method for patent abstract automatic extraction and keyword indexing
暂无分享,去创建一个
The invention a rule-based method for patent abstract automatic extraction and keyword indexing, which mainly comprises the steps: automatically marking key words such as characteristic and technique words and phrases in the full text of the patent literature according to a background knowledge base; determining the functions and mutual relationships of paragraphs in the article according to the types, times, position relations and the like of the occurrence of the characteristic words and phrases in the paragraphs; extracting key paragraphs of the paragraphs to form the extract; and finally, extracting key works from the extract to form the index items of the literature. The method for patent abstract automatic extraction and keyword indexing of the invention consists of five modules: a knowledge base module, a characteristic work marking module, a paragraph analysis and evaluation module, an extract automatic writing module and an indexing module. The method of the invention can obviously improve the efficiency of the deep processing of patent data and reduce the cost of the data processing. And the indexing result has a high retrieval value.