Method for automatically extracting sentence templates
The invention relates to a method for automatically extracting sentence templates, comprising the following steps: a text is split into sentences at punctuation marks; each sentence is prefixed with a serial number according to its order in the text; each sentence is divided into word-level tokens using word segmentation; after segmentation, the sentences are arranged in ascending or descending order of word count and divided into groups accordingly; and the LCS algorithm is applied to the sentences in the same group to obtain their longest common subsequence, which serves as the sentence template. The invention can automatically and efficiently compile statistics on commonly used words and sentence patterns from large volumes of text.
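The steps described in the abstract can be sketched in Python as follows. This is a minimal illustration under several assumptions that the abstract does not specify: whitespace splitting stands in for the word segmentation step (Chinese text would need a dedicated segmenter), sentences are grouped only when their word counts are identical, and the common subsequence of a group is approximated by folding a pairwise LCS across its members. All function names (`split_sentences`, `tokenize`, `lcs`, `extract_templates`) are hypothetical.

```python
import re
from collections import defaultdict
from functools import reduce


def split_sentences(text):
    """Split text at sentence punctuation and number the sentences from 1."""
    parts = re.split(r"[.!?;。!?;]+", text)
    kept = [p.strip() for p in parts if p.strip()]
    return [(i + 1, s) for i, s in enumerate(kept)]


def tokenize(sentence):
    """Assumption: whitespace splitting stands in for word segmentation."""
    return sentence.split()


def lcs(a, b):
    """Return one longest common subsequence of sequences a and b as a list."""
    m, n = len(a), len(b)
    # dp[i][j] holds the LCS length of a[i:] and b[j:].
    dp = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m - 1, -1, -1):
        for j in range(n - 1, -1, -1):
            if a[i] == b[j]:
                dp[i][j] = 1 + dp[i + 1][j + 1]
            else:
                dp[i][j] = max(dp[i + 1][j], dp[i][j + 1])
    # Walk the table from the start to recover the subsequence itself.
    out, i, j = [], 0, 0
    while i < m and j < n:
        if a[i] == b[j]:
            out.append(a[i])
            i += 1
            j += 1
        elif dp[i + 1][j] >= dp[i][j + 1]:
            i += 1
        else:
            j += 1
    return out


def extract_templates(text):
    """Group sentences by word count and derive one template per group."""
    groups = defaultdict(list)
    for number, sentence in split_sentences(text):
        tokens = tokenize(sentence)
        groups[len(tokens)].append((number, tokens))

    templates = {}
    for count in sorted(groups):  # ascending order of word count
        members = groups[count]
        if len(members) < 2:
            continue  # a single sentence has nothing to be compared with
        # Assumption: folding pairwise LCS approximates a multi-sentence LCS.
        template = reduce(lcs, (tokens for _, tokens in members))
        templates[count] = {
            "sentences": [number for number, _ in members],
            "template": template,
        }
    return templates
```

For example, calling `extract_templates` on "I like green tea. You like black tea." groups the two four-word sentences together and yields the template `["like", "tea"]`; the positions dropped by the LCS correspond to the variable slots of the template.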