Automatic Summarization for Chinese Text Based on Sub Topic Partition and Sentence Features

With the explosion of electronic information on web, there is the increasing requirement to obtain the information needed accurately and efficiently. In this article, a method of automatic summarization based on sub topic partition and sentence features is proposed, in which the sentence weight is computed based on LexRank algorithm combining with the score of its own features in every sub topic, such as its length, position, cue words and structure. In addition, we reduce redundancy of candidate sentence collection. With evaluation on six different genres of data sets, our method could get more comprehensive and high-quality summarization with less redundancy than the original LexRank algorithm.