Study on Topic Partition Based on Concept Retrieval in Multiple Documents

Topic partition is a significant problem during text structuring in many domains such as information retrieval and automatic summarization.The popular technique is using the frequency of words to express the documents,but u- sing the concept will improve the efficiency of topic partition in multiple documents.The paper presents a method that uses the HowNet to get the concepts,and then uses the technique of clustering to segment the paragraphs of the docu- ments.And this method solves the problem of text structuring in multiple documents.The experimental results show that this method is more efficient for topic partition in multiple documents.