Paraphrase extraction from interactive Q&A communities

Paraphrase is widely researched in last decade. Most of the researches are focused on acquisition of paraphrase from various language resources and generation of paraphrase. It is a hot topic that how to build large scale of paraphrase corpus, and it is the first step for paraphrase exploration as well. Interactive question answering communities which are a kind of special Q&A platform skipping over natural language understood by computer but just providing a platform for communication among people, have corpus with quick growing rate and sentences in diversified expressions. These advantages provide great value for paraphrase research and extend paraphrase corpus in huge scale. We propose a method on how to extract paraphrase from interactive Q&A community in this paper. The experiment results show the precision, recall and f-measure can reach to 0.7725, 0.7349 and 0.7532 respectively, and paraphrase could be extracted effectively.