An Improved Parallel Algorithm for Text Categorization
This paper proposes a MapReduce-based Rocchio relevance feedback algorithm, which improves on the traditional Rocchio algorithm within the MapReduce paradigm, to address the problem of filtering massive volumes of information. Traditional text classification algorithms play a vital role in information filtering.
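The abstract does not describe the algorithm's details, so the following is only a minimal sketch of how a plain Rocchio (centroid) classifier can be phrased as map and reduce steps. It runs in a single process and simulates the shuffle with a dictionary. The function names (`map_doc`, `reduce_class`, `train`, `classify`), the sparse-dictionary document vectors, and the use of cosine similarity are all assumptions made for illustration. It does not reproduce the paper's improvement or its relevance feedback weighting.

```python
from collections import defaultdict
from math import sqrt


def map_doc(label, vector):
    # Map step (hypothetical): emit the class label as key and the
    # document's term-weight vector plus a count of 1 as value.
    yield label, (vector, 1)


def reduce_class(values):
    # Reduce step (hypothetical): sum all vectors of one class and divide
    # by the number of documents to obtain the class centroid.
    total = defaultdict(float)
    n_docs = 0
    for vector, count in values:
        n_docs += count
        for term, weight in vector.items():
            total[term] += weight
    return {term: weight / n_docs for term, weight in total.items()}


def train(labeled_docs):
    # Simulate the shuffle phase in memory: group mapper output by key,
    # then run one reducer per class label.
    grouped = defaultdict(list)
    for label, vector in labeled_docs:
        for key, value in map_doc(label, vector):
            grouped[key].append(value)
    return {label: reduce_class(values) for label, values in grouped.items()}


def cosine(a, b):
    # Cosine similarity between two sparse vectors stored as dicts.
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = sqrt(sum(w * w for w in a.values()))
    norm_b = sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def classify(centroids, vector):
    # Assign the document to the class whose centroid is most similar.
    return max(centroids, key=lambda label: cosine(centroids[label], vector))


if __name__ == "__main__":
    # Toy term-weight vectors, invented for this example only.
    docs = [
        ("sports", {"ball": 2.0, "goal": 1.0}),
        ("sports", {"ball": 1.0, "team": 1.0}),
        ("tech", {"cpu": 2.0, "code": 1.0}),
    ]
    centroids = train(docs)
    print(classify(centroids, {"ball": 1.0}))
```

Because the reducer only sums vectors and counts, the same function could also serve as a combiner on each mapper node, which is what makes centroid computation a natural fit for MapReduce.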