Evaluation of the Document Categorization in "Fixed-point Observatory"
暂无分享,去创建一个
“Fixed-point observatory” is a prototype to support users to grasp recent trends in the fields of their interest from large-scale information. It consists of content-based categorizer, named-entity-based categorizer and multiple-document summarizer. We have evaluated the content-based categorizer, which adopts the simple “bag-of-words” model. Though the quality seems be sufficient for rough classification, it might be improved to use the categorizer in other applications.
[1] Yoshihiro Ueda,et al. Toward the "At-a-glance" Summary: Phrase-representation Summarization Method , 2000, COLING.
[2] Yoshihiro Ueda,et al. Document Retrieval in Consideration of the Amount of Term Frequencies , 2001, NTCIR.
[3] David R. Karger,et al. Scatter/Gather: a cluster-based approach to browsing large document collections , 1992, SIGIR '92.