ICTIR Subtopic Mining System at NTCIR-9 INTENT Task

This paper describes the approaches and results of our Chinese subtopic mining system for the NTCIR-9 INTENT task. In this system, we first find out the related queries from query logs, then group them into different clusters using a frequent term-set based clustering algorithm. Finally, the central query of each cluster is used to represent the subtopic of this cluster. Encyclopedia and commercial search engines are also used to enhance the mining effectiveness. The evaluation results of our runs show that our approaches perform well. Among the 5 runs we submit, ICTIR-S-C-1 is ranked within top five in terms of D#-nDCG for l=10, 20, 30 and outperforms others in terms of I-rec.