Research on open category hierarchy trees and clustering algorithms for Chinese online encyclopedias

To categorize and retrieve encyclopedia entries by open category, this paper proposes an open category clustering algorithm based on word co-occurrence and semantic analysis, together with a method for generating open category hierarchy trees. To improve the quality of the hierarchy trees, it also proposes a hierarchy tree clustering algorithm based on similarity and correlation computation. The experimental data set was downloaded from the Hudong online encyclopedia. The experimental results show that the proposed algorithms achieve high precision and that the generated hierarchy trees effectively improve the efficiency of retrieving encyclopedia entries.
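
The abstract does not spell out the clustering procedure, so the following is only a minimal sketch of the general idea of grouping open categories by co-occurrence, not the paper's algorithm. It assumes that two open categories are related when they tag many of the same entries, measures this with Jaccard similarity, and merges categories whose similarity reaches a threshold (single-link grouping). The function names, the `threshold` parameter, and the sample data are hypothetical, and the semantic analysis step mentioned above is omitted.

```python
from collections import defaultdict
from itertools import combinations


def cooccurrence_similarity(entries):
    """Jaccard similarity between open categories, based on shared entries.

    `entries` maps an entry title to the list of open categories tagging it
    (an assumed input format, not one taken from the paper).
    """
    tagged = defaultdict(set)  # category -> set of entries it tags
    for entry, categories in entries.items():
        for category in categories:
            tagged[category].add(entry)

    sims = {}
    for a, b in combinations(sorted(tagged), 2):
        shared = len(tagged[a] & tagged[b])
        if shared:  # keep only pairs that co-occur at least once
            sims[(a, b)] = shared / len(tagged[a] | tagged[b])
    return sims


def cluster_categories(entries, threshold=0.5):
    """Group categories whose co-occurrence similarity is >= threshold."""
    sims = cooccurrence_similarity(entries)
    parent = {}  # union-find forest over category names

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    # Register every category so that isolated ones form their own cluster.
    for categories in entries.values():
        for category in categories:
            find(category)

    # Merge each sufficiently similar pair (single-link grouping).
    for (a, b), score in sims.items():
        if score >= threshold:
            parent[find(a)] = find(b)

    groups = defaultdict(set)
    for category in list(parent):
        groups[find(category)].add(category)
    return sorted(sorted(group) for group in groups.values())


if __name__ == "__main__":
    sample = {
        "entry1": ["physics", "science"],
        "entry2": ["physics", "science"],
        "entry3": ["science", "biology"],
        "entry4": ["poetry", "literature"],
        "entry5": ["poetry", "literature"],
    }
    print(cluster_categories(sample, threshold=0.5))
```

In this sketch, lowering `threshold` merges more loosely related categories into one cluster, and raising it keeps clusters tighter. A hierarchy tree could in principle be built by repeating the grouping at several thresholds, but that step is likewise an assumption rather than the method described in the paper.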