Study on Mass Chinese Short Message Text Density Clustering

According to the characteristics of short message text,a clustering method of the Chinese message based on density is given.High-density region of the text data is divided into clusters and a seed queue is constructed,which is arranged in ascending order of the reachable similarity,to store the text of short message text to be expanded.The text message is disposed in a specific order.In order to make higher-density clusters to complete first,the object is selected according to a greater threshold similarity,namely that the dense space text object which can be rapidly located makes the high-density cluster complete first.Experimental result shows that this clustering method's efficiency is increased 10 times of K-means method.