LDA-based Model for Online Topic Evolution Mining

A computational model for online topic evolution mining was established through a latent semantic analysis process on textual data.Topical evolutionary analysis was achieved by tracking the topic trends in different time-slices.In this paper,Latent Dirichlet Allocation(LDA)was extended to the context of online text streams,and an online LDA model was proposed and implemented as well.The main idea is to use the posterior of topic-word distribution of each time-slice to influence the inference of the next time-slice,which also maintains the relevance between the topics.The topic-word and document-topic distributions are inferenced by incremental Gibbs algorithm.Kullback Leibler(KL)relative entropy is uesd to measure the similarity between topics in order to identify topic genetic and topic mutation.Experiments show that the proposed model can discover meaningful topical evolution trends both on English and Chinese corpus.