Sensitive topic detection based on conditional random fields

This paper introduced the conditional random field(CRFs) to construct the sensitive topic detection method based on CRFs.Firstly,the method used a text s selected randomly as observing sequence of CRFs,and used the rest of detected text as its state sequence to calculate the correlative probabilities between the text s and the rest of detected text.Secondly,it combined the text with the highest correlative probability and text s to represent a category,meanwhile,the text with lowest correlative probability was represented other category.Therefore,these tow categories were as observing sequence of CRFs,and the rest of detected text were used as its state sequence.Thirdly,it carried out the iteration as abovementioned to detect sensitive topic.The experimental results show that the cost function and macro average F of the proposed method can achieve high effect.