Research on Internet Hot Topic Detection Based on MapReduce Architecture

Internet public opinion increasingly influences daily lives of peoples and social stability. With the development of the Internet, the amount of information on the Internet is huge and updated quickly, which makes public opinion mining face enormous challenges in dealing with huge amounts of information and complex data. This paper proposes an internet Hot Topic Detection scheme based on cloud computing platform. The MapReduce programming model is introduced into the network public opinion analysis for processing massive, complex data. This scheme uses named entity words as text features, and the title and body are combined as two-dimensional VSM (vector space model) to represent text, to improve the accuracy of the Internet public opinion and the system response speed.