An Analysis of the Search Engine User Behaviors Based on Hadoop

Search engine user behaviors analysis is a focus of network information retrieval.It is a method of analyzing the user's behaviors through clicks to mine useful information to improve search engine's efficiency and retrieval services.In face of easy expansion and programming bottlenecks in traditional parallel computation models,a massive log data processing model based on Hadoop is given,which improves scalability and easy programming through Hadoop Distributed File System and MapReduce.Moreover,the experiment of analyzing 22 million query logs of the Sogou search engine in one month is carried out based on this model.The analysis result is instructive and meaningful to mastering the user's behaviors,evaluating and improving the searching and sorting algorithms.