Research on Network Content Audit Based on Information Fingerprint

Based on the specific features of advertisement robot widely existing in network information, a mixture strategy for network content filtering is presented in this paper. The strategy can determine quickly by calculating the fingerprint of network content, and Bayesian filtering is reused when the strategy can not determine. The result of this experiment shows that the strategy is more advanced than the sole Bayesian method in improving system running efficiency and finding out the phenomenon of advertisement robot.