Documents Type Identification Based on Statistical Characteristic

【Abstract】Malicious tampering with the type of document to conceal identity documents so as to entice users to visit real structure, avoiding detection and hiding data is the most common computer crime means. This paper presents a novel statistical method to identify document types, which recognizing effectively the attributes of the tampered document types. According to that the same type of documents are similar with the statistical features in multidimensional space, the basic assumption that judges this similarity is given, a model based on Euclidean distance spherical space toroidal model and k-spheroid space toroidal model are designed. Meanwhile, both models are optimized by the heavily weighted Euclidean distance based on the document statistics, and the correctness and efficiency of the similarities judgment are improved. 【Key words】Computer forensics; Documentary statistical characteristic; Spherical space toroidal model; k-spheroid space toroidal model

[1]  Mohammad Hossain Heydari,et al.  Content based file type detection algorithms , 2003, 36th Annual Hawaii International Conference on System Sciences, 2003. Proceedings of the.

[2]  William H. Allen Computer Forensics , 2005, IEEE Secur. Priv..