Content representation and pairwise feature matching method for virtual reconstruction of shredded documents

In forensics, virtual reconstruction of shredded documents is a well-known problem. Semi-automatic document reconstruction systems are usually used for virtual reconstruction. Here a content feature extraction, a content feature representation and a 1:1-matching-method for the use in such reconstruction systems are presented. The content representation is given in the form of so-called abstract structure objects (ASO), which are calculated based on foreground information distributions and on color categories. The presented 1:1-matching-method calculates local optima and places these optima in a global context in relation to cut-edge-pair. Experiments were performed on different real-world-datasets with different foreground characteristics. We show the good discrimination power of the presented method for the use in reconstruction systems regardless of the type of foreground information.

[1]  Ya Wang,et al.  A Two-Stage Approach for Reconstruction of Cross-Cut Shredded Text Documents , 2014, 2014 Tenth International Conference on Computational Intelligence and Security.

[2]  Cinthia O. A. Freitas,et al.  Reconstructing strip-shredded documents using color as feature matching , 2009, SAC '09.

[3]  Wilfried Philips,et al.  Semiautomatic reconstruction of strip-shredded documents , 2005, IS&T/SPIE Electronic Imaging.

[4]  Naren Ramakrishnan,et al.  The Deshredder: A visual analytic approach to reconstructing shredded documents , 2012, 2012 IEEE Conference on Visual Analytics Science and Technology (VAST).

[5]  Benjamin B. Kimia,et al.  On solving 2D and 3D puzzles using curve matching , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[6]  Yu Zhou,et al.  Information Quantity Based Automatic Reconstruction of Shredded Chinese Documents , 2014, 2014 IEEE 26th International Conference on Tools with Artificial Intelligence.

[7]  Jörg Krüger,et al.  A Contour Matching Algorithm to Reconstruct Ruptured Documents , 2010, DAGM-Symposium.

[8]  Huei-Yung Lin,et al.  Reconstruction of shredded document based on image feature matching , 2012, Expert Syst. Appl..

[9]  Giovanni Ramponi,et al.  Feature extraction and clustering for the computer-aided reconstruction of strip-cut shredded documents , 2008, J. Electronic Imaging.

[10]  Robert Sablatnig,et al.  Strip shredded document reconstruction using optical character recognition , 2011, ICDP.

[11]  David M. Mount,et al.  A Fast Implementation of the Isodata Clustering Algorithm , 2007, Int. J. Comput. Geom. Appl..

[12]  Matthias Prandtstetter,et al.  Combining Forces to Reconstruct Strip Shredded Text Documents , 2008, Hybrid Metaheuristics.

[13]  M. A. O. Marques,et al.  Document Decipherment-restoration: Strip-shredded Document Reconstruction based on Color , 2013, IEEE Latin America Transactions.

[14]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[15]  Jan Schneider,et al.  Assistant-Based Reconstruction of Believed Destroyed Shredded Documents , 2013 .