Meta-heuristics for reconstructing cross cut shredded text documents

In this work, we present two new approaches for the reconstruction of cross cut shredded text documents, one based on variable neighborhood search (VNS) and one based on ant colony optimization (ACO). To obtain initial solutions quickly, we consider four different construction heuristics. One of them is based on the well-known algorithm of Prim, and another matches shreds according to the similarity of their borders. The two remaining construction heuristics rely on the fact that in most cases the left and right edges of paper documents are blank, i.e., no text is written on them. Randomized variants of these construction heuristics are applied within the ACO. Experimental tests reveal that the proposed ACO variants yield better solution quality than the VNS approaches in most cases, while VNS needs shorter running times. The obtained results underline the high potential of these approaches for reconstructing cross cut shredded text documents.
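As an illustration of the border-similarity idea, the following is a minimal sketch and not the construction heuristic evaluated in this work. It assumes that each shred is a small binary pixel grid (1 = ink, 0 = blank), that only a single row of shreds is ordered from left to right, and that the mismatch between two shreds is simply the number of border pixels that differ. The names `border_cost` and `greedy_row` are hypothetical. The starting shred is chosen as the one with the blankest left border, which loosely mirrors the observation that the left edge of a page is usually blank.

```python
from typing import List

# Assumed encoding: a shred is a list of pixel rows, 1 = ink, 0 = blank.
Shred = List[List[int]]


def left_border(shred: Shred) -> List[int]:
    """Leftmost pixel column of a shred."""
    return [row[0] for row in shred]


def right_border(shred: Shred) -> List[int]:
    """Rightmost pixel column of a shred."""
    return [row[-1] for row in shred]


def border_cost(a: Shred, b: Shred) -> int:
    """Number of pixel rows where the right border of a differs from the left border of b."""
    return sum(p != q for p, q in zip(right_border(a), left_border(b)))


def greedy_row(shreds: List[Shred]) -> List[int]:
    """Greedily order shreds left to right; returns indices into `shreds`."""
    # Start with the shred whose left border carries the least ink.
    start = min(range(len(shreds)), key=lambda i: sum(left_border(shreds[i])))
    order = [start]
    unused = set(range(len(shreds))) - {start}
    while unused:
        last = shreds[order[-1]]
        # Append the unused shred whose left border best matches the current right end.
        nxt = min(sorted(unused), key=lambda i: border_cost(last, shreds[i]))
        order.append(nxt)
        unused.remove(nxt)
    return order


# Toy example: a 3x6 image cut into three shreds A, B, C (left to right), then shuffled.
A = [[0, 1], [0, 0], [0, 1]]
B = [[1, 0], [0, 1], [1, 1]]
C = [[0, 0], [1, 0], [1, 0]]
print(greedy_row([C, A, B]))  # prints [1, 2, 0], i.e. A, B, C
```

A greedy pass like this only produces a starting arrangement; a meta-heuristic such as VNS or ACO would then try to improve it. Choosing the next shred at random, with better-matching shreds being more likely, would be one way to randomize such a construction step.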
