Combining Forces to Reconstruct Strip Shredded Text Documents

In this work, we focus on the reconstruction of strip shredded text documents(RSSTD) which is of great interest in investigative sciences and forensics. After presenting a formal model for RSSTD, we suggest two solution approaches: On the one hand, RSSTD can be reformulated as a (standard) traveling salesman problem and solved by well-known algorithms such as the chained Lin Kernighan heuristic. On the other hand, we present a specific variable neighborhood search approach. Both methods are able to outperform a previous algorithm from literature, but nevertheless have practical limits due to the necessarily imperfect objective function. We therefore turn to a semi-automatic system which also integrates user interactions in the optimization process. Practical results of this hybrid approach are excellent; difficult instances can be quickly resolved with only few user interactions.

[1]  Ratnesh Kumar,et al.  On Asymmetric TSP : Transformation toSymmetric TSP and Performance Bound , 2007 .

[2]  Giovanni Ramponi,et al.  Using clustering for document reconstruction , 2006, Electronic Imaging.

[3]  Pierre Hansen,et al.  Variable Neighborhood Search , 2018, Handbook of Heuristics.

[4]  F. Glover,et al.  Handbook of Metaheuristics , 2019, International Series in Operations Research & Management Science.

[5]  M.G. Strintzis,et al.  Shredded document reconstruction using MPEG-7 standard descriptors , 2004, Proceedings of the Fourth IEEE International Symposium on Signal Processing and Information Technology, 2004..

[6]  Patrick De Smet Reconstruction of ripped-up documents using fragment stack analysis procedures. , 2008 .

[7]  Joe Marks,et al.  The HuGS platform: a toolkit for interactive optimization , 2002, AVI '02.

[8]  Matteo Fischetti,et al.  A Branch-and-Cut Algorithm for the Symmetric Generalized Traveling Salesman Problem , 1997, Oper. Res..

[9]  Margaret M. Fleck,et al.  Jigsaw puzzle solver using shape and color , 1998, ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344).

[10]  Joe Marks,et al.  Human-Guided Search : Survey and Recent Results , 2003 .

[11]  Edson Justino,et al.  Reconstructing shredded documents through feature matching. , 2006, Forensic science international.

[12]  Matthias Prandtstetter,et al.  Reconstructing Borders of Manually Torn Paper Sheets Using Integer Linear Programming ausgeführt am , 2007 .

[13]  Ben J Hicks,et al.  SPIE - The International Society for Optical Engineering , 2001 .

[14]  Joe Marks,et al.  Human-guided search , 2010, J. Heuristics.

[15]  William J. Cook,et al.  Finding Tours in the TSP , 1999 .

[16]  Forschungsinstitut für Diskrete Chained Lin-Kernighan for Large Traveling Salesman Problems , 2003 .

[17]  J. C. Bean,et al.  An efficient transformation of the generalized traveling salesman problem , 1993 .

[18]  Joanna Balme Reconstruction of Shredded Documents in the Absence of Shape Information , 2007 .

[19]  Bruce L. Golden,et al.  The Generalized Traveling Salesman Problem: A New Genetic Algorithm Approach , 2007 .