Abstract
The reconstruction of destroyed paper documents became of more interest during the last years. On the one hand it (often) occurs that documents are destroyed by mistake while on the other hand this type of application is relevant in the fields of forensics and archeology, e.g., for evidence or restoring ancient documents. Within this paper, we present a new approach for restoring cross-cut shredded text documents, i.e., documents which were mechanically cut into rectangular shreds of (almost) identical shape. For this purpose we present a genetic algorithm that is extended to a memetic algorithm by embedding a (restricted) variable neighborhood search (VNS). Additionally, the memetic algorithm’s final solution is further improved by an enhanced version of the VNS. Computational experiments suggest that the newly developed algorithms are not only competitive with the so far best known algorithms for the reconstruction of cross-cut shredded documents but clearly outperform them.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ávila, B.T., Lins, R.D.: A fast orientation and skew detection algorithm for monochromatic document images. In: DocEng 2005: Proceedings of the 2005 ACM symposium on Document Engineering, pp. 118–126. ACM, New York (2005)
Bäck, T.: Evolutionary Algorithms in Theory and Practice. Oxford University Press, Oxford (1996)
Blum, C., Augilera, M.J.B., Roli, A., Sampels, M. (eds.): Hybrid Metaheuristics – An Emergent Approach for Combinatorial Optimization. Studies in Computational Intelligence, vol. 114. Springer, Heidelberg (2008)
Chung, M.G., Fleck, M.M., Forsyth, D.A.: Jigsaw puzzle solver using shape and color. In: Proceedings of the Fourth International Conference on Signal Processing 1998, vol. 2, pp. 877–880 (1998)
Davis, L. (ed.): Handbook of genetic algorithms, 1st edn. International Thomson Publishing Services (1996)
De Smet, P.: Reconstruction of ripped-up documents using fragment stack analysis procedures. Forensic Science International 176(2), 124–136 (2008)
Glover, F.W., Kochenberger, G.A. (eds.): Handbook of Metaheuristics. International Series in Operations Research & Management Science, vol. 57. Kluwer Academic Publishers, New York (2003)
Goldberg, D., Malon, C., Bern, M.: A global approach to automatic solution of jigsaw puzzles. Computational Geometry 28(2-3), 165–174 (2004)
Hansen, P., Mladenović, N.: Variable neighborhood search. In: Glover and Kochenberger [7], pp. 145–184
Holland, J.: Adaptation In Natural and Artificial Systems. University of Michigan Press, Ann Arbor (1975)
Justino, E., Oliveira, L.S., Freitas, C.: Reconstructing shredded documents through feature matching. Forensic Science International 160(2-3), 140–147 (2006)
Kleber, F., Diem, M., Sablatnig, R.: Torn document analysis as a prerequisite for reconstruction. In: Sablatnig, R., et al. (eds.) 15th International Conference on Virtual Systems and Multimedia, pp. 143–148. IEEE, Los Alamitos (2009)
Lu, S., Tan, C.L.: Automatic detection of document script and orientation. In: International Conference on Document Analysis and Recognition – ICDAR 2007, vol. 1, pp. 237–241. IEEE Computer Society, Los Alamitos (2007)
Moscato, P., Cotta, C.: A gentle introduction to memetic algorithms. In: Glover and Kochenberger [7], pp. 105–144
Prandtstetter, M., Raidl, G.R.: Meta-heuristics for reconstructing cross cut shredded text documents. In: Raidl, G.R., et al. (eds.) GECCO 2009: Proceedings of the 11th Annual Conference on Genetic and Evolutionary Computation, pp. 349–356. ACM Press, New York (2009)
Prandtstetter, M.: Hybrid Optimization Methods for Warehouse Logistics and the Reconstruction of Destroyed Paper Documents. Ph.D. thesis, Vienna University of Technology (2009)
Prandtstetter, M., Raidl, G.R.: Combining forces to reconstruct strip shredded text documents. In: Blesa, M.J., Blum, C., Cotta, C., Fernández, A.J., Gallardo, J.E., Roli, A., Sampels, M. (eds.) HM 2008. LNCS, vol. 5296, pp. 175–189. Springer, Heidelberg (2008)
Prim, R.C.: Shortest connection networks and some generalizations. The Bell System Technical Journal 3, 1389–1401 (1957)
Raidl, G.R., Puchinger, J., Blum, C.: Metaheuristic hybrids. In: Gendreau, M., Potvin, J.Y. (eds.) Handbook of Metaheuristics, 2nd edn. Springer, Heidelberg (accepted 2009) (to appear)
Schauer, C.: Reconstructing Cross-Cut Shredded Documents by means of Evolutionary Algorithms. Master’s thesis, Vienna University of Technology, Institute of Computer Graphics and Algorithms (2010)
Ukovich, A., Ramponi, G.: Features for the reconstruction of shredded notebook paper. In: IEEE International Conference on Image Processing, vol. 3, pp. 93–96 (2005)
Ukovich, A., Ramponi, G., Doulaverakis, H., Kompatsiaris, Y., Strintzis, M.: Shredded document reconstruction using MPEG-7 standard descriptors. In: Proceedings of the Fourth IEEE International Symposium on Signal Processing and Information Technology, pp. 334–337 (2004)
Ukovich, A., Zacchigna, A., Ramponi, G., Schoier, G.: Using clustering for document reconstruction. In: Dougherty, E.R., et al. (eds.) Image Processing: Algorithms and Systems, Neural Networks, and Machine Learning. Proceedings of SPIE, vol. 6064, pp. 168–179. International Society for Optical Engineering (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Schauer, C., Prandtstetter, M., Raidl, G.R. (2010). A Memetic Algorithm for Reconstructing Cross-Cut Shredded Text Documents. In: Blesa, M.J., Blum, C., Raidl, G., Roli, A., Sampels, M. (eds) Hybrid Metaheuristics. HM 2010. Lecture Notes in Computer Science, vol 6373. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16054-7_8
Download citation
DOI: https://doi.org/10.1007/978-3-642-16054-7_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16053-0
Online ISBN: 978-3-642-16054-7
eBook Packages: Computer ScienceComputer Science (R0)