Skip to main content

A Ground Truth Bleed-Through Document Image Database

  • Conference paper
Theory and Practice of Digital Libraries (TPDL 2012)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7489))

Included in the following conference series:

Abstract

This paper introduces a new database of 25 recto/verso image pairs from documents suffering from bleed-through degradation, together with manually created foreground text masks. The structure and creation of the database is described, and three bleed-through restoration methods are compared in two ways; visually, and quantitatively using the ground truth masks.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Burgoyne, J.A., Devaney, J., Pugin, L., Fujinaga, I.: Enhanced bleedthrough correction for early music documents with recto-verso registration. In: Int. Conf. Music Inform. Retrieval, Philadelphia, PA, pp. 407–412 (2008)

    Google Scholar 

  2. Castro, P., Almeida, R.J., Pinto, J.R.C.: Restoration of Double-Sided Ancient Music Documents with Bleed-Through. In: Rueda, L., Mery, D., Kittler, J. (eds.) CIARP 2007. LNCS, vol. 4756, pp. 940–949. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  3. Fadoua, D., Le Bourgeois, F., Emptoz, H.: Restoring Ink Bleed-Through Degraded Document Images Using a Recursive Unsupervised Classification Technique. In: Bunke, H., Spitz, A.L. (eds.) DAS 2006. LNCS, vol. 3872, pp. 38–49. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  4. Dubois, E., Pathak, A.: Reduction of bleed-through in scanned manuscript documents. In: IS&T Image Process., Image Quality, Image Capture Syst. Conf., Montreal, Canada, vol. 4, pp. 177–180 (2001)

    Google Scholar 

  5. Estrada, R., Tomasi, C.: Manuscript bleed-through removal via hysteresis thresholding. In: 10th Int.l Conf. Doc. Anal. and Recogn., Barcelona, Spain, pp. 753–757 (2009)

    Google Scholar 

  6. Gatos, B., Pratikakis, I., Perantonis, S.J.: Adaptive degraded document image binarization. J. Pattern Recogn. 39(3), 317–327 (2006)

    Article  MATH  Google Scholar 

  7. Huang, Y., Brown, M.S., Xu, D.: A framework for reducing ink-bleed in old documents. In: IEEE Conf. Comput. Vis. Pattern Recogn., Anchorage, AK, pp. 1–7 (2008)

    Google Scholar 

  8. Huang, Y., Brown, M.S., Xu, D.: User-assisted ink-bleed reduction. IEEE Trans. Image Process. 19(10), 2646–2658 (2010)

    Article  MathSciNet  Google Scholar 

  9. Moghaddam, R.F., Cheriet, M.: Low quality document image modeling and enhancement. Int. J. Doc. Anal. Recogn. 11(4), 183–201 (2009)

    Article  Google Scholar 

  10. Moghaddam, R.F., Cheriet, M.: A variational approach to degraded document enhancement. IEEE Trans. Pattern Anal. Mach. Intell. 32(8), 1347–1361 (2010)

    Article  Google Scholar 

  11. Rowley-Brooke, R., Kokaram, A.: Bleed-through removal in degraded documents. In: SPIE: Doc. Recogn. Retrieval Conf., San Francisco, CA (2012)

    Google Scholar 

  12. Sauvola, J., Pietikäinen, M.: Adaptive document image binarization. J. Pattern Recogn. 33(2), 225–236 (2000)

    Article  Google Scholar 

  13. Tideman, T.N.: Independence of clones as a criterion for voting rules. J. Soc. Choice Welf. 4(3), 185–206 (1987)

    Article  MathSciNet  MATH  Google Scholar 

  14. Tonazzini, A.: Color space transformations for analysis and enhancement of ancient degraded manuscripts. J. Pattern Recogn. Image Anal. 20(3), 404–417 (2010)

    Article  Google Scholar 

  15. Tonazzini, A., Bedini, L., Salerno, E.: Independent component analysis for document restoration. Int. J. Doc. Anal. Recogn. 7(1), 17–27 (2004)

    Google Scholar 

  16. Tonazzini, A., Salerno, E., Bedini, L.: Fast correction of bleed-through distortion in grayscale documents by a blind source separation technique. Int. J. Doc. Anal. Recogn. 10(1), 17–25 (2007)

    Article  Google Scholar 

  17. Wolf, C.: Document ink bleed-through removal with two hidden markov random fields and a single observation field. IEEE Trans. Pattern Anal. Mach. Intell. 32(3), 431–447 (2010)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Rowley-Brooke, R., Pitié, F., Kokaram, A. (2012). A Ground Truth Bleed-Through Document Image Database. In: Zaphiris, P., Buchanan, G., Rasmussen, E., Loizides, F. (eds) Theory and Practice of Digital Libraries. TPDL 2012. Lecture Notes in Computer Science, vol 7489. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33290-6_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-33290-6_21

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-33289-0

  • Online ISBN: 978-3-642-33290-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics