Skip to main content

New Fusion Based Enhancement for Text Detection in Night Video Footage

  • Conference paper
  • First Online:
Advances in Multimedia Information Processing – PCM 2018 (PCM 2018)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11166))

Included in the following conference series:

Abstract

Text Detection in night video footage is hard due to low contrast and low resolution caused by distance variations between camera and ground under poor light. In this paper, we propose a new fusion based enhancement method for text detection especially in night video footage. The proposed method integrates the merits of color space and frequency based enhanced methods for sharpening low contrast details. Specifically, for each enhanced image, the proposed method derives weighted mean for the pixels values to widen the gap between high contrast (texts) and low contrast (background) pixels. The weighed means are further modified as dynamic weights with respect to enhanced images. These weights are convolved with pixel values of respective enhanced images to produce fused images. The proposed fusion based enhancement method is tested on images collected from night video footage to demonstrate the effectiveness of the method. For the output of each enhancement method including the proposed method, text detection rates are computed to show that the proposed enhancement method outperforms the existing enhancement methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Ye, Q., Doermann, D.: Text detection and recognition in imagery: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 37(7), 1480–1500 (2015)

    Article  Google Scholar 

  2. Yin, X.C., Zuo, Z.Y., Tian, S., Liu, C.L.: Text detection, tracking and recognition in video: a comprehensive survey. IEEE Trans. Image Process. 25(6), 2752–2773 (2016)

    Article  MathSciNet  Google Scholar 

  3. Tian, S., Yin, X.C., Su, Y., Hao, H.W.: A unified framework for tracking based text detection and recognition from web videos. IEEE Trans. Pattern Anal. Mach. Intell. 40(3), 542–554 (2018)

    Article  Google Scholar 

  4. Shi, B., Bai, X., Belongie, S.: Detecting oriented text in natural images by linking segments. In: Proceedings CVPR, vol. 3 (2017)

    Google Scholar 

  5. Zhou, X., et al.: East: an efficient and accurate scene text detector. arXiv preprint arXiv:1704.03155 (2017)

  6. Tian, Z., Huang, W., He, T., He, P., Qiao, Y.: Detecting text in natural image with connectionist text proposal network. In: ECCV, pp. 56–72 (2016)

    Chapter  Google Scholar 

  7. Jiang, X., Yao, H., Zhang, S., Lu, X., Zeng, W.: Night video enhancement using improved dark channel prior. In: ICIP, pp. 553–557. IEEE (2013)

    Google Scholar 

  8. Rui, W., Guoyu, W.: Medical X-ray image enhancement method based on tvhomomorphic filter. In: 2017 2nd International Conference on Image, Vision and Computing (ICIVC), pp. 315–318. IEEE (2017)

    Google Scholar 

  9. Sharma, S., Zou, J.J., Fang, G.: Contrast enhancement using pixel based image fusion in wavelet domain. In: 2016 2nd International Conference on Contemporary Computing and Informatics (IC3I), pp. 285–290. IEEE (2016)

    Google Scholar 

  10. Lee, M.S., Park, C.H., Kang, M.G.: Edge enhancement algorithm for low-dose X-ray fluoroscopic imaging. Comput. Methods Programs Biomed. 152, 45–52 (2017)

    Article  Google Scholar 

  11. Maurya, L., Mahapatra, P.K., Kumar, A.: A social spider optimized image fusion approach for contrast enhancement and brightness preservation. Appl. Soft Comput. 52, 575–592 (2017)

    Article  Google Scholar 

  12. Pei, W.Y., Yang, C., Kau, L.J., Yin, X.C.: Multi-orientation scene text detection with multi-information fusion. In: 2016 23rd International Conference on Pattern Recognition (ICPR), pp. 657–662. IEEE (2016)

    Google Scholar 

  13. Roy, S., Shivakumara, P., Jalab, H.A., Ibrahim, R.W., Pal, U., Lu, T.: Fractional poisson enhancement model for text detection and recognition in video frames. Pattern Recogn. 52, 433–447 (2016)

    Article  Google Scholar 

  14. Xu, X., Wang, Y., Chen, S.: Medical image fusion using discrete fractional wavelet transform. Biomed. Signal Process. Control 27, 103–111 (2016)

    Article  Google Scholar 

  15. Karatzas, D., et al.: ICDAR 2015 competition on robust reading. In: 2015 13th International Conference on Document Analysis and Recognition (ICDAR), pp. 1156–1160. IEEE (2015)

    Google Scholar 

Download references

Acknowledgment

The work described in this paper was supported by the Natural Science Foundation of China under Grant No. 61672273 and No. 61272218, the Science Foundation for Distinguished Young Scholars of Jiangsu under Grant No. BK20160021, and Scientific Foundation of State Grid Corporation of China (Research on Ice-wind Disaster Feature Recognition and Prediction by Few-shot Machine Learning in Transmission Lines).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tong Lu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zhang, C., Shivakumara, P., Xue, M., Zhu, L., Lu, T., Pal, U. (2018). New Fusion Based Enhancement for Text Detection in Night Video Footage. In: Hong, R., Cheng, WH., Yamasaki, T., Wang, M., Ngo, CW. (eds) Advances in Multimedia Information Processing – PCM 2018. PCM 2018. Lecture Notes in Computer Science(), vol 11166. Springer, Cham. https://doi.org/10.1007/978-3-030-00764-5_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-00764-5_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-00763-8

  • Online ISBN: 978-3-030-00764-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics