Abstract
Text detection in night video footage is hard because of the low contrast and low resolution caused by poor lighting and by variations in the distance between the camera and the ground. In this paper, we propose a new fusion-based enhancement method for text detection, particularly in night video footage. The proposed method integrates the merits of color-space-based and frequency-based enhancement methods for sharpening low-contrast details. Specifically, for each enhanced image, the proposed method derives a weighted mean of the pixel values to widen the gap between high-contrast (text) and low-contrast (background) pixels. The weighted means are then modified into dynamic weights with respect to the enhanced images. These weights are convolved with the pixel values of the respective enhanced images to produce fused images. The proposed method is tested on images collected from night video footage to demonstrate its effectiveness. Text detection rates are computed on the output of each enhancement method, including the proposed one, to show that the proposed method outperforms the existing enhancement methods.
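The fusion step described above can be illustrated with a minimal sketch. The abstract does not give the formulas, so everything here is an assumption made for illustration: the function names (`weighted_mean`, `fusion_weights`, `fuse`) are hypothetical, the weighted mean is taken to be intensity-weighted, the dynamic weights are taken to be the weighted means normalized across the enhanced images, and the "convolution" of weights with pixel values is simplified to a per-image scalar multiplication followed by a sum. Inputs are assumed to be same-sized grayscale images with values in [0, 1].

```python
import numpy as np


def weighted_mean(img):
    """Intensity-weighted mean of the pixel values (assumed definition).

    Brighter pixels contribute more, so an image with strong bright
    (text-like) regions gets a larger value than its plain mean.
    """
    img = np.asarray(img, dtype=np.float64)
    total = img.sum()
    if total == 0:
        return 0.0
    return float((img * img).sum() / total)


def fusion_weights(enhanced):
    """One weight per enhanced image, normalized to sum to 1 (assumption)."""
    means = np.array([weighted_mean(e) for e in enhanced])
    s = means.sum()
    if s == 0:
        # All images are black: fall back to uniform weights.
        return np.full(len(enhanced), 1.0 / len(enhanced))
    return means / s


def fuse(enhanced):
    """Weighted sum of the enhanced images using the weights above."""
    stack = np.stack([np.asarray(e, dtype=np.float64) for e in enhanced])
    w = fusion_weights(enhanced)
    # Contract the image axis: sum_i w[i] * stack[i]
    return np.tensordot(w, stack, axes=1)


if __name__ == "__main__":
    # Stand-ins for the outputs of two enhancement methods.
    color_enhanced = np.full((4, 4), 0.2)
    freq_enhanced = np.full((4, 4), 0.8)
    fused = fuse([color_enhanced, freq_enhanced])
    print(fused.shape, fused[0, 0])
```

In this simplified form, an enhanced image whose weighted mean is higher receives a larger share of the fused result. The actual method may use per-pixel weights or a true convolution, which this sketch does not attempt to reproduce.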
Acknowledgment
The work described in this paper was supported by the Natural Science Foundation of China under Grants No. 61672273 and No. 61272218, the Science Foundation for Distinguished Young Scholars of Jiangsu under Grant No. BK20160021, and the Scientific Foundation of State Grid Corporation of China (Research on Ice-wind Disaster Feature Recognition and Prediction by Few-shot Machine Learning in Transmission Lines).
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Zhang, C., Shivakumara, P., Xue, M., Zhu, L., Lu, T., Pal, U. (2018). New Fusion Based Enhancement for Text Detection in Night Video Footage. In: Hong, R., Cheng, WH., Yamasaki, T., Wang, M., Ngo, CW. (eds) Advances in Multimedia Information Processing – PCM 2018. PCM 2018. Lecture Notes in Computer Science(), vol 11166. Springer, Cham. https://doi.org/10.1007/978-3-030-00764-5_5
DOI: https://doi.org/10.1007/978-3-030-00764-5_5
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00763-8
Online ISBN: 978-3-030-00764-5
eBook Packages: Computer Science, Computer Science (R0)