Abstract
In this paper, we propose a caption detection and removal technique for reusing TV scenes, based on multilayer perceptrons (MLPs) and genetic algorithms (GAs). The technique first detects the captions in a TV scene with an MLP-based caption detector and then removes the detected captions with a GA-based region remover. We model caption removal as an optimization problem: a cost function with an isophote constraint is minimized by a GA. The technique creates an optimal connection between all pairs of isophotes disconnected by caption regions. To connect the disconnected isophotes, we estimate the smoothness value given by the best chromosome of the GA and project this value in the isophote direction. Experimental results indicate that the technique is promising for the automatic removal of captions in TV advertisement scenes.
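The idea of treating region removal as cost minimization with a GA can be illustrated with a deliberately simplified sketch. It is not the method of the paper: it works on a 1-D signal instead of an image, it uses only a smoothness cost (sum of squared second differences) and omits the isophote constraint, and all names (`smoothness_cost`, `fill`, `ga_fill`) and parameter values are assumptions chosen for illustration. Each chromosome holds candidate values for the removed samples, and the GA keeps the candidates that make the restored signal smoothest.

```python
import random


def smoothness_cost(signal):
    """Sum of squared second differences; small when the signal is smooth."""
    return sum(
        (signal[i - 1] - 2 * signal[i] + signal[i + 1]) ** 2
        for i in range(1, len(signal) - 1)
    )


def fill(signal, gap, genes):
    """Return a copy of `signal` with the positions in `gap` set to `genes`."""
    out = list(signal)
    for pos, value in zip(gap, genes):
        out[pos] = value
    return out


def ga_fill(signal, gap, pop_size=40, generations=200, seed=0):
    """Toy GA: search for gap values that minimize the smoothness cost.

    Returns the restored signal and the best cost seen per generation.
    """
    rng = random.Random(seed)
    gap_set = set(gap)
    known = [v for i, v in enumerate(signal) if i not in gap_set]
    lo, hi = min(known), max(known)

    def cost(genes):
        return smoothness_cost(fill(signal, gap, genes))

    # Initial population: random values within the range of the known samples.
    pop = [[rng.uniform(lo, hi) for _ in gap] for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        pop.sort(key=cost)
        history.append(cost(pop[0]))
        next_pop = pop[:2]  # elitism: the two best chromosomes survive unchanged
        while len(next_pop) < pop_size:
            # Tournament selection of two parents.
            a = min(rng.sample(pop, 3), key=cost)
            b = min(rng.sample(pop, 3), key=cost)
            # Uniform crossover followed by occasional Gaussian mutation.
            child = [x if rng.random() < 0.5 else y for x, y in zip(a, b)]
            child = [
                g + rng.gauss(0, 0.05 * (hi - lo)) if rng.random() < 0.2 else g
                for g in child
            ]
            next_pop.append(child)
        pop = next_pop

    best = min(pop, key=cost)
    history.append(cost(best))
    return fill(signal, gap, best), history


# Usage: a linear ramp with four samples removed (standing in for a caption).
signal = [float(i) for i in range(12)]
gap = [4, 5, 6, 7]
for pos in gap:
    signal[pos] = 0.0
filled, history = ga_fill(signal, gap)
print("initial best cost:", history[0])
print("final best cost:", history[-1])
```

Because the two best chromosomes are carried over unchanged, the best cost never increases from one generation to the next. Extending the sketch toward the approach described in the abstract would mean working on 2-D image regions and adding a term to the cost that penalizes breaks in isophote continuity across the removed region.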
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kim, J., Ahn, K. (2004). Caption Detection and Removal in a TV Scene. In: Webb, G.I., Yu, X. (eds) AI 2004: Advances in Artificial Intelligence. AI 2004. Lecture Notes in Computer Science, vol 3339. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30549-1_16
DOI: https://doi.org/10.1007/978-3-540-30549-1_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24059-4
Online ISBN: 978-3-540-30549-1
eBook Packages: Computer Science (R0)