Automatic Closed Caption Detection and Font Size Differentiation in MPEG Video

Duan-Yu, Chen; Ming-Ho, Hsiao; Suh-Yin, Lee

doi:10.1007/3-540-45925-1_26

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2314)

Included in the following conference series:

International Conference on Advances in Visual Information Systems

Abstract

This paper proposes a novel approach to automatic closed caption detection and font size differentiation among localized text regions in the I-frames of MPEG videos. The approach consists of five modules: video segmentation, shot selection, caption frame detection, caption localization and font size differentiation. Rather than examining scene cuts frame by frame, the video segmentation module first checks the video stream GOP by GOP and then locates the actual scene boundaries at the frame level. Tennis videos are selected as the case study, and the shot selection module is designed to automatically select a specific type of shot for further closed caption detection. Noise among the potential captions is filtered out by checking their long-term consistency over consecutive frames. Once the general closed captions are localized, the specific caption of interest is discriminated by the font size differentiation module. The detected closed captions can support video structuring, video browsing, high-level video indexing and video content description in MPEG-7. Experimental results show the effectiveness and feasibility of the proposed scheme.
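
Two ideas in the abstract, the GOP-by-GOP coarse check followed by a frame-level search, and the long-term consistency filter, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the per-frame scalar signature, the threshold, the exact-match comparison of boxes and the names find_scene_cuts and persistent_regions are all assumptions made for illustration.

```python
from typing import Dict, List, Sequence, Set, Tuple

# Hypothetical region type: (x, y, width, height) of a candidate caption box.
Box = Tuple[int, int, int, int]


def find_scene_cuts(signatures: Sequence[float], gop_size: int,
                    threshold: float) -> List[int]:
    """Coarse-to-fine cut search over per-frame signatures.

    `signatures` holds one scalar per frame (an assumed stand-in for
    whatever compressed-domain feature a real system would extract).
    """
    cuts: List[int] = []
    for start in range(0, len(signatures) - gop_size, gop_size):
        end = start + gop_size
        # Coarse pass: compare the first frames of two consecutive GOPs.
        if abs(signatures[end] - signatures[start]) <= threshold:
            continue
        # Fine pass: pick the largest frame-to-frame jump inside the span.
        best = max(range(start + 1, end + 1),
                   key=lambda i: abs(signatures[i] - signatures[i - 1]))
        cuts.append(best)
    return cuts


def persistent_regions(candidates: Sequence[Set[Box]],
                       min_run: int) -> Set[Box]:
    """Keep boxes seen in at least `min_run` consecutive frames.

    Boxes are matched by exact equality, a simplification; a real
    system would likely tolerate small position changes.
    """
    kept: Set[Box] = set()
    run: Dict[Box, int] = {}
    for boxes in candidates:
        # Extend the run of boxes that reappear; drop those that vanish.
        run = {box: run.get(box, 0) + 1 for box in boxes}
        kept.update(box for box, n in run.items() if n >= min_run)
    return kept
```

The coarse pass touches only one frame per GOP, so the frame-level search runs only inside GOPs that appear to contain a boundary. The consistency filter rests on the assumption that genuine captions stay on screen across several consecutive frames, while spurious detections tend to be short-lived.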

The research is partially supported by Lee & MTI Center, National Chiao-Tung University, Taiwan and National Science Council, Taiwan.

Author information

Authors and Affiliations

Department of Computer Science and Information Engineering, National Chiao Tung University, 1001 Ta-Hsueh Rd, Hsinchu, Taiwan
Chen Duan-Yu, Hsiao Ming-Ho & Lee Suh-Yin

Editor information

Editors and Affiliations

Knowledge Systems Institute, 3420 Main Street, 60076, Skokie, IL, USA
Shi-Kuo Chang
Dept. of Comp. Science & Information Engineering, National Chiao Tung University, 1001 Ta Hsueh Road, Hsin Chu, Taiwan
Zen Chen & Suh-Yin Lee

About this paper

Cite this paper

Duan-Yu, C., Ming-Ho, H., Suh-Yin, L. (2002). Automatic Closed Caption Detection and Font Size Differentiation in MPEG Video. In: Chang, SK., Chen, Z., Lee, SY. (eds) Recent Advances in Visual Information Systems. VISUAL 2002. Lecture Notes in Computer Science, vol 2314. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45925-1_26

DOI: https://doi.org/10.1007/3-540-45925-1_26
Published: 23 April 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43358-3
Online ISBN: 978-3-540-45925-5
eBook Packages: Springer Book Archive
