Caption Detection, Localization and Type Recognition in Arabic News Video

Published: 09 May 2016

Abstract

In this paper, we propose a method to detect and localize all caption types in Arabic news videos. Different types of captions are considered, including static, horizontally scrolling, and vertically scrolling captions. The method handles different patterns of caption appearance and disappearance, as well as news videos that contain multiple captions. It is based on edge features and multiple-frame integration. A Canny edge map is computed for each frame, horizontal line detection is applied, and frames are categorized into clusters. Finally, the caption type of each cluster is recognized by observing the normalized inter-frame edge map difference. Experimental results show that the proposed method effectively detects and locates all caption types, recognizes the caption type, and identifies the appearance/disappearance intervals of captions. The experiments were conducted on real news videos recorded from different TV channels.
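
The abstract does not define the normalized inter-frame edge map difference or the rule that maps it to a caption type, so the following is only a minimal sketch under assumed definitions, not the authors' implementation. It assumes each frame of a candidate caption region has already been reduced to a binary edge map (for example, a thresholded Canny output), takes the difference as the share of edge pixels that change between consecutive maps, and guesses the scrolling direction by checking which one-pixel shift best aligns consecutive maps. The names `normalized_edge_difference`, `shift`, and `classify_caption`, and the threshold value, are hypothetical.

```python
from typing import List, Sequence, Tuple

EdgeMap = List[List[int]]  # binary edge map: 1 = edge pixel, 0 = background


def normalized_edge_difference(prev: EdgeMap, curr: EdgeMap) -> float:
    """Share of edge pixels (union of both maps) that differ between the maps."""
    changed = 0
    union = 0
    for row_prev, row_curr in zip(prev, curr):
        for p, c in zip(row_prev, row_curr):
            if p or c:
                union += 1
                if p != c:
                    changed += 1
    return changed / union if union else 0.0


def shift(edge_map: EdgeMap, dx: int, dy: int) -> EdgeMap:
    """Translate an edge map by (dx, dy) pixels; vacated pixels become 0."""
    height, width = len(edge_map), len(edge_map[0])
    out = [[0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            ny, nx = y + dy, x + dx
            if 0 <= ny < height and 0 <= nx < width:
                out[ny][nx] = edge_map[y][x]
    return out


def classify_caption(edge_maps: Sequence[EdgeMap],
                     static_threshold: float = 0.1) -> str:
    """Label a region as static, horizontal scrolling, or vertical scrolling.

    Assumed rule: a small mean difference means a static caption; otherwise
    the direction whose one-pixel shift best explains the change wins.
    """
    if len(edge_maps) < 2:
        raise ValueError("need at least two frames")
    pairs = list(zip(edge_maps, edge_maps[1:]))

    def mean_difference(shifts: Sequence[Tuple[int, int]]) -> float:
        total = 0.0
        for prev, curr in pairs:
            total += min(
                normalized_edge_difference(shift(prev, dx, dy), curr)
                for dx, dy in shifts
            )
        return total / len(pairs)

    if mean_difference([(0, 0)]) <= static_threshold:
        return "static"
    horizontal = mean_difference([(-1, 0), (1, 0)])
    vertical = mean_difference([(0, -1), (0, 1)])
    return "horizontal scrolling" if horizontal <= vertical else "vertical scrolling"
```

A real scrolling caption moves by more than one pixel per frame and brings in new text at the border, so a practical version would search a range of shifts and tolerate a nonzero residual; the single-pixel search here only keeps the sketch short.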

Published In

INFOS '16: Proceedings of the 10th International Conference on Informatics and Systems
May 2016
347 pages
ISBN: 9781450340625
DOI: 10.1145/2908446
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. Arabic news video
  2. Caption Detection
  3. Caption Localization
  4. Caption Type Recognition
  5. Edge Features
  6. Multiple Frames Integration

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

INFOS '16

Cited By

  • (2022) Fonts That Fit the Music: A Multimodal Design Trend Analysis of Lyric Videos. IEEE Access 10 (65414-65425). DOI: 10.1109/ACCESS.2022.3184028. Online publication date: 2022
  • (2022) Multi-Script Video Caption Localization Based on Visual Rhythms. Applied Artificial Intelligence 36:1. DOI: 10.1080/08839514.2022.2032926. Online publication date: 4-Feb-2022
  • (2021) Effect of Occlusion on Deaf and Hard of Hearing Users’ Perception of Captioned Video Quality. Universal Access in Human-Computer Interaction. Access to Media, Learning and Assistive Environments (202-220). DOI: 10.1007/978-3-030-78095-1_16. Online publication date: 3-Jul-2021
  • (2020) Lyric Video Analysis Using Text Detection and Tracking. Document Analysis Systems (426-440). DOI: 10.1007/978-3-030-57058-3_30. Online publication date: 14-Aug-2020
  • (2017) News Videos Segmentation Using Dominant Colors Representation. Advances in Soft Computing and Machine Learning in Image Processing (89-109). DOI: 10.1007/978-3-319-63754-9_5. Online publication date: 15-Oct-2017
  • (2016) An Innovative Method for Key Frames Extraction in News Videos. Proceedings of the International Conference on Advanced Intelligent Systems and Informatics 2016 (383-394). DOI: 10.1007/978-3-319-48308-5_37. Online publication date: 18-Oct-2016
  • (2016) Abrupt Cut Detection in News Videos Using Dominant Colors Representation. Proceedings of the International Conference on Advanced Intelligent Systems and Informatics 2016 (320-331). DOI: 10.1007/978-3-319-48308-5_31. Online publication date: 18-Oct-2016
