Multiple style exploration for story unit segmentation of broadcast news video

Feng, Bailan; Chen, Zhineng; Zheng, Rong; Xu, Bo

doi:10.1007/s00530-013-0350-0

Multiple style exploration for story unit segmentation of broadcast news video

Regular Paper
Published: 13 December 2013

Volume 20, pages 347–361, (2014)
Cite this article

Multimedia Systems Aims and scope Submit manuscript

Bailan Feng¹,
Zhineng Chen¹,
Rong Zheng¹ &
…
Bo Xu¹

462 Accesses
8 Citations
Explore all metrics

Abstract

Broadcast news video has been playing an increasingly important role in our daily life. However, how to effectively segment a broadcast news video into meaningful semantic story units is still a challenge issue. In this paper, we propose a novel unified video structure parsing approach, named multiple style exploration-based news story segmentation (MSE-NSS), to segment broadcast news videos into semantic story units. In MSE-NSS, we first investigate the appropriate methods to explore multiple kinds of style information inherent in broadcast news videos, including temporal style inferred from caption texts, boundary style represented by a wealth of multi-modal visual–audio features, and structural style known as the spanning duration of story units. Then the above multiple style information is integrated together and the task of story unit segmentation is accomplished through the following three steps: temporal style-based pre-location, boundary style-based description, and boundary-structural style-based segmentation, where the segmentation process is composed of a SVM-based detector and a dynamic programming-based refiner that considers the boundary style and the structural style collectively. Parallel to this, a news-oriented broadcast management system—NOBMs is implemented on top of the proposed MSE-NSS. Encouraging experimental results on a large broadcast news video dataset demonstrate the effectiveness of the proposed MSE-NSS, as well as its superiority over traditional story unit segmentation methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Segmenting with style: detecting program and story boundaries in TV news broadcast videos

Article 27 July 2019

Automatic Segmentation of TV News into Stories Using Visual and Temporal Information

Only overlay text: novel features for TV news broadcast video segmentation

Article 06 April 2022

References

Ardissono, L., Kobsa, A., Maybury, M. (eds.): Personalized digital television. Kluwer, Norwell (2004)
Xie, L., Zheng, L.L., Liu, Z.H., Zhang, Y.N.: Laplacian eigenmaps for automatic story segmentation of broadcast news. IEEE Trans. Audio Speech Language Process. 20(1), 276–289 (2012)
Article Google Scholar
Zhai, Y., Rasheed, Z., Shah, M.: University of Central Florida at TRECVID 2003. In: Proc. TRECVID Workshop (2003)
Wang, J.Q., Duan, L.Y., Liu, Q.S., Lu, H.Q., Jin, J.S.: A multimodal scheme for program segmentation and representation in broadcast video streams. IEEE Trans. Multimedia 10(3), 393–408 (2008)
Article Google Scholar
Santo, M.D., Foggia, P., Percannella, G., Sansone, C., Vento, M.: An unsupervised algorithm for anchor shot detection. In: Proc. IEEE ICPR, pp. 1238–1241 (2006)
Hsu, W.H., Kenney, L.S., Chang, S.F., Franz, M., Smith, J.: Columbia-IBM news video story segmentation in TRECVID 2004. In: Proc. ACM CIVR, pp. 1–11 (2005)
Gao, X., Tang, X.: Unsupervised video-shot segmentation and model-free anchorperson detection for news video story parsing. IEEE Trans. Circuits System Video Technol. 12(9), 765–776 (2002)
Article Google Scholar
Bertini, M., Bimbo, A.D., Pala, P.: Content-based indexing and retrieval of TV news. Pattern Recogn. Lett. 22, 503–516 (2001)
Article MATH Google Scholar
Shriberg, E., Stolcke, A., Hakkani-Tür, D., Tür, G.: Prosody-based automatic segmentation of speech into sentences and topics. Speech Commun. 32(1–2), 127–154 (2000)
Article Google Scholar
Tür, G., Hakkani-Tür, D.: Integrating prosodic and lexical cues for automatic topic segmentation. Comput. Linguist. 27(1), 31–57 (2001)
Article Google Scholar
Tseng, C.Y., Pin, S.H., Lee, Y., Wang, H.M., Chen, Y.C.: Fluent speech prosody: framework and modeling. Speech Commun. 46(3–4), 284–309 (2005)
Article Google Scholar
Xie, L., Liu, C., Meng, H.: Combined use of speaker and tone-normalized pitch reset with pause duration for automatic story segmentation in mandarin broadcast news. In: Proc. HLT-NAACL, pp. 193–199 (2007)
Xie, L.: Discovering salient prosodic cues and their interactions for automatic story segmentation in mandarin broadcast news. Multimedia Syst. 14(4), 237–253 (2008)
Article Google Scholar
Hearst, M.A.: TextTiling: segmenting text into multi-paragraph subtopic passages. Comput. Liguist. 23(1), 33–64 (1997)
Google Scholar
Stokes, N., Carthy, J., Smeaton, A.: SeLeCT: a lexical cohesion based news story segmentation system. J. AI Commun. 17(1), 3–12 (2004)
MATH MathSciNet Google Scholar
Chan, S.K., Xie, L., Meng, H.M.L.: Modeling the statistical behavior of lexical chains to capture word cohesiveness for automatic story segmentation. In: Proc. Interspeech, pp. 2408–2411 (2007)
Poulisse, G.J., Moens, M.F.: Multimodal news story segmentation. In: Proc. Int. Conf. on intelligent human computer interaction, pp. 95–101 (2009)
Chaisorn, L., Chua, T.S., Lee, C.H.: A multi-modal approach to story segmentation for news video. World Wide Web Internet Web Inf. Systems 6(2), 187–208 (2003)
Article Google Scholar
Liu, M.M., Zheng, L.L., Leung, C.C., Xie, L., Ma, B., Li, H.Z.: Broadcast news story segmentation using probabilistic latent semantic analysis and laplacian eigenmaps. In: Pro. APSIPA-ASC, pp. 356–360 (2011)
Xu, S., Feng, B.L., Chen, Z.N., Xu, B.: A general framework of video segmentation to logical unit based on conditional random fields. In: Proc. ACM ICMR (2013)
Wang, X., Xie, L., Ma, B., Chng, E.S., Li, H.: Modeling broadcast news prosody using conditional random fields for story segmenation. In: Proc. APSIPA ASC (2010)
Ojala, T., Pietikainen, M., Maenpaa, T.: Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Analysis Mach. Intell. 24(7), 971–987 (2002)
Article Google Scholar
Maitra, R., Peterson, A.D., Ghosh, A.P.: A systematic evaluation of different methods for initializing the K-means clustering algorithm. IEEE Trans. Knowledge Data Eng, 1–11 (2010)
Yang, M.H., Kriegman, D., Ahuja, N.: Detecting faces in images: a survey. IEEE Trans. Pattern Analysis Mach. Intell. 24(1), 34–58 (2002)
Article Google Scholar
Gao, X., Tang, X.: Unsupervised video-shot segmentation and model-free anchorperson detection for news video story parsing. IEEE Trans. Circuits System Video Technol. 12(9), 765–776 (2002)
Article Google Scholar
He, S.N., Yu, J.B.: A novel chinese continuous speech endpoint detection method based on time domain features of the word structure. In: Proc. IEEE ICCCAS, pp. 992–996 (2002)
Yang, S.Y., Zhou, Y.Y., Huang, S.X.: A survey of endpoint detection methods for speech signal. Inf. Technol. 29(7), 5–8 (2005)
Google Scholar
Zhu, S.Q., Qiu, X.H.: Research on endpoint detection of speech signals. Comput. Simul. 22(3), 214–216 (2005)
Google Scholar
Zhang, S.L., Zhang, S.W., Xu, B.: A two-level method for unsupervised speaker-based audio segmentation. In: Proc. IEEE ICPR, pp. 298–301 (2006)
Liang, J.E., Meng, M., Wang, X.R., Ding, P., Xu, B.: An improved mandarin keyword spotting system using mce training and context-enhanced verfication. In: Proc. IEEE ICASSP, pp. 1145–1148 (2006)
Feng, B.L., Ding, P., Chen, J.S., Bai, J.F., Xu, S., Xu, B.: Multi-modal information fusion for news story segmentation in broadcast video. In: Proc. IEEE ICASSP, pp. 1417–1420 (2012)
Chua, T.S., Chang, S.F., Chaisorn, L., Hsu, W.: Story boundary detection in large broadcast news video archives—techniques, experience and trends. In: Proc. ACM MM, pp. 656–659 (2004)
Cao, J., Ngo, C.W., Zhang, Y.D., Li, J.T.: Tracking web video topics: discovery, visualization and monitoring. IEEE Trans. Circuits Systems Video Technol 21(12), 1835–1846 (2011)
Article Google Scholar
Chen, Z.N., Cao, J., Xia, T., Song, Y.C., Zhang, Y.D., Li, J.T.: Web video retagging. Multimedia Tools Appl. 55(1), 53–82 (2011)
Article Google Scholar
Chen, Z.N., Cao, J., Song, Y.C., Guo, J.B, Zhang, Y.D., Li, J.T.: Context-oriented web video tag recommendation. In: Proc. WWW, pp. 1079–1080 (2010)
Feng, B.L., Cao, J., Bao, L., Zhang, Y.D., Lin, S.X., Bao, X.G., Yun, X.C.: Graph-based multi-space semantic correlation diffusion for video retrieval. Visual Comput. Int. J. Comput. Graphics 27(1), 21–34 (2011)
Google Scholar
Yao, T., Ngo, C.W., Mei, T.: Circular reranking for visual search. IEEE Trans. Image Process. 22(4), 1644–1655 (2013)
Article MathSciNet Google Scholar
Hsu, W., Kennedy, L., Huang, C.W., Chang, S.F., Lin, C.Y., Iyengar, G.: News video story segmentation using fusion of multi-level multi-modal features in TRECVID 2003. In: Proc. IEEE ICASSP, pp. 645–648 (2004)
Hsu, W, Chang, S.F., Huang, C.W., Kennedy, L., Lin, C.Y., Iyengar, G.: Discovery and fusion of salient multi-modal features towards news story segmentation. IS&T/SPIE Electronic Imaging. (2004)
Wang, X.X., Xie, L., Lu, M.M., Ma, B., Chng, E.S., Li, H.Z.: Broadcast news story segmentation using conditional random fields and multimodal features. IEICE Trans. Inf. Systems, 95-D(5), 1206–1215 (2012)
Google Scholar
Lu, M.M., Xie, L., Fu, Z.H., Jiang, D.M., Zhang, Y.N.: Multi-modal feature integration for story boundary detection in broadcast news. International Symposium on Chinese Spoken Language Processing, pp. 420–425 (2010)
Noreau, N.: HTK(v.3.1): basic tutorial. Technical Report (2002)
Kudo, T.: CRF++: yet another CRF toolkit. Technical Report (2005)

Download references

Acknowledgments

This work was supported by National Nature Science Foundation of China (No. 61202326), Beijing Natural Science Foundation (No. 4132071) and National Nature Science Foundation of China (No. 61303175). The authors would like to thank Bao Han and Guoyue Si for their supports with the system UI development.

Author information

Authors and Affiliations

Interactive Digital Media Technology Research Center, Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China
Bailan Feng, Zhineng Chen, Rong Zheng & Bo Xu

Authors

Bailan Feng
View author publications
You can also search for this author in PubMed Google Scholar
Zhineng Chen
View author publications
You can also search for this author in PubMed Google Scholar
Rong Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Bo Xu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Bailan Feng.

Additional information

Communicated by P. Pala.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Feng, B., Chen, Z., Zheng, R. et al. Multiple style exploration for story unit segmentation of broadcast news video. Multimedia Systems 20, 347–361 (2014). https://doi.org/10.1007/s00530-013-0350-0

Download citation

Received: 22 May 2013
Accepted: 25 November 2013
Published: 13 December 2013
Issue Date: July 2014
DOI: https://doi.org/10.1007/s00530-013-0350-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multiple style exploration for story unit segmentation of broadcast news video

Abstract

Access this article

Similar content being viewed by others

Segmenting with style: detecting program and story boundaries in TV news broadcast videos

Automatic Segmentation of TV News into Stories Using Visual and Temporal Information

Only overlay text: novel features for TV news broadcast video segmentation

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Multiple style exploration for story unit segmentation of broadcast news video

Abstract

Access this article

Similar content being viewed by others

Segmenting with style: detecting program and story boundaries in TV news broadcast videos

Automatic Segmentation of TV News into Stories Using Visual and Temporal Information

Only overlay text: novel features for TV news broadcast video segmentation

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation