Skip to main content

Complementarity of Lexical Cohesion and Speaker Role Information for Story Segmentation of French TV Broadcast News

  • Conference paper
Statistical Language and Speech Processing (SLSP 2013)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7978))

Included in the following conference series:

Abstract

Topic boundary detection in French TV Broadcast News is addressed in this paper with an approach based on the combination of two views: lexical cohesion and speaker role analysis. We propose an improved selection strategy from the classical lexical cohesion curve as well as an integrated supervised classification approach that jointly exploits the two views. The combination of these methods leads to significant improvements on a rich French database composed of shows from 7 different channels.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Amaral, R., Trancoso, I.: Exploring the structure of broadcast news for topic segmentation. In: Vetulani, Z., Uszkoreit, H. (eds.) LTC 2007. LNCS, vol. 5603, pp. 1–12. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  2. Claveau, V., Lefèvre, S.: Topic segmentation of tv-streams by mathematical morphology and vectorization. In: INTERSPEECH, pp. 1105–1108 (2011)

    Google Scholar 

  3. Damnati, G., Charlet, D.: Multi-view approach for speaker turn role labeling in tv broadcast news shows. In: INTERSPEECH, pp. 1285–1288 (2011)

    Google Scholar 

  4. Dumont, E., Quénot, G.: Automatic story segmentation for tv news video using multiple modalities. Int. J. Digital Multimedia Broadcasting (2012)

    Google Scholar 

  5. Fayyad, U.M., Irani, K.B.: Multi-interval discretization of continuous-valued attributes for classification learning. In: International Joint Conference on Artificial Intelligence, pp. 1022–1029 (1993)

    Google Scholar 

  6. Gauvain, J.L., Lamel, L., Adda, G.: The limsi broadcast news transcription system. Speech Communication 37(1-2), 89–108 (2002)

    Article  MATH  Google Scholar 

  7. Guinaudeau, C.: Structuration automatique de flux télévisuels. Thèse, INSA de Rennes (2011)

    Google Scholar 

  8. Guinaudeau, C., Gravier, G., Sébillot, P.: Enhancing lexical cohesion measure with confidence measures, semantic relations and language model interpolation for multimedia spoken content topic segmentation. Computer Speech and Language 26(2), 90–104 (2012)

    Article  Google Scholar 

  9. Guinaudeau, C., Hirschberg, J.: Accounting for prosodic information to improve asr-based topic tracking for tv broadcast news. In: INTERSPEECH, pp. 1401–1404 (2011)

    Google Scholar 

  10. Hadsell, R., Kira, Z., Wang, W., Precoda, K.: Unsupervised topic modeling for leader detection in spoken discourse. In: ICASSP, pp. 5113–5116 (2012)

    Google Scholar 

  11. Hearst, M.A.: Texttiling: segmenting text into multi-paragraph subtopic passages. Comput. Linguist. 23(1), 33–64 (1997)

    Google Scholar 

  12. Lecorvé, G., Gravier, G., Sébillot, P.: An unsupervised web-based topic language model adaptation method. In: ICASSP, pp. 5081–5084 (2008)

    Google Scholar 

  13. Malioutov, I., Barzilay, R.: Minimum cut model for spoken lecture segmentation. In: International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics, pp. 25–32 (2006)

    Google Scholar 

  14. Rosenberg, A., Hirschberg, J.: Story segmentation of brodcast news in english, mandarin and arabic. In: Proceedings of the Human Language Technology Conference of the NAAC, pp. 125–128 (2006)

    Google Scholar 

  15. Sitbon, L., Bellot, P.: Topic segmentation using weighted lexical links (wll). In: SIGIR, pp. 737–738 (2007)

    Google Scholar 

  16. Tür, G., Hakkani-Tür, D.Z., Stolcke, A., Shriberg, E.: Integrating prosodic and lexical cues for automatic topic segmentation. Computational Linguistics 27(1), 31–57 (2001)

    Article  Google Scholar 

  17. Tür, G., Stolcke, A., Voss, L.L., Peters, S., Hakkani-Tür, D., Dowding, J., Favre, B., Fernández, R., Frampton, M., Frandsen, M.W., Frederickson, C., Graciarena, M., Kintzing, D., Leveque, K., Mason, S., Niekrasz, J., Purver, M., Riedhammer, K., Shriberg, E., Tien, J., Vergyri, D., Yang, F.: The calo meeting assistant system. IEEE Transactions on Audio, Speech and Language Processing 18(6), 1601–1611 (2010)

    Article  Google Scholar 

  18. Utiyama, M., Isahara, H.: A statistical model for domain-independent text segmentation. In: Proceedings of the 39th Annual Meeting on Association for Computational Linguistics, pp. 499–506 (2001)

    Google Scholar 

  19. Winston, H., Hsu, H.-M., Chang, S.-F.: A statistical framework for fusing mid-level perceptual features in news story segmentation. In: International Conference on Multimedia and Expo, pp. 413–416 (2003)

    Google Scholar 

  20. Xiaoxuan, W., Lei, X., Mimi, L., Bin, M., Chng, E.S., Haizhou, L.: Broadcast news story segmentation using conditional random fields and multimodal features. IEICE Transactions on Information and Systems 95(5), 1206–1215 (2012)

    Google Scholar 

  21. Xie, L., Yang, Y., Liu, Z.Q., Feng, W., Liu, Z.: Integrating acoustic and lexical features in topic segmentation of chinese broadcast news using maximum entropy approach. In: International Conference on Audio, Language and Image Processing, pp. 407–413 (2010)

    Google Scholar 

  22. Xie, L., Yang, Y., Zeng, J.: Subword lexical chaining for automatic story segmentation in chinese broadcast news. In: Huang, Y.-M.R., Xu, C., Cheng, K.-S., Yang, J.-F.K., Swamy, M.N.S., Li, S., Ding, J.-W. (eds.) PCM 2008. LNCS, vol. 5353, pp. 248–258. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bouchekif, A., Damnati, G., Charlet, D. (2013). Complementarity of Lexical Cohesion and Speaker Role Information for Story Segmentation of French TV Broadcast News. In: Dediu, AH., MartĂ­n-Vide, C., Mitkov, R., Truthe, B. (eds) Statistical Language and Speech Processing. SLSP 2013. Lecture Notes in Computer Science(), vol 7978. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39593-2_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-39593-2_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-39592-5

  • Online ISBN: 978-3-642-39593-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics