Skip to main content

Football News Generation from Chinese Live Webcast Script

  • Conference paper
  • First Online:
Natural Language Understanding and Intelligent Applications (ICCPOL 2016, NLPCC 2016)

Abstract

Challenges exist in the field of sports news generation automatically from webcast that (1) finding hot events and sentences accurately; (2) organizing the selected sentences with highly readability. This paper proposes a framework to generate sports news automatically. First, to obtain accurate hot events and sentences, we design a neural network to predict the probabilities that each statement in live webcast script appears in the writing news, where the inputs of the neural network are weighed word vectors obtained from football keywords dictionary, and the outputs the similarity of statements in training live webcast script and sentences in training news. In this way, the “good” sentences selected from webcast contribute to the semi-finished sport news. To make the generated news to be possibly similar to human writing, we adopt idioms often appeared in football game to describe or summarize the games’ development or turns between the selected sentences, and come into being the final sport news. The proposed framework are validated on the training and test data set proved by “Sports News Generation from Live Webcast scripts” task of NLPCC 2016, the experiments show that the proposed method present good performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://js.qq.com/a/20150911/016002.htm.

  2. 2.

    https://www.washingtonpost.com/pr/wp/2016/08/05/the-washington-post-experiments-with-automated-storytelling-to-help-power-2016-rio-olympics-coverage/.

References

  1. Schiller, V.H.: System, report, and method for generating natural language news-based stories: US, US8494944 (2013)

    Google Scholar 

  2. Dixon, T.: Financial News Generation System: WO/2012/119247 (2012)

    Google Scholar 

  3. Tornoe, R.: Learn to Stop Worrying and Love Robot Journalists. Editor & Publisher (2014)

    Google Scholar 

  4. Wan, X., Yang, J., Xiao, J.: Manifold-ranking based topic-focused multi-document summarization. In: IJCAI, vol. 7, pp. 2903–2908 (2007)

    Google Scholar 

  5. Hovy, E., Lin, C.Y.: Automated text summarization and the SUMMARIST system. In: Proceedings of a Workshop on Held at Baltimore, Maryland, 13–15 October 1998, pp. 197–214. Association for Computational Linguistics (1998)

    Google Scholar 

  6. Lin, C.Y., Hovy, E.: From single to multi-document summarization: a prototype system and its evaluation. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 457–464. Association for Computational Linguistics (2002)

    Google Scholar 

  7. Evans, D.K., Klavans, J.L., McKeown, K.R.: Columbia newsblaster: multilingual news summarization on the web. In: Demonstration Papers at HLT-NAACL 2004, pp. 1–4. Association for Computational Linguistics (2004)

    Google Scholar 

  8. Radev, D., Otterbacher, J., Winkel, A., et al.: NewsInEssence: summarizing online news topics. Commun. ACM 48(10), 95–98 (2005)

    Article  Google Scholar 

  9. Min, K., Ma, C., Zhao, T., Li, H.: BosonNLP: an ensemble approach for word segmentation and POS tagging. In: Li, J., Ji, H., Zhao, D., Feng, Y. (eds.) NLPCC 2015. LNCS (LNAI), vol. 9362, pp. 520–526. Springer, Heidelberg (2015). doi:10.1007/978-3-319-25207-0_48

    Chapter  Google Scholar 

  10. Chuang, W.T., Yang, J.: Extracting sentence segments for text summarization: a machine learning approach. In: Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 152–159. ACM (2000)

    Google Scholar 

  11. Zhang, Q., Huang, X., Wu, L.: A new method for calculating similarity between sentences and application on automatic text summarization. In: Proceedings of the First National Conference on Information Retrieval and Content Security (2004)

    Google Scholar 

  12. Conroy, J.M., O’leary, D.P.: Text summarization via hidden markov models. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 406–407. ACM (2001)

    Google Scholar 

  13. Zhang, P., Li, C.: Automatic text summarization based on sentences clustering and extraction. In: 2nd IEEE International Conference on Computer Science and Information Technology, ICCSIT 2009, pp. 167–170. IEEE (2009)

    Google Scholar 

  14. Lin, C.-Y.: ROUGE: a package for automatic evaluation of summaries. In: Proceedings of the Workshop on Text Summarization Branches Out (WAS 2004), Barcelona, Spain, 25–26 July 2004 (2004a)

    Google Scholar 

  15. Lin, C.Y., Hovy, E.: Automatic evaluation of summaries using n-gram co-occurrence statis-tics. In: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, vol. 1, pp. 71–78. Association for Computational Linguistics (2003)

    Google Scholar 

Download references

Acknowledge

This work is supported by the National Key Research & Development Plan of China (No. 2016YFB1001404), the Strategic Priority Research Program of the CAS (Grant XDB02080006), the National High-Tech Research and Development Program of China(863 Program) (No. 2015AA016305), the National Natural Science Foundation of China (NSFC) (No. 61425017, No. 61332017, No. 61375027, No. 61203258, No. 61273288), the Strategic Priority Research Program of the CAS (Grant XDB02080006), and the Guangxi Science and Technology Development Project (No: 1598018-6), the Guangxi Key Laboratory of Trusted Software of Guilin University of Electronic Technology (KX201514).

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Renjun, T. et al. (2016). Football News Generation from Chinese Live Webcast Script. In: Lin, CY., Xue, N., Zhao, D., Huang, X., Feng, Y. (eds) Natural Language Understanding and Intelligent Applications. ICCPOL NLPCC 2016 2016. Lecture Notes in Computer Science(), vol 10102. Springer, Cham. https://doi.org/10.1007/978-3-319-50496-4_70

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-50496-4_70

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-50495-7

  • Online ISBN: 978-3-319-50496-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics