Abstract
Challenges exist in the field of sports news generation automatically from webcast that (1) finding hot events and sentences accurately; (2) organizing the selected sentences with highly readability. This paper proposes a framework to generate sports news automatically. First, to obtain accurate hot events and sentences, we design a neural network to predict the probabilities that each statement in live webcast script appears in the writing news, where the inputs of the neural network are weighed word vectors obtained from football keywords dictionary, and the outputs the similarity of statements in training live webcast script and sentences in training news. In this way, the “good” sentences selected from webcast contribute to the semi-finished sport news. To make the generated news to be possibly similar to human writing, we adopt idioms often appeared in football game to describe or summarize the games’ development or turns between the selected sentences, and come into being the final sport news. The proposed framework are validated on the training and test data set proved by “Sports News Generation from Live Webcast scripts” task of NLPCC 2016, the experiments show that the proposed method present good performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Schiller, V.H.: System, report, and method for generating natural language news-based stories: US, US8494944 (2013)
Dixon, T.: Financial News Generation System: WO/2012/119247 (2012)
Tornoe, R.: Learn to Stop Worrying and Love Robot Journalists. Editor & Publisher (2014)
Wan, X., Yang, J., Xiao, J.: Manifold-ranking based topic-focused multi-document summarization. In: IJCAI, vol. 7, pp. 2903–2908 (2007)
Hovy, E., Lin, C.Y.: Automated text summarization and the SUMMARIST system. In: Proceedings of a Workshop on Held at Baltimore, Maryland, 13–15 October 1998, pp. 197–214. Association for Computational Linguistics (1998)
Lin, C.Y., Hovy, E.: From single to multi-document summarization: a prototype system and its evaluation. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 457–464. Association for Computational Linguistics (2002)
Evans, D.K., Klavans, J.L., McKeown, K.R.: Columbia newsblaster: multilingual news summarization on the web. In: Demonstration Papers at HLT-NAACL 2004, pp. 1–4. Association for Computational Linguistics (2004)
Radev, D., Otterbacher, J., Winkel, A., et al.: NewsInEssence: summarizing online news topics. Commun. ACM 48(10), 95–98 (2005)
Min, K., Ma, C., Zhao, T., Li, H.: BosonNLP: an ensemble approach for word segmentation and POS tagging. In: Li, J., Ji, H., Zhao, D., Feng, Y. (eds.) NLPCC 2015. LNCS (LNAI), vol. 9362, pp. 520–526. Springer, Heidelberg (2015). doi:10.1007/978-3-319-25207-0_48
Chuang, W.T., Yang, J.: Extracting sentence segments for text summarization: a machine learning approach. In: Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 152–159. ACM (2000)
Zhang, Q., Huang, X., Wu, L.: A new method for calculating similarity between sentences and application on automatic text summarization. In: Proceedings of the First National Conference on Information Retrieval and Content Security (2004)
Conroy, J.M., O’leary, D.P.: Text summarization via hidden markov models. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 406–407. ACM (2001)
Zhang, P., Li, C.: Automatic text summarization based on sentences clustering and extraction. In: 2nd IEEE International Conference on Computer Science and Information Technology, ICCSIT 2009, pp. 167–170. IEEE (2009)
Lin, C.-Y.: ROUGE: a package for automatic evaluation of summaries. In: Proceedings of the Workshop on Text Summarization Branches Out (WAS 2004), Barcelona, Spain, 25–26 July 2004 (2004a)
Lin, C.Y., Hovy, E.: Automatic evaluation of summaries using n-gram co-occurrence statis-tics. In: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, vol. 1, pp. 71–78. Association for Computational Linguistics (2003)
Acknowledge
This work is supported by the National Key Research & Development Plan of China (No. 2016YFB1001404), the Strategic Priority Research Program of the CAS (Grant XDB02080006), the National High-Tech Research and Development Program of China(863 Program) (No. 2015AA016305), the National Natural Science Foundation of China (NSFC) (No. 61425017, No. 61332017, No. 61375027, No. 61203258, No. 61273288), the Strategic Priority Research Program of the CAS (Grant XDB02080006), and the Guangxi Science and Technology Development Project (No: 1598018-6), the Guangxi Key Laboratory of Trusted Software of Guilin University of Electronic Technology (KX201514).
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Renjun, T. et al. (2016). Football News Generation from Chinese Live Webcast Script. In: Lin, CY., Xue, N., Zhao, D., Huang, X., Feng, Y. (eds) Natural Language Understanding and Intelligent Applications. ICCPOL NLPCC 2016 2016. Lecture Notes in Computer Science(), vol 10102. Springer, Cham. https://doi.org/10.1007/978-3-319-50496-4_70
Download citation
DOI: https://doi.org/10.1007/978-3-319-50496-4_70
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-50495-7
Online ISBN: 978-3-319-50496-4
eBook Packages: Computer ScienceComputer Science (R0)