Abstract
In order to enable automatic generation of sports news, in this paper, we propose an extraction method to extract summary sentences from live sports text. After analyzing the characteristics of live sports text, we regard extraction of summary sentence as the sequence tagging problem, and decide to use Conditional Random Fields (CRFs) as the extraction model. Firstly, we expend the correlated words of keywords using word2vec. Then, we select positive correlated words, negative correlated words, time and the window of score changes as features to train the model and extract summary sentences. This method get good results on the evaluation indicators of ROUGE-1, GOUGE-2 and ROUGE-SU4. And it shows that this method has a meaningful influence on automatic summarization and automatic generation of sports news.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Luhn, H.P.: The automatic creation of literature abstracts. IBM J. Res. Dev. 2(2), 159–165 (1958)
Prasad Pingali, R.K., Varma, V.: IIIT Hyderabad at DUC 2007. In: Proceedings of DUC 2007 (2007)
Lin, Z., Chua, T.S., Kan, M.Y., et al.: NUS at DUC 2007: using evolutionary models of text. In: Proceedings of Document Understanding Conference (DUC) (2007)
He, T., Shao, W., Xiao, H.S., et al.: The implementation of a query-directed multi-document summarization system. In: 6th International Conference on Advanced Language Processing and Web Information Technology, ALPIT 2007, pp. 105–110. IEEE (2007)
Liu, J., Xu, J., Zhang, Y.: Summarization based on hidden topic Markov model with multi-features. Acta Scientiarum Naturalium Universitatis Pekinensis 1, 027 (2014)
Cheng, Y., Silamu, W., Hasimua, M.: Automatic text summarization based on comprehensive characteristics of sentence. Comput. Sci. 42(4), 226–229 (2015)
Mikolov, T., Chen, K., Corrado, G., et al.: Efficient estimation of word representations in vector space (2013). arXiv preprint arXiv:1301.3781
Mikolov, T., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems (2013)
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: Proceedings of 18th International Conference on Machine Learning, ICML, vol. 1, pp. 282–289 (2001)
Lin, C.Y.: Rouge: a package for automatic evaluation of summaries. In: Workshop Text Summarization Branches Out: Proceedings of ACL-2004, vol. 8 (2004)
Acknowledgements
This work is supported by the National Natural Science Foundation of China under Grants Nos. 61271304, 61671070, Beijing Advanced Innovation Center for Imaging Technology BAICIT-2016003, National Social Science Foundation of China under Grants Nos. 14@ZH036, 15ZDB017, National Language Committee of China under Grants No. ZDA125-26.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Zhu, L., Wang, W., Chen, Y., Lv, X., Zhou, J. (2016). Research on Summary Sentences Extraction Oriented to Live Sports Text. In: Lin, CY., Xue, N., Zhao, D., Huang, X., Feng, Y. (eds) Natural Language Understanding and Intelligent Applications. ICCPOL NLPCC 2016 2016. Lecture Notes in Computer Science(), vol 10102. Springer, Cham. https://doi.org/10.1007/978-3-319-50496-4_72
Download citation
DOI: https://doi.org/10.1007/978-3-319-50496-4_72
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-50495-7
Online ISBN: 978-3-319-50496-4
eBook Packages: Computer ScienceComputer Science (R0)