Abstract
Text summarization techniques are widely used to generate abstracts of documents automatically, facilitating users to capture the needed information from a large amount of documents. However, it poses a great challenge to satisfy the information needs of users in different roles while keeping the most important information. To solve this problem, we propose a novel framework on text summarization, which incorporates historical information of users to generate a user-oriented personalized summarization. The framework adopts two topic models to model the comments of doctors and patients from health-related social networks about certain diseases, and then identifies topics that the two kinds of users are interested in. Based on the identified topics, we propose three methods for sentence ranking to generate summarizations about the diseases for doctors and patients, respectively. Experimental results show a high similarity between the generated summarization and the real interests of different users, better meeting the information needs while keeping the summarization performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Luhn, H.P.: The automatic creation of literature abstracts. IBM J. Res. Dev. 2(2), 159–165 (1958)
Louis, A., Joshi, A., Nenkova, A.: Discourse indicators for content selection in summarization. Paper Presented at the Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue (2010)
Zhang, H., Fiszman, M., Shin, D., Wilkowski, B., Rindflesch, T.C.: Clustering cliques for graph-based summarization of the biomedical research literature. BMC Bioinform. 14(1), 182 (2013)
Ma, Z., Sun, A., Yuan, Q., Cong, G.: Topic-driven reader comments summarization. Paper Presented at the Proceedings of the 21st ACM International Conference on Information and Knowledge Management (2012)
Rindflesch, T.C., Fiszman, M.: The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text. J. Biomed. Inf. 36(6), 462–477 (2003)
Zeng, Q.T., Tse, T.: Exploring and developing consumer health vocabularies. J. Am. Med. Inf. Assoc. 13(1), 24–29 (2006)
Deerwester, S.C., Dumais, S.T., Landauer, T.K., Furnas, G.W., Harshman, R.A.: Indexing by latent semantic analysis. JASIS 41(6), 391–407 (1990)
Hofmann, T.: Probabilistic latent semantic indexing. Paper Presented at the Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (1999)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Carbonell, J., Goldstein, J.: The use of MMR, diversity-based reranking for reordering documents and producing summaries. Paper Presented at the Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (1998)
Radev, D., Allison, T., Blair-Goldensohn, S., Blitzer, J., Celebi, A., Dimitrov, S., Liu, D.: MEAD-a platform for multidocument multilingual text summarization (2004)
Erkan, G., Radev, D.R.: LexRank: Graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. (JAIR) 22(1), 457–479 (2004)
Vanderwende, L., Suzuki, H., Brockett, C., Nenkova, A.: Beyond SumBasic: Task-focused summarization with sentence simplification and lexical expansion. Inf. Process. Manag. 43(6), 1606–1618 (2007)
Lin, C.-Y.: Rouge: a package for automatic evaluation of summaries. Paper Presented at the Text Summarization Branches Out: Proceedings of the ACL-04 Workshop (2004)
Achananuparp, P., Hu, X., Shen, X.: The evaluation of sentence similarity measures. In: Song, I.-Y., Eder, J., Nguyen, T.M. (eds.) DaWaK 2008. LNCS, vol. 5182, pp. 305–316. Springer, Heidelberg (2008)
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient Estimation of Word Representations in Vector Space. arXiv preprint arXiv:1301.3781 (2013)
Acknowledgements
This work is partially supported by grant from the Natural Science Foundation of China (No. 61277370, 61402075, 61572102), Natural Science Foundation of Liaoning Province, China (No. 201202031, 2014020003), State Education Ministry and The Research Fund for the Doctoral Program of Higher Education (No. 20090041110002), the Fundamental Research Funds for the Central Universities. The 12th five year national science and technology supporting programs of China under Grant No. 2015BAF20B02.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Xu, B., Lin, H., Hao, H., Yang, Z., Wang, J., Zhang, S. (2016). Generating User-oriented Text Summarization Based on Social Networks Using Topic Models. In: Li, Y., Xiang, G., Lin, H., Wang, M. (eds) Social Media Processing. SMP 2016. Communications in Computer and Information Science, vol 669. Springer, Singapore. https://doi.org/10.1007/978-981-10-2993-6_16
Download citation
DOI: https://doi.org/10.1007/978-981-10-2993-6_16
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-2992-9
Online ISBN: 978-981-10-2993-6
eBook Packages: Computer ScienceComputer Science (R0)