Abstract
As the rapid development of the internet, we can collect more and more information. it also means we need the abitily to search the information which really useful to us from the amount of information quickly. Automatic summarization is useful to us for handling the huge amount of text information in the Web. This paper proposes a Chinese summarization method based on Affinity Propagation(AP)clustering and latent semantic analysis(LSA). AP is a new clustering algorithm raised by B. J. Frey on science in 2007 that takes as input measures of similarity between pairs of data points and simultaneously considersĀ allĀ data points as potential exemplars. LSA is a technique in natural language processing, in particular in vectorial semantics, of analyzing relationships between a set of sentences. Experiment results show that our method could get more comprehensive and high-quality summarization.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Sicui, W., Weijiang, L., Feng, W., Hui, D.: A Survey on Automatic Summarization. In: International Forum on Information Technology and Applications, IFITA (2010)
Frey, B.J., Dueck, D.: Clustering by passing messages between data points. ScienceĀ 315, 972 (2007)
MĆ©zard, M.: Where Are the Exemplars? ScienceĀ 315, 972 (2007)
Eiler, J.M.: On the Origins of Granites. ScienceĀ 315, 972 (2007)
Kummamuru, K., Lotlikar, R.: A hierarchical monothetic document clustering algorithm for summarization and browsing search results. In: Proceedings of the 13th International Conference on World Wide Web. ACM, New York (2004)
Ai, D., Yuchao, Z., Dezheng, Z.: Automatic text summarization based on latent semantic indexing. In: Artificial Life and Robotics (2010)
Edmundson, H.P.: New methods in automatic abstracting. Journal of the Association for Computing MachineryĀ 16(2), 264ā285 (1969)
Paice, C.D.: The automatic generation of literature abstracts: an approach based on the identification of self-indicating phrases. In: Proceedings of the Third Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 172ā191 (1981)
Kupiec, J., Pedersen, J., Chen, F.: A trainable document summarizer. In: Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 68ā73 (1995)
Barzilay, R., Elhadad, M.: Using lexical chains for text summarization. In: Advances in Automatic Text Summarization, pp. 111ā121. MIT Press (1999)
Zajic, D., Dorr, B.J., Lin, J., Schwartz, R.: Multi-candidate reduction: sentence compression as a tool for document summarization tasks. Information Processing & ManagementĀ 43(6), 1549ā1570 (2007)
Meng, W., Chun-gui, L., Pei-he, T., Xiao-rong, W.: Chinese Automatic Summarization Based on Thematic Sentence Discovery. Computer EngineeringĀ 33(8), 180ā181 (2007)
Mihalcea, R., Tarau, P.: TextRank: Bringing Order into Texts. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Barcelona, Spain (2004)
Yeh, J.-Y., Ke, H.-R.: Text summarization using a trainable summarizer and latent semantic analysis. Information Processing & Management (2005)
Xie, S., Liu, Y., Hansen, J.H.L., Harabagiu, S.: Automatic Extractive Summarization on Meeting Corpus (2010)
Antiqueira, L., Oliveira Jr., O.N., da Fontoura Costa, L.: A complex network approach to text summarization. Information Sciences (2009)
Landauer, T.K., Foltz, P.W., Laham, D.: An introduction to latent semantic analysis. Discourse Processes (1998)
Morris, A.H., et al.: The effects and limitations of automated text condensing on reading comprehension performance. Information Systems Research (1992)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
Ā© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yang, R., Bu, Z., Xia, Z. (2012). Automatic Summarization for Chinese Text Using Affinity Propagation Clustering and Latent Semantic Analysis. In: Wang, F.L., Lei, J., Gong, Z., Luo, X. (eds) Web Information Systems and Mining. WISM 2012. Lecture Notes in Computer Science, vol 7529. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33469-6_67
Download citation
DOI: https://doi.org/10.1007/978-3-642-33469-6_67
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33468-9
Online ISBN: 978-3-642-33469-6
eBook Packages: Computer ScienceComputer Science (R0)