Skip to main content

Automatic Summarization for Chinese Text Using Affinity Propagation Clustering and Latent Semantic Analysis

  • Conference paper
Web Information Systems and Mining (WISM 2012)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7529))

Included in the following conference series:

  • 2778 Accesses

Abstract

As the rapid development of the internet, we can collect more and more information. it also means we need the abitily to search the information which really useful to us from the amount of information quickly. Automatic summarization is useful to us for handling the huge amount of text information in the Web. This paper proposes a Chinese summarization method based on Affinity Propagation(AP)clustering and latent semantic analysis(LSA). AP is a new clustering algorithm raised by B. J. Frey on science in 2007 that takes as input measures of similarity between pairs of data points and simultaneously considersĀ allĀ data points as potential exemplars. LSA is a technique in natural language processing, in particular in vectorial semantics, of analyzing relationships between a set of sentences. Experiment results show that our method could get more comprehensive and high-quality summarization.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Sicui, W., Weijiang, L., Feng, W., Hui, D.: A Survey on Automatic Summarization. In: International Forum on Information Technology and Applications, IFITA (2010)

    Google ScholarĀ 

  2. Frey, B.J., Dueck, D.: Clustering by passing messages between data points. ScienceĀ 315, 972 (2007)

    ArticleĀ  MathSciNetĀ  MATHĀ  Google ScholarĀ 

  3. MĆ©zard, M.: Where Are the Exemplars? ScienceĀ 315, 972 (2007)

    ArticleĀ  MathSciNetĀ  Google ScholarĀ 

  4. Eiler, J.M.: On the Origins of Granites. ScienceĀ 315, 972 (2007)

    ArticleĀ  MathSciNetĀ  Google ScholarĀ 

  5. Kummamuru, K., Lotlikar, R.: A hierarchical monothetic document clustering algorithm for summarization and browsing search results. In: Proceedings of the 13th International Conference on World Wide Web. ACM, New York (2004)

    Google ScholarĀ 

  6. Ai, D., Yuchao, Z., Dezheng, Z.: Automatic text summarization based on latent semantic indexing. In: Artificial Life and Robotics (2010)

    Google ScholarĀ 

  7. Edmundson, H.P.: New methods in automatic abstracting. Journal of the Association for Computing MachineryĀ 16(2), 264ā€“285 (1969)

    ArticleĀ  MATHĀ  Google ScholarĀ 

  8. Paice, C.D.: The automatic generation of literature abstracts: an approach based on the identification of self-indicating phrases. In: Proceedings of the Third Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 172ā€“191 (1981)

    Google ScholarĀ 

  9. Kupiec, J., Pedersen, J., Chen, F.: A trainable document summarizer. In: Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 68ā€“73 (1995)

    Google ScholarĀ 

  10. Barzilay, R., Elhadad, M.: Using lexical chains for text summarization. In: Advances in Automatic Text Summarization, pp. 111ā€“121. MIT Press (1999)

    Google ScholarĀ 

  11. Zajic, D., Dorr, B.J., Lin, J., Schwartz, R.: Multi-candidate reduction: sentence compression as a tool for document summarization tasks. Information Processing & ManagementĀ 43(6), 1549ā€“1570 (2007)

    ArticleĀ  Google ScholarĀ 

  12. Meng, W., Chun-gui, L., Pei-he, T., Xiao-rong, W.: Chinese Automatic Summarization Based on Thematic Sentence Discovery. Computer EngineeringĀ 33(8), 180ā€“181 (2007)

    Google ScholarĀ 

  13. Mihalcea, R., Tarau, P.: TextRank: Bringing Order into Texts. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Barcelona, Spain (2004)

    Google ScholarĀ 

  14. Yeh, J.-Y., Ke, H.-R.: Text summarization using a trainable summarizer and latent semantic analysis. Information Processing & Management (2005)

    Google ScholarĀ 

  15. Xie, S., Liu, Y., Hansen, J.H.L., Harabagiu, S.: Automatic Extractive Summarization on Meeting Corpus (2010)

    Google ScholarĀ 

  16. Antiqueira, L., Oliveira Jr., O.N., da Fontoura Costa, L.: A complex network approach to text summarization. Information Sciences (2009)

    Google ScholarĀ 

  17. Landauer, T.K., Foltz, P.W., Laham, D.: An introduction to latent semantic analysis. Discourse Processes (1998)

    Google ScholarĀ 

  18. Morris, A.H., et al.: The effects and limitations of automated text condensing on reading comprehension performance. Information Systems Research (1992)

    Google ScholarĀ 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

Ā© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Yang, R., Bu, Z., Xia, Z. (2012). Automatic Summarization for Chinese Text Using Affinity Propagation Clustering and Latent Semantic Analysis. In: Wang, F.L., Lei, J., Gong, Z., Luo, X. (eds) Web Information Systems and Mining. WISM 2012. Lecture Notes in Computer Science, vol 7529. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33469-6_67

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-33469-6_67

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-33468-9

  • Online ISBN: 978-3-642-33469-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics