Skip to main content

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3878))

Abstract

Since the early ages of artificial intelligence, associative or semantic networks have been proposed as representations that enable the storage of language units and the relationships that interconnect them, allowing for a variety of inference and reasoning processes, and simulating some of the functionalities of the human mind. The symbolic structures that emerge from these representations correspond naturally to graphs – relational structures capable of encoding the meaning and structure of a cohesive text, following closely the associative or semantic memory representations. The activation or ranking of nodes in such graph structures mimics to some extent the functioning of human memory, and can be turned into a rich source of knowledge useful for several language processing applications. In this paper, we suggest a framework for the application of graph-based ranking algorithms to natural language processing, and illustrate the application of this framework to two traditionally difficult text processing tasks: word sense disambiguation and text summarization.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Anderson, J.: A spreading activation theory of memory. Journal of Verbal Learning and Verbal Behavior 22 (1983)

    Google Scholar 

  2. Berger, H., Dittenbach, M., Merkl, D.: An adaptive information retrieval system based on associative networks. In: Proceedings of the first Asian-Pacific conference on Conceptual modelling, Dunedin, New Zealand (2004)

    Google Scholar 

  3. Brin, S., Page, L.: The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems 30, 1–7 (1998)

    Article  Google Scholar 

  4. Budanitsky, A., Hirst, G.: Semantic distance in wordnet: An experimental, application-oriented evaluation of five measures. In: Proceedings of the NAACL Workshop on WordNet and Other Lexical Resources, Pittsburgh (June 2001)

    Google Scholar 

  5. Cohen, P., Kjeldsen, R.: Information retrieval by constrained spreading activation in semantic networks. Information Processing and Management 23, 4 (1987)

    Article  Google Scholar 

  6. Collins, A.M., Loftus, E.: A spreading-activation theory of semantic processing. Psychological Review 82, 6 (1975)

    Article  Google Scholar 

  7. Dom, B., Eiron, I., Cozzi, A., Shang, Y.: Graph-based ranking algorithms for e-mail expertise analysis. In: Proceedings of the 8th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery, San Diego, California (2003)

    Google Scholar 

  8. DUC. Document understanding conference (2002), http://www-nlpir.nist.gov/projects/duc/

  9. Freud, S. Psychopathology of everyday life. Payot (1901)

    Google Scholar 

  10. Grimmett, G., Stirzaker, D.: Probability and Random Processes. Oxford University Press, Oxford (1989)

    Google Scholar 

  11. Hirst, G.: Resolving lexical ambiguity computationally with spreading activation and Polaroid words. In: Small, S., Cottrell, G., Tanenhaus, M. (eds.) Lexical Ambiguity Resolution. Morgan Kaufmann, San Francisco (1988)

    Google Scholar 

  12. Jannink, J.: A Word Nexus for Systematic Interoperation of Semantically Heterogeneous Data Sources. PhD thesis, Stanford University (2001)

    Google Scholar 

  13. Kleinberg, J.: Authoritative sources in a hyperlinked environment. Journal of the ACM 46(5), 604–632 (1999)

    Article  MATH  MathSciNet  Google Scholar 

  14. Landauer, T.K., Foltz, P., Laham, D.: Introduction to latent semantic analysis. Discourse Processes 25 (1998)

    Google Scholar 

  15. Lesk, M.: Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from an ice cream cone. In: Proceedings of the SIGDOC Conference 1986, Toronto (June 1986)

    Google Scholar 

  16. Lin, C., Hovy, E.: Automatic evaluation of summaries using n-gram co-occurrence statistics. In: Proceedings of Human Language Technology Conference (HLT-NAACL 2003), Edmonton, Canada (May 2003)

    Google Scholar 

  17. Mihalcea, R.: Graph-based ranking algorithms for sentence extraction, applied to text summarization. In: Proceedings of the 42nd Annual Meeting of the Association for Computational Lingusitics (ACL 2004), Barcelona, Spain (2004) (companion volume)

    Google Scholar 

  18. Mihalcea, R.: Large vocabulary unsupervised word sense disambiguation with graph-based algorithms for sequence data labeling. In: Proceedings of the Human Language Technology Empirical Methods in Natural Language Processing conference, Vancouver (2005)

    Google Scholar 

  19. Mihalcea, R., Tarau, P.: TextRank – bringing order into texts. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2004), Barcelona, Spain (2004)

    Google Scholar 

  20. Mihalcea, R., Tarau, P.: An algorithm for language independent single and multiple document summarization. In: Proceedings of the International Joint Conference on Natural Language Processing (IJCNLP-2005), Korea (2005)

    Google Scholar 

  21. Mihalcea, R., Tarau, P., Figa, E.: PageRank on semantic networks, with application to word sense disambiguation. In: Proceedings of the 20th International Conference on Computational Linguistics (COLING 2004), Geneva, Switzerland (2004)

    Google Scholar 

  22. Miller, G.: Wordnet: A lexical database. Communication of the ACM 38(11), 39–41 (1995)

    Article  Google Scholar 

  23. Miller, G., Leacock, C., Randee, T., Bunker, R.: A semantic concordance. In: Proceedings of the 3rd DARPA Workshop on Human Language Technology, Plainsboro, New Jersey (1993)

    Google Scholar 

  24. Moldovan, D., Lee, W., Lin, C.: Parallel knowledge processing on SNAP. IEEE Transactions on Knowledge and Data Engineering 5(1) (1993)

    Google Scholar 

  25. Palmer, M., Fellbaum, C., Cotton, S., Delfs, L., Dang, H.: English tasks: all-words and verb lexical sample. In: Proceedings of ACL/SIGLEX Senseval-2, Toulouse, France (2001)

    Google Scholar 

  26. Quillian, M.: Semantic memory. In: Minsky, M. (ed.) Semantic Information Processing. MIT Press, Cambridge (1968)

    Google Scholar 

  27. Schvaneveldt, R.: Pathfinder Associative networks: studies in knowledge organization, Norwood (1989)

    Google Scholar 

  28. Snyder, B., Palmer, M.: The English all-words task. In: Proceedings of ACL/SIGLEX Senseval-3, Barcelona, Spain (July 2004)

    Google Scholar 

  29. Spitzer, M.: The mind within the net: models of learning, thinking, and acting. MIT Press, Cambridge (1999)

    Google Scholar 

  30. Vanderwende, L., Banko, M., Menezes, A.: Event-centric summary generation. In: Proceedings of the Document Understanding Conference (2004)

    Google Scholar 

  31. Veronis, J., Ide, N.: Word sense disambiguation with very large neural networks extracted from machine readable dictionaries. In: Proceedings of the 13th International Conference on Computational Linguistics (COLING 1990), Helsinki, Finland (August 1990)

    Google Scholar 

  32. Wolf, F., Gibson, E.: Paragraph-, word-, and coherence-based approaches to sentence ranking: A comparison of algorithm and human performance. In: Proceedings of the 42nd Meeting of the Association for Computational Linguistics, Barcelona, Spain (July 2004)

    Google Scholar 

  33. Zock, M., Bilac, S.: Word lookup on the basis of associations: from an idea to a roadmap. In: Proceedings of the Coling 2004 workshop on Enhancing and Using Electronic Dictionaries, Geneva, Switzerland (August 2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Mihalcea, R. (2006). Random Walks on Text Structures. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2006. Lecture Notes in Computer Science, vol 3878. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11671299_27

Download citation

  • DOI: https://doi.org/10.1007/11671299_27

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-32205-4

  • Online ISBN: 978-3-540-32206-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics