Skip to main content

Semantic Similarity Calculation of Short Texts Based on Language Network and Word Semantic Information

  • Conference paper
Advanced Computer Architecture

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 451))

Abstract

We first analyzes the deviation when current similarity calculation methods for texts are applied to short texts, and proposes a similarity calculation method for short texts based on language network and word semantic information. Firstly, models the short texts as language network according to the complex-network characteristic of human being’s language. Then analyzes the comprehensive eigenvalue of the words in the language network and the word similarity between different texts to obtain the word semantic. Calculate the similarity between short texts combining language network and word semantic. Finally the effectiveness of proposed algorithm is verified through clustering algorithm experiments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Fung, B.C.M., Wang, K., Ester, M.: Hierarchical document clustering. In: John, W. (ed.) The Encyclopedia of Data Warehousing and Mining, pp. 970–975. Idea Group (2005)

    Google Scholar 

  2. Hall, P., Dowling, G.: Approximate string matching. Computing Survey 12(4), 381–402 (1980)

    Article  MathSciNet  Google Scholar 

  3. Lamontagne, L., Lee, H.-H.: Textual reuse for email response. In: Funk, P., González Calero, P.A. (eds.) ECCBR 2004. LNCS (LNAI), vol. 3155, pp. 242–256. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  4. Glass, J., et al.: A Framework for Developing Conversational User Interfaces. In: Fourth International Conference on Computer-Aided Design of User Interfaces, Funchal, Isle of Madeira, Portugal (2004)

    Google Scholar 

  5. Bickmore, T., Giorgino, T.: Health dialog systems for patients and consumers. J. Biomed. Inform. 39(5), 556–571 (2006)

    Article  Google Scholar 

  6. Cassell, J., et al.: Embodied Conversational Agents (2000)

    Google Scholar 

  7. Gorin, A.L., Riccardi, G., Wright, J.H.: How I help you? Speech Communication 23, 113–127 (1997)

    Article  Google Scholar 

  8. Graesser, A.C., et al.: AutoTutor: An Intelligent Tutoring System With Mixed Initiative Dialogue. IEEE Transactions on Education 48(4), 612–618 (2005)

    Article  Google Scholar 

  9. Salton, G.: The SMART Retrieval System-Experiments in Automatic Document Processing. Prentice Hall Inc., Englewood Cliffs (1971)

    Google Scholar 

  10. Dinesh, R., Harish, B.S., Guru, D.S., Manjunath, S.: Concept of Status Matrix in Text Classification. In: The Proceedings of Indian International Conference on Artificial Intelligence, Tumkur, India, pp. 2071–2079 (2009)

    Google Scholar 

  11. Mitra, V., Wang, C.J., Banerjee, S.: Text Classification: A least square support vector machine approach. Journal of Applied Soft Computing 2007(7), 908–914 (2007)

    Article  Google Scholar 

  12. Fung, G.P.C., Yu, J.X., Lu, H., Yu, P.S.: Text classification without negative example revisit. IEEE Transactions on Knowledge and Data Engineering 2006(18), 23–47 (2006)

    Google Scholar 

  13. Strogatz, S.H., Stewart, I.: Coupled oscillators and biological synchronization. Sci. Am. 269(6), 102–109 (1993)

    Article  Google Scholar 

  14. Gerhardt, M., Schuster, H., Tyson, J.J.: A cellular automaton model of excitable media including curvature and dispersion. Science 247, 1563–1566 (1990)

    Article  MATH  MathSciNet  Google Scholar 

  15. Hopfield, J.J., Herz, A.V.M.: Rapid local synchronization of action potentials: Toward computation with coupled integrate-and-fire neurons. Proc. Natl Acad. Sci. USA 92, 6655–6662 (1995)

    Article  Google Scholar 

  16. Nowak, M.A., May, R.M.: Evolutionary games and spatial chaos. Nature 359, 826–829 (1992)

    Article  Google Scholar 

  17. Kauffman, S.A.: Metabolic stability and epigenesis in randomly constructed genetic nets. J. Theor. Biol. (22), 437–467 (1969)

    Google Scholar 

  18. Watts, D., Strogatz, S.: Collective dynamics of ‘small-world’ networks. Nature 393, 440–442 (1998)

    Article  Google Scholar 

  19. Harris, Z.: Distributional Structure. Word (10), 146–162 (1954)

    Google Scholar 

  20. Firth, J.R.: A Synopsis of Linguistic Theory, 1930–1957. In: Special Volume of the Philological Society. Blackwell, Oxford (1957)

    Google Scholar 

  21. Miller, G., Charles, W.: Contextual Correlates of Semantic Similarity. Language and Cognitive Processes 6, 1–28 (1991)

    Article  Google Scholar 

  22. Li, Y., McLean, D., Bandar, Z.A., James, D.: Sentence Similarity Based on Semantic Nets and Corpus Statistics. IEEE Transactions on Knowledge and Data Engineering 18(8), 1138–1150 (2006)

    Article  Google Scholar 

  23. Ferrer i Cancho, R., Sole, R.V.: The small world of human language. Biological Sciences 268(1482), 2261–2265 (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhan, Z., Lin, F., Yang, X. (2014). Semantic Similarity Calculation of Short Texts Based on Language Network and Word Semantic Information. In: Wu, J., Chen, H., Wang, X. (eds) Advanced Computer Architecture. Communications in Computer and Information Science, vol 451. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-44491-7_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-44491-7_17

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-44490-0

  • Online ISBN: 978-3-662-44491-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics