Skip to main content
Log in

Computing Semantic Clusters by Semantic Mirroring and Spectral Graph Partitioning

  • Published:
Mathematics in Computer Science Aims and scope Submit manuscript

Abstract

Using the technique of semantic mirroring a graph is obtained that represents words and their translations from a parallel corpus or a bilingual lexicon. The connectedness of the graph holds information about the semantic relations of words that occur in the translations. Spectral graph theory is used to partition the graph, which leads to a grouping of the words in different clusters. We illustrate the method using a small sample of seed words from a lexicon of Swedish and English adjectives and discuss its application to computational lexical semantics and lexicography.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Bui T.N., Jones C.: Finding good approximate vertex and edge partitions is NP-hard. Inf. Process. Lett. 42, 153–159 (1992)

    Article  MathSciNet  MATH  Google Scholar 

  2. Chugur, I., Penas, A., Gonzalo, J., Verdejo, F.: Monolingual and bilingual dictionary approaches to the enrichment of the Spanish WordNet with adjectives. In: Proceedings of WordNet and Other Lexical Resources Workshop, NAACL, Pittsburgh (2001)

  3. Chung, F.: Laplacians of graphs and Cheeger’s inequalities. In: Miklós, D., Sós, V.T., Szonyi, T. (eds.) Combinatorics, Paul Erdös is Eighty, vol. 2. pp. 157–172. Bolyai Soc. Math. Stud., vol. 2. János Bolyai Math. Soc., Budapest (1996)

  4. Chung, F.: Spectral Graph Theory. CBMS Regional Conference Series in Mathematics Number 92. American Mathematical Society (1997)

  5. Diab, M., Resnik, P.: An unsupervised method for word sense tagging using parallel corpora. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pp. 255–262 (2002)

  6. Dyvik, H.: A translational basis for semantics. In: Johansson, S., Oksefjell, S. (eds.) Corpora and Crosslinguistic Research: Theory, Method and Case Studies, pp. 51–86. Rodopi (1998)

  7. Dyvik, H.: Translations as semantic mirrors: from parallel corpora to wordnet. In: 23rd International Conference on English Language Research on Computerized Corpora of Modern and Medieval English (ICAME 23). Gothenburg (2002)

  8. Dyvik, H.: Translations as a Semantic Knowledge Source. In: Proceedings of the Second Baltic Conference on Human Language Technologies, pp. 27–38, Tallinn, Institute of Cybernetics, Tallinn University of Technology and Institute of the Estonian Language (2005)

  9. Fagerlund, M., Merkel, M., Eldén, L., Ahrenberg, L.: Computing word senses by semantic mirroring and spectral graph partitioning. In: Proceedings of TextGraphs-5—2010 Workshop on Graph-based Methods for Natural Language Processing, pp. 103–107. Association for Computational Linguistics. Uppsala, Sweden (2010)

  10. Fellbaum, C. (eds): WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1998)

    MATH  Google Scholar 

  11. Foo, J., Merkel, M.: Computer aided term bank creation and standardization: building standardized term banks through automated term extraction and advanced editing tools. In: Terminology in Everyday Life. John Benjamins (2010)

  12. Ganter B., Wille R.: Formal Concept Analysis. Mathematical Foundations. Springer, Berlin (1999)

    Book  MATH  Google Scholar 

  13. Ide N.: Cross-lingual sense determination: Can it work?. Comput. Humanit. Specl. Issue SENSEVAL 34, 223–234 (2000)

    Article  Google Scholar 

  14. Kilgarriff, A.: Thesauruses for natural language processing. In: 2003 International Conference on Natural Language Processing and Knowledge Engineering, 2003. Proceedings, pp. 5–13 (2003)

  15. Koehn, P.: Europarl: a parallel corpus for statistical machine translation. In: The Tenth Machine Translation Summit, Phuket, Thailand, September 12–16 (2005)

  16. Lilliehök, H., Merkel, M.: Clustering word senses from semantic mirroring data. In: Proceedings of the Workshop on Lexical Semantic Resources for NLP at NODALIDA, Oslo, Norway, NEALT Proceedings Series, vol. 19 (2013)

  17. Lindén, K.: Word Sense Discovery and Disambiguation. PhD thesis, University of Helsinki, Helsinki (2005)

  18. Miller G.A.: Wordnet: a lexical database for English. Commun. ACM 38, 39–41 (1995)

    Article  Google Scholar 

  19. Pothen A., Simon H.D., Liou K.-P.: Partitioning sparse matrices with eigenvectors of graphs. SIAM J. Matrix Anal. Appl. 11(3), 430–452 (1990)

    Article  MathSciNet  MATH  Google Scholar 

  20. Priss, U., Old, L.J.: Conceptual exploration of semantic mirrors. In: Formal Concept Analysis: Third International Conference, ICFCA (2005)

  21. Resnik P., Yarowsky D.: Distinguishing systems and distinguishing senses: new evaluation methods for Word Sense Disambiguation. Nat. Lang. Eng. 5, 113–133 (1999)

    Article  Google Scholar 

  22. Simon, H.D.: Partitioning of unstructured problems for parallel processing. Parallel Methods on Large-scale Structural Analysis and Physics Applications. Comput. Syst. Eng. 2(2–3), 135–148 (1991)

  23. von Luxburg U.: A tutorial on spectral clustering. Statist. Comput. 17, 395–416 (2007). doi:10.1007/s11222-007-9033-z

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lars Eldén.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Eldén, L., Merkel, M., Ahrenberg, L. et al. Computing Semantic Clusters by Semantic Mirroring and Spectral Graph Partitioning. Math.Comput.Sci. 7, 293–313 (2013). https://doi.org/10.1007/s11786-013-0159-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11786-013-0159-4

Keywords

Mathematics Subject Classification (2010)

Navigation