Skip to main content

Hypernymy Detection for Vietnamese Using Dynamic Weighting Neural Network

  • Conference paper
  • First Online:
Computational Linguistics and Intelligent Text Processing (CICLing 2018)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13397))

Abstract

The hypernymy detection problem aims to identify the “is-a" relation between words. This problem has recently received attention from researchers in the field of natural language processing because of its application to varied downstream tasks. So far, fairly effective methods for hypernymy detection in English have been reported. In Vietnamese, this problem has not been effectively solved. In this study, we applied a number of hypernymy detection methods based on word embeddings and supervised learning for Vietnamese. We propose an improvement on the dynamic weighting neural network model introduced by Luu Anh Tuan et al. [18] by weighting context words proportionally to the semantic similarity between them and the hypernym. Based on Vietnamese WordNet, three datasets for hypernymy detection were built. Experimental results showed that our proposal can increase the efficiency from 8% to 10% in terms of accuracy compared to the original method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    In this paper, we used ‘_’ characters to associate the syllables of a compound word in Vietnamese.

  2. 2.

    http://scikit-learn.org.

  3. 3.

    https://github.com/ahug/HypEval/tree/master/data.

  4. 4.

    https://github.com/BuiTan/Vds.

References

  1. Banerjee, S., Pedersen, T.: Extended gloss overlaps as a measure of semantic relatedness. In: Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence, pp. 805–810 (2003)

    Google Scholar 

  2. Banerjee, S., Pedersen, T.: An adapted lesk algorithm for word sense disambiguation using wordnet. In: Gelbukh, A. (ed.) CICLing 2002. LNCS, vol. 2276, pp. 136–145. Springer, Heidelberg (2002). https://doi.org/10.1007/3-540-45715-1_11

    Chapter  Google Scholar 

  3. Baroni, M., Bernardi, R., Do, N.Q., Chieh Shan, C.: Entailment above the word level in distributional semantics. In: Daelemans, W., Lapata, M., Màrquez, L. (eds.) EACL, pp. 23–32. The Association for Computer Linguistics (2012)

    Google Scholar 

  4. Baroni, M., Lenci, A.: How we BLESSed distributional semantic evaluation. In: Proceedings of the GEMS 2011 Workshop on GEometrical Models of Natural Language Semantics, pp. 1–10. Association for Computational Linguistics, Edinburgh, UK (2011). ‘aclanthology.org/W11-2501

  5. Biran, O., McKeown, K.R.: Classifying taxonomic relations between pairs of wikipedia articles. In: IJCNLP, pp. 788–794. Asian Federation of Natural Language Processing/ACL (2013)

    Google Scholar 

  6. Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)

    Article  Google Scholar 

  7. Dagan, I., Roth, D., Sammons, M., Zanzotto, F.M.: Recognizing Textual Entailment: Models and Applications: Synthesis Lectures on Human Language Technologies. Morgan & Claypool Publishers, San Rafael (2013)

    Book  Google Scholar 

  8. Fellbaum, C. (ed.): WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1998)

    Google Scholar 

  9. Fu, R., Guo, J., Qin, B., Che, W., Wang, H., Liu, T.: Learning semantic hierarchies via word embeddings. In: ACL, vol. 1, pp. 1199–1209. The Association for Computer Linguistics (2014)

    Google Scholar 

  10. Geffet, M., Dagan, I.: The distributional inclusion hypotheses and lexical entailment. In: Knight, K., Ng, H.T., Oflazer, K. (eds.) ACL, pp. 107–114. The Association for Computer Linguistics (2005)

    Google Scholar 

  11. Hearst, M.A.: Automatic acquisition of hyponyms from large text corpora. In: Proceedings of the 14th Conference on Computational Linguistics, COLING 1992, vol. 2, pp. 539–545. Association for Computational Linguistics, Stroudsburg (1992). https://doi.org/10.3115/992133.992154

  12. Kennard, N.N.: Learning hypernymy over word embeddings (2015)

    Google Scholar 

  13. Kotlerman, L., Dagan, I., Szpektor, I., Zhitomirsky-Geffet, M.: Directional distributional similarity for lexical inference. Nat. Lang. Eng. 16(4), 359–389 (2010)

    Article  Google Scholar 

  14. Kozareva, Z., Hovy, E.: Learning arguments and supertypes of semantic relations using recursive patterns. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 1482–1491. Association for Computational Linguistics, Uppsala (2010)

    Google Scholar 

  15. Lesk, M.: Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone. In: SIGDOC 1986: Proceedings of the 5th Annual International Conference on Systems Documentation, pp. 24–26. ACM, New York (1986). http://doi.acm.org/10.1145/318723.318728

  16. Levy, O., Remus, S., Biemann, C., Dagan, I.: Do supervised distributional methods really learn lexical inference relations? In: Mihalcea, R., Chai, J.Y., Sarkar, A. (eds.) HLT-NAACL, pp. 970–976. The Association for Computational Linguistics (2015)

    Google Scholar 

  17. Liu, Q., Jiang, H., Wei, S., Ling, Z.H., Hu, Y.: Learning semantic word embeddings based on ordinal knowledge constraints. In: ACL, vol. 1, pp. 1501–1511. The Association for Computer Linguistics (2015)

    Google Scholar 

  18. Luu, A.T., Tay, Y., Hui, S.C., Ng, S.K.: Learning term embeddings for taxonomic relation identification using dynamic weighting neural network. In: Su, J., Carreras, X., Duh, K. (eds.) EMNLP, pp. 403–413. The Association for Computational Linguistics (2016)

    Google Scholar 

  19. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient Estimation of Word Representations in Vector Space. In: Proceedings of the First International Conference on Learning Representations, ICLR 2013, pp. 1–13 (2013)

    Google Scholar 

  20. Navigli, R., Ponzetto, S.P.: Babelnet: building a very large multilingual semantic network. In: Hajic, J., Carberry, S., Clark, S. (eds.) ACL, pp. 216–225. The Association for Computer Linguistics (2010)

    Google Scholar 

  21. Navigli, R., Velardi, P., Faralli, S.: A graph-based algorithm for inducing lexical taxonomies from scratch. In: Walsh, T. (ed.) IJCAI, pp. 1872–1877. IJCAI/AAAI (2011)

    Google Scholar 

  22. Nguyen, T.P., Pham, V.L., Nguyen, H.A., Vu, H.H., Tran, N.A., Truong, T.T.H.: A two-phase approach for building Vietnamese WordNet. In: Proceedings of the 8th Global WordNet Conference (GWC), pp. 261–266. Global Wordnet Association, Bucharest (2016). https://aclanthology.org/2016.gwc-1.38/

  23. Roller, S., Erk, K., Boleda, G.: Inclusive yet selective: supervised distributional hypernymy detection. In: Hajic, J., Tsujii, J. (eds.) COLING, pp. 1025–1036. ACL (2014)

    Google Scholar 

  24. Seitner, J., et al.: A large database of hypernymy relations extracted from the web. In: Calzolari, N., et al. (eds.) LREC. European Language Resources Association (ELRA) (2016)

    Google Scholar 

  25. Snow, R., Jurafsky, D., Ng, A.Y.: Learning syntactic patterns for automatic hypernym discovery. In: NIPS, pp. 1297–1304 (2004)

    Google Scholar 

  26. Tan, B.V., Thai, N.P., Lam, P.V.: Construction of a word similarity dataset and evaluation of word similarity techniques for vietnamese. In: Nguyen, T.T., Le, A.P., Tojo, S., Nguyen, L.M., Phan, X.H. (eds.) KSE, pp. 65–70. IEEE (2017)

    Google Scholar 

  27. Tan, L., Gupta, R., van Genabith, J.: Usaar-wlv: hypernym generation with deep neural nets. In: Cer, D.M., Jurgens, D., Nakov, P., Zesch, T. (eds.) SemEval@NAACL-HLT, pp. 932–937. The Association for Computer Linguistics (2015)

    Google Scholar 

  28. Turney, P.D., Mohammad, S.M.: Experiments with three approaches to recognizing lexical entailment. Nat. Lang. Eng. 21(3), 437–476 (2015)

    Article  Google Scholar 

  29. Weeds, J., Clarke, D., Reffin, J., Weir, D.J., Keller, B.: Learning to distinguish hypernyms and co-hyponyms. In: Hajic, J., Tsujii, J. (eds.) COLING, pp. 2249–2259. ACL (2014)

    Google Scholar 

Download references

Acknowledgments

This paper is a part of project number KHCN-TB.23X/13-18 which is led by Assoc. Prof. Ngo Thanh Quy and funded by Vietnam National University, Hanoi under the Science and Technology Program for the Sustainable Development of Northwest Region.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Van-Tan Bui .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Bui, VT., Nguyen, PT., Pham, VL. (2023). Hypernymy Detection for Vietnamese Using Dynamic Weighting Neural Network. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2018. Lecture Notes in Computer Science, vol 13397. Springer, Cham. https://doi.org/10.1007/978-3-031-23804-8_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-23804-8_19

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-23803-1

  • Online ISBN: 978-3-031-23804-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics