Embedding Linguistic Features in Word Embedding for Preposition Sense Disambiguation in English—Malayalam Machine Translation Context

  • Chapter
Recent Advances in Computational Intelligence

Part of the book series: Studies in Computational Intelligence (SCI, volume 823)

Abstract

Preposition sense disambiguation is highly significant in natural language processing tasks such as machine translation. Transferring the various senses of a simple preposition in the source language to a set of senses in the target language is complex because of the many-to-many relationships involved, particularly in English-Malayalam machine translation. To reduce this complexity, in this paper we use linguistic information, namely the noun class and verb class features of the noun and verb associated with the target simple preposition. The effect of these linguistic features on the correct classification of the senses (postpositions in Malayalam) is studied with several machine learning algorithms. The study showed that classification accuracy is higher when both verb and noun class features are taken into consideration. In linguistics, the major factor that decides the sense of a preposition is the noun in the prepositional phrase. The same trend was observed in the study when the training data contained only noun class features; that is, noun class features dominate verb class features.
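The general idea of appending noun and verb class features to a word embedding before classification can be sketched roughly as follows. This is only an illustrative sketch, not the chapter's implementation: the class inventories, the two-dimensional embedding, the sense labels `SENSE_A` and `SENSE_B`, and the toy samples are all hypothetical, and a simple nearest-centroid rule stands in for the machine learning algorithms the abstract mentions without naming.

```python
from collections import defaultdict

# Hypothetical class inventories; the chapter's actual classes are not given here.
NOUN_CLASSES = ["place", "instrument", "time"]
VERB_CLASSES = ["motion", "action", "state"]


def one_hot(value, inventory):
    """Encode a class label as a one-hot vector over the given inventory."""
    return [1.0 if value == item else 0.0 for item in inventory]


def build_features(embedding, noun_class=None, verb_class=None):
    """Concatenate an embedding with optional noun/verb class one-hot vectors."""
    features = list(embedding)
    if noun_class is not None:
        features += one_hot(noun_class, NOUN_CLASSES)
    if verb_class is not None:
        features += one_hot(verb_class, VERB_CLASSES)
    return features


def train_centroids(samples):
    """Average the feature vectors of each sense label (nearest-centroid model)."""
    sums, counts = {}, defaultdict(int)
    for vec, label in samples:
        if label not in sums:
            sums[label] = [0.0] * len(vec)
        sums[label] = [s + v for s, v in zip(sums[label], vec)]
        counts[label] += 1
    return {label: [s / counts[label] for s in total]
            for label, total in sums.items()}


def predict(centroids, vec):
    """Return the sense label whose centroid is closest in squared distance."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(centroids, key=lambda label: sq_dist(centroids[label], vec))


# Made-up 2-d embedding for one preposition; real embeddings are much larger.
PREP_EMBEDDING = [0.2, -0.1]

# Toy training samples: (feature vector, placeholder sense label).
train = [
    (build_features(PREP_EMBEDDING, "place", "motion"), "SENSE_A"),
    (build_features(PREP_EMBEDDING, "place", "state"), "SENSE_A"),
    (build_features(PREP_EMBEDDING, "instrument", "action"), "SENSE_B"),
    (build_features(PREP_EMBEDDING, "instrument", "motion"), "SENSE_B"),
]

centroids = train_centroids(train)
query = build_features(PREP_EMBEDDING, "place", "action")
predicted = predict(centroids, query)
print(predicted)
```

Calling `build_features` with only `noun_class` or only `verb_class` yields the reduced feature sets, which is one way the three configurations compared in the abstract (noun only, verb only, both) could be set up.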

Author information

Corresponding author

Correspondence to B. Premjith.

Copyright information

© 2019 Springer Nature Switzerland AG

About this chapter

Cite this chapter

Premjith, B., Soman, K.P., Anand Kumar, M., Jyothi Ratnam, D. (2019). Embedding Linguistic Features in Word Embedding for Preposition Sense Disambiguation in English—Malayalam Machine Translation Context. In: Kumar, R., Wiil, U. (eds) Recent Advances in Computational Intelligence. Studies in Computational Intelligence, vol 823. Springer, Cham. https://doi.org/10.1007/978-3-030-12500-4_20
