Abstract
Japanese onomatopoeias are very difficult for machines to recognize and translate into other languages due to their uniqueness. In particular, onomatopoeias that convey several meanings are very confusing for machine translation systems to distinguish and translate correctly. In this paper, we discuss what features are helpful in order to automatically disambiguate the meaning of onomatopoeias that have two different meanings. We used nouns, adjectives, and verbs extracted from sentences as features, then carried out a machine learning classification analysis and compared the accuracy of how well these features differentiate two meanings of ambiguous onomatopoeias. As a result, we discovered that employing a combination of machine learning with nouns and verbs as a feature achieved accuracy of above 80 points. In addition, we were able to improve the accuracy by excluding pronouns and proper nouns and also by limiting verbs to those that are modified by onomatopoeias. In future, we plan to concentrate on dependency between verbs that are modified by onomatopoeia and nouns, as we believe that this approach will help machine translation to translate Japanese onomatopoeias correctly.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Tokyo Olympics, http://tokyo2020.jp/
Shimizu, Y., Doizaki, R., Sakamoto, M.: A System to Estimate an Impression Conveyed by Onomatopoeia. Transactions of the Japanese Society for Artificial Intelligence 29(1), 41–52 (2014) (in Japanese)
Asaga, C., Mukarramah, Y., Watanabe, C.: ONOMATOPEDIA: Onomatopoeia online example dictionary system extracted from data on the web. In: Zhang, Y., et al. (eds.) APWeb 2008. LNCS, vol. 4976, pp. 601–612. Springer, Heidelberg (2008)
Furutake, Y., Sato, S., Komatani, K.: Onomatope wo iikaeru hyougen no jidoushushu ( Automatic Collection of Expression in Other Words of Onomatopoeias). In: Proceeding of the 17th.Annual Meeting of the Association for Natural Language Processing, pp. 904–907 (2011) (in Japanese)
Uchida, Y., Araki, K., Yoneyama, J.: Semantic Ambiguity of Onomatopoeia Extracted from Blog Entries. In: Proceeding of the 27th Fuzzy System Symposium, pp. 853–856 (2011) (in Japanese)
Twitter, https://twitter.com/
Kudo, T., et al.: MeCab, Yet Another Part-of-Speech and Morphological Analyzer, http://mecab.sourceforge.net/
Joachims, T.: SVM-Light, http://svmlight.joachims.org/
Agency for Cultural Affairs, gaikokujin no tame no kihongoyourei jitenn (Basic Word Usage Dictionary for Foreigners), National Printing Bureau (1990)
Google, https://www.google.co.jp/
Uchida, Y., Araki, K., Yoneyama, J.: Affect Analysis of Onomatopoeia Sentences Extracted from Blog Entries. In: Proceeding of the 10th Forum on Information Technology, pp. 274–279 (2011) (in Japanese)
Ameba, http://www.ameba.jp/
CaboCha, Yet Another Japanese Dependency Structure Aalyser, http://code.google.com/p/cabocha/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Fukushima, H., Araki, K., Uchida, Y. (2014). Disambiguation of Japanese Onomatopoeias Using Nouns and Verbs. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2014. Lecture Notes in Computer Science(), vol 8655. Springer, Cham. https://doi.org/10.1007/978-3-319-10816-2_18
Download citation
DOI: https://doi.org/10.1007/978-3-319-10816-2_18
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10815-5
Online ISBN: 978-3-319-10816-2
eBook Packages: Computer ScienceComputer Science (R0)