Abstract
Correct understanding of metaphors is an integral part of natural language understanding. It requires, among other things, the ability to decide whether a given pair of sentences, the first of which contains a metaphor, forms a paraphrase pair. Although this decision task is formally analogous to a “traditional paraphrase detection” task, it requires a different approach. Recently, the first monolingual (English) corpus for metaphor paraphrasing was released, together with several baselines. In this work we shift this task to the cross-lingual level: we state the task of cross-lingual metaphor paraphrase detection, introduce a corresponding experimental cross-lingual (English-Czech) corpus, and present several approaches that set baselines for this challenging problem. This cross-lingual approach may help with tasks such as multi-document summarization over texts in different languages, and may also help improve information retrieval tools.
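To make the decision task concrete, the following minimal Python sketch frames it as binary classification over sentence pairs. Everything in it is an assumption made for illustration: the `MetaphorPair` schema, the example sentences, the toy vectors, and the cosine-threshold rule are not taken from the corpus or from the baselines presented in the chapter. The vectors stand in for the output of some multilingual sentence encoder, which is not shown.

```python
from dataclasses import dataclass
from math import sqrt
from typing import Sequence


@dataclass(frozen=True)
class MetaphorPair:
    """One candidate pair (hypothetical schema, not the corpus format)."""
    english_metaphor: str   # English sentence containing a metaphor
    czech_candidate: str    # Czech paraphrase candidate
    is_paraphrase: bool     # gold label


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity of two vectors; 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sqrt(sum(a * a for a in u))
    norm_v = sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def predict(u: Sequence[float], v: Sequence[float], threshold: float = 0.5) -> bool:
    """Label a pair as a paraphrase when the similarity reaches the threshold."""
    return cosine(u, v) >= threshold


def accuracy(predictions: Sequence[bool], gold: Sequence[bool]) -> float:
    """Fraction of predictions that match the gold labels."""
    if not gold:
        return 0.0
    hits = sum(p == g for p, g in zip(predictions, gold))
    return hits / len(gold)


if __name__ == "__main__":
    # Invented example pairs, for illustration only.
    pairs = [
        MetaphorPair("My lawyer is a shark.", "Můj právník je nemilosrdný.", True),
        MetaphorPair("My lawyer is a shark.", "Můj právník rád plave.", False),
    ]
    # Toy vectors standing in for encoder output: (metaphor, candidate) per pair.
    vectors = [
        ([1.0, 0.2], [0.9, 0.3]),
        ([1.0, 0.2], [-0.1, 1.0]),
    ]
    predictions = [predict(u, v) for u, v in vectors]
    gold = [pair.is_paraphrase for pair in pairs]
    print("accuracy:", accuracy(predictions, gold))
```

A similarity threshold is chosen here only because it is the simplest possible decision rule. The abstract itself notes that metaphor paraphrase detection calls for a different approach than traditional paraphrase detection, so a surface-similarity rule of this kind should be read as a way to show the input and output of the task, not as a recommended method.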
Acknowledgement
The author wants to express thanks to colleagues and friends who contributed to translations of the paraphrase candidates and anonymous referees for their valuable comments and suggestions.
This work contains materials and results achieved in the author’s PhD thesis, currently under review.
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Víta, M. (2020). Cross-lingual Metaphor Paraphrase Detection – Experimental Corpus and Baselines. In: Lopata, A., Butkienė, R., Gudonienė, D., Sukackė, V. (eds) Information and Software Technologies. ICIST 2020. Communications in Computer and Information Science, vol 1283. Springer, Cham. https://doi.org/10.1007/978-3-030-59506-7_28
DOI: https://doi.org/10.1007/978-3-030-59506-7_28
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-59505-0
Online ISBN: 978-3-030-59506-7
eBook Packages: Computer Science (R0)