Abstract
This paper addresses an important challenge in the automatic processing of written English text: understanding the semantics of noun compounds. Following Downing (1977) [1], we define noun compounds as sequences of nouns acting as a single noun, e.g., bee honey, apple cake, stem cell. In our view, they are best characterised by the set of all possible paraphrasing verbs that can connect the target nouns, with associated weights; e.g., malaria mosquito can be represented as carry (23), spread (16), cause (12), transmit (9), etc. These verbs are directly usable as paraphrases, and using several of them simultaneously yields an appealing fine-grained semantic representation.
In the present paper, we describe the process of constructing such representations for 250 noun-noun compounds previously proposed in the linguistic literature by Levi (1978) [2]. In particular, using human subjects recruited through the Amazon Mechanical Turk Web Service, we create a valuable manually annotated resource for noun compound interpretation, which we make publicly available in the hope of inspiring further research in paraphrase-based noun compound interpretation. We further perform a number of experiments, including a comparison to automatically generated weight vectors, in order to assess the quality of the dataset and the feasibility of using paraphrasing verbs to characterise the semantics of noun compounds; the results are quite promising.
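As an illustration of the representation described above, the following minimal sketch stores a noun compound as a weighted bag of paraphrasing verbs and compares two such bags. The human-derived weights for malaria mosquito are those quoted in the abstract; the "automatic" vector is invented purely for illustration, and the use of cosine similarity is an assumption, since the abstract does not say which measure is used for the comparison.

```python
from collections import Counter
from math import sqrt


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two verb-weight vectors (assumed measure)."""
    # Only verbs present in both vectors contribute to the dot product.
    dot = sum(a[v] * b[v] for v in a.keys() & b.keys())
    norm_a = sqrt(sum(w * w for w in a.values()))
    norm_b = sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty vector is treated as dissimilar to everything
    return dot / (norm_a * norm_b)


# Human-proposed paraphrasing verbs for "malaria mosquito" (from the abstract).
human = Counter({"carry": 23, "spread": 16, "cause": 12, "transmit": 9})

# Hypothetical automatically generated vector; these numbers are made up.
auto = Counter({"carry": 10, "transmit": 5, "bite": 3})

similarity = cosine(human, auto)
print(f"cosine(human, auto) = {similarity:.3f}")
```

A score close to 1 would indicate that the automatically generated verbs and their relative weights largely agree with those proposed by the human subjects.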
References
Downing, P.: On the creation and use of English compound nouns. Language (53), 810–842 (1977)
Levi, J.: The Syntax and Semantics of Complex Nominals. Academic Press, New York (1978)
Baldwin, T., Tanaka, T.: Translation by machine of compound nominals: Getting it right. In: Proceedings of the ACL 2004 Workshop on Multiword Expressions: Integrating Processing, pp. 24–31 (2004)
Warren, B.: Semantic patterns of noun-noun compounds. In: Gothenburg Studies in English 41, Göteborg, Acta Universitatis Gothoburgensis (1978)
Nastase, V., Szpakowicz, S.: Exploring noun-modifier semantic relations. In: Fifth International Workshop on Computational Semantics (IWCS-5), Tilburg, The Netherlands, pp. 285–301 (2003)
Girju, R., Moldovan, D., Tatu, M., Antohe, D.: On the semantics of noun compounds. Journal of Computer Speech and Language - Special Issue on Multiword Expressions 19(4), 479–496 (2005)
Rosario, B., Hearst, M.: Classifying the semantic relations in noun compounds via a domain-specific lexical hierarchy. In: Proceedings of EMNLP, pp. 82–90 (2001)
Lauer, M.: Designing Statistical Language Learners: Experiments on Noun Compounds. PhD thesis, Dept. of Computing, Macquarie University, Australia (1995)
Finin, T.: The Semantic Interpretation of Compound Nominals. PhD thesis, University of Illinois, Urbana, Illinois (1980)
Nakov, P., Hearst, M.: Using verbs to characterize noun-noun relations. In: Euzenat, J., Domingue, J. (eds.) AIMSA 2006. LNCS (LNAI), vol. 4183, pp. 233–244. Springer, Heidelberg (2006)
Nakov, P.: Using the Web as an Implicit Training Set: Application to Noun Compound Syntax and Semantics. PhD thesis, EECS Department, University of California, Berkeley, UCB/EECS-2007-173 (2007)
Nakov, P., Hearst, M.: Solving relational similarity problems using the web as a corpus. In: Proceedings of ACL 2008: HLT, Columbus, OH (2008)
Nakov, P.: Paraphrasing verbs for noun compound interpretation. In: Proceedings of the LREC 2008 Workshop: Towards a Shared Task for Multiword Expressions (MWE 2008), Marrakech, Morocco (2008)
Cohen, J.: Statistical Power Analysis for the Behavioral Sciences, 2nd edn. Lawrence Erlbaum Associates, Inc., Hillsdale (1988)
Nakov, P., Hearst, M.: UCB: System description for SemEval Task #4. In: Proceedings of SemEval, Prague, Czech Republic, pp. 366–369 (2007)
Grefenstette, G.: The World Wide Web as a resource for example-based machine translation tasks. In: Translating and the Computer 21: Proceedings of the 21st International Conference on Translating and the Computer (1999)
Tanaka, T., Baldwin, T.: Noun-noun compound machine translation: a feasibility study on shallow processing. In: Proceedings of the ACL Workshop on Multiword Expressions, pp. 17–24 (2003)
Callison-Burch, C., Koehn, P., Osborne, M.: Improved statistical machine translation using paraphrases. In: Proceedings of Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, New York, pp. 17–24 (2006)
Nakov, P.: Improved statistical machine translation using monolingual paraphrases. In: Proceedings of the European Conference on Artificial Intelligence (ECAI 2008), Patras, Greece (2008)
Nakov, P., Hearst, M.: UCB system description for the WMT 2007 shared task. In: Proceedings of the Second Workshop on Statistical Machine Translation, Prague, Czech Republic, June 2007, pp. 212–215 (2007)
Nakov, P.: Improving English-Spanish statistical machine translation: Experiments in domain adaptation, sentence paraphrasing, tokenization, and recasing. In: Proceedings of the Third Workshop on Statistical Machine Translation (WMT), Columbus, OH (2008)
Russo-Lassner, G., Lin, J., Resnik, P.: A paraphrase-based approach to machine translation evaluation. Technical Report LAMP-TR-125/CS-TR-4754/UMIACS-TR-2005-57, University of Maryland (August 2005)
Kauchak, D., Barzilay, R.: Paraphrasing for automatic evaluation. In: Proceedings Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, pp. 455–462 (2006)
Zhou, L., Lin, C.-Y., Hovy, E.: Re-evaluating machine translation results with paraphrase support. In: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006, pp. 77–84 (2006)
Tatu, M., Moldovan, D.: A semantic approach to recognizing textual entailment. In: Proceedings of HLT, pp. 371–378 (2005)
Zhai, C.: Fast statistical parsing of noun phrases for document indexing. In: Proceedings of the fifth conference on Applied natural language processing, pp. 312–319. Morgan Kaufmann Publishers Inc., San Francisco (1997)
Cafarella, M., Banko, M., Etzioni, O.: Relational Web search. Technical Report 2006-04-02, University of Washington, Department of Computer Science and Engineering (2006)
Girju, R., Nakov, P., Nastase, V., Szpakowicz, S., Turney, P., Yuret, D.: Semeval-2007 task 04: Classification of semantic relations between nominals. In: Proceedings of SemEval, Prague, Czech Republic, pp. 13–18 (2007)
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nakov, P. (2008). Noun Compound Interpretation Using Paraphrasing Verbs: Feasibility Study. In: Dochev, D., Pistore, M., Traverso, P. (eds) Artificial Intelligence: Methodology, Systems, and Applications. AIMSA 2008. Lecture Notes in Computer Science, vol 5253. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85776-1_10
DOI: https://doi.org/10.1007/978-3-540-85776-1_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85775-4
Online ISBN: 978-3-540-85776-1
eBook Packages: Computer Science (R0)