Skip to main content
Log in

The potentials and limitations of modelling concept concreteness in computational semantic lexicons with dictionary definitions

  • Original Paper
  • Published:
Language Resources and Evaluation Aims and scope Submit manuscript

Abstract

This paper explores the feasibility of modelling concept concreteness perceived by humans and representing it in computational semantic lexicons, addressing an issue at the crossroads of computational linguistics, lexicography, and psycholinguistics. The inherent distinction between concrete words and abstract words in psychology has relied mostly on subjective human ratings. This practice is hardly scalable and does not consider the effect of polysemy. In view of this, we attempt to obtain a measure of concreteness from dictionary definitions comparable to human judgement, capitalising on conventional lexicographic assumptions and the regularities exhibited in the surface structures of sense definitions. The structural pattern of a definition is analysed and scored on a 7-point scale of concreteness ratings. The definition scores turned out to be quite effective for a dichotomous distinction between concrete and abstract concepts and more consistent with human ratings for the former. Beyond the two-way distinction, however, the results were more variable. The study has thus revealed the potentials and limitations of our approach, suggesting that different defining styles probably reflect the describability of concepts, and describability alone may not be sufficient for differentiating the degree of concreteness. The range of definition patterns has to be reconsidered, in combination with other inseparable factors constituting our perception of concreteness, for better modelling on a finer scale of concreteness distinction to enrich semantic lexicons for natural language processing.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Notes

  1. The three resources are sampled for the different and typical defining principles they represent. Other examples like the Oxford Advanced Learner’s Dictionary are equally usable.

  2. The conventional defining styles outlined in this section have assumed that words can be defined in isolation. This is particularly challenged in the COBUILD series of dictionaries, which highlights their full-sentence definitions embedding the typical syntactico-collocational context. See Hanks (1987) for a detailed account, and Rundell (2006) for its strengths and weaknesses.

  3. Other forms of noun definitions include if–then statements such as ‘If you say that something has X, you mean Y’, and those starting with ‘you’ explicitly telling how one uses the word, such as ‘You can use X in expressions such as …’. These forms are simply taken as Category 1 in Table 1 in the current study.

  4. The WordNet and LDOCE definitions were first made complete sentences in the form of ‘X is Y’ by adding the headword as subject such as ‘car is a motor vehicle …’ before feeding to the parser. The COBUILD definitions are already complete sentences, so they were directly parsed.

References

  • Amsler, R. (1981). A taxonomy for English nouns and verbs. In Proceedings of the 19th Annual Meeting of the Association for Computational Linguistics (ACL’81), Stanford, pp. 133–138.

  • Atkins, B. T. S., & Rundell, M. (2008). The oxford guide to practical lexicography. Oxford, UK: Oxford University Press.

    Google Scholar 

  • Bleasdale, F. A. (1987). Concreteness dependent associative priming: Separate lexical organization for concrete and abstract words. Journal of Experimental Psychology. Learning, Memory, and Cognition, 13, 582–594.

    Article  Google Scholar 

  • Chodorow, M. S., Byrd, R. J., & Heidorn, G. E. (1985). Extracting semantic hierarchies from a large on-line dictionary. In Proceedings of the 23rd Annual Meeting of the Association for Computational Linguistics (ACL’85), Chicago, pp. 299–304.

  • Fellbaum, C. (1998). WordNet: An electronic lexical database. Cambridge, MA: MIT Press.

    Google Scholar 

  • Hanks, P. (1987). Definitions and explanations. In J. M. Sinclair (Ed.), Looking up: An account of the COBUILD project in lexical computing (Chapter 6). London, UK: HarperCollins Publishers.

    Google Scholar 

  • Johansson, R., & Nugues, P. (2008). Dependency-based syntactic-semantic analysis with PropBank and NomBank. In Proceedings of the 12th Conference on Computational Natural Language Learning (CoNLL 2008), Manchester, pp. 183–187.

  • Kroll, J. F., & Merves, J. S. (1986). Lexical access for concrete and abstract words. Journal of Experimental Psychology. Learning, Memory, and Cognition, 12, 92–107.

    Article  Google Scholar 

  • Kucera, H., & Francis, W. N. (1967). Computational analysis of present-day American English. Providence: Brown University Press.

    Google Scholar 

  • Kwong, O.Y. (2012). Psycholinguistics, lexicography, and word sense disambiguation. In Proceedings of the 26th Pacific Asia Conference on Language, Information and Computation (PACLIC 26), Bali, Indonesia, pp. 408–417.

  • Landau, S. I. (2001). Dictionaries: The art and craft of lexicography. Cambridge, UK: Cambridge University Press.

    Google Scholar 

  • Paivio, A. (1986). Mental representations: A dual coding approach. Oxford, UK: Oxford University Press.

    Google Scholar 

  • Paivio, A., Yuille, J. C., & Madigan, S. A. (1968). Concreteness, imagery, and meaningfulness values for 925 nouns. Journal of Experimental Psychology, Monograph Supplement, 76(1, Pt.2), 1–25.

    Article  Google Scholar 

  • Rundell, M. (2006). More than one way to skin a cat: Why full-sentence definitions have not been universally adopted. In E. Corino, C. Marello, & C. Onesti (Eds.), Proceedings of the XII EURALEX International Congress (EURALEX 2006), Torino, Italy, pp. 323–337.

  • Schwanenflugel, P. J. (1991). Why are abstract concepts hard to understand? In P. J. Schwanenflugel (Ed.), The psychology of word meanings (pp. 223–250). Hillsdale, NJ: Lawrence Erlbaum Associates, Inc.

    Google Scholar 

  • Vossen, P., Meijs, W., & den Broeder, M. (1989). Meaning and structure in dictionary definitions. In B. Boguraev, & E. J. Briscoe (Eds.), Computational lexicography for natural language processing. London: Longman.

    Google Scholar 

  • Yore, L. D., & Ollila, L. O. (1985). Cognitive development, sex, and abstractness in grade one word recognition. Journal of Educational Research, 78, 242–247.

    Google Scholar 

Download references

Acknowledgments

The work described in this paper was partially supported by grants from the Research Grants Council of the Hong Kong Special Administrative Region, China (Project No. CityU 1508/06H), and the Department of Chinese, Translation and Linguistics of the City University of Hong Kong. The author would like to thank the anonymous reviewers for their valuable comments and suggestions.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Oi Yee Kwong.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kwong, O.Y. The potentials and limitations of modelling concept concreteness in computational semantic lexicons with dictionary definitions. Lang Resources & Evaluation 47, 1149–1161 (2013). https://doi.org/10.1007/s10579-013-9228-1

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10579-013-9228-1

Keywords

Navigation