Skip to main content

Analogy as an Organizational Principle in the Construction of Large Knowledge-Bases

  • Chapter
  • First Online:
Computational Approaches to Analogical Reasoning: Current Trends

Part of the book series: Studies in Computational Intelligence ((SCI,volume 548))

Abstract

A capacity for analogy is an excellent acid test for the quality of a knowledge-base. A good knowledge-base should be balanced and coherent, so that its high-level generalities are systematically reflected in a variety of lower-level specializations. As such, we can expect a rich, well-structured knowledge-base to support a greater diversity of analogies than one that is imbalanced, disjoint or impoverished. We argue here that the converse is also true: when choosing from a large pool of candidate propositions, in which many propositions are invalid because they are extracted automatically from corpora or volunteered by untrained web-users, we should prefer those that are most likely to enhance the analogical productivity of the knowledge-base. We present a simple and efficient means of finding potential analogies within a large knowledge-base, using a corpus-constrained notion of pragmatic comparability rather than the typically less-constrained notion of semantic similarity. This allows us to empirically demonstrate, in the context of a substantial knowledge-base of simple generalizations automatically extracted from the Google n-grams, that knowledge acquisition proceeds at a significantly faster pace when candidate additions are prioritized according to their analogical potential.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Almuhareb, A., Massimo, P.: Attribute-based and value-based clustering: an evaluation. In: Proceedings of EMNLP, Empirical Methods in NLP, pp. 158–165 (2004)

    Google Scholar 

  2. Barsalou, L.W.: Ad hoc categories. Mem. Cogn. 11, 211–227 (1983)

    Article  Google Scholar 

  3. Brants, T., Franz, A.: Web 1T 5-gram Version 1. Linguistic Data Consortium, Philadelphia (2006)

    Google Scholar 

  4. Bron, C., Kerbosch, J.: Algorithm 457: finding all cliques of an undirected graph. Commun. ACM. 16(9), 575–577 (1973)

    Google Scholar 

  5. Budanitsky, A., Hirst, G.: Evaluating WordNet-based measures of lexical semantic relatedness. Comput. Linguist. 32(1), 13–47 (2006)

    Article  MATH  Google Scholar 

  6. Carlson, G.N., Pelletier, F. (eds.): The Generic Book. University of Chicago Press, Chicago (1995)

    Google Scholar 

  7. Falkenhainer, B., Forbus, K.D., Gentner, D.: Structure-mapping engine: algorithm and examples. Artif. Intell. 41, 1–63 (1989)

    MATH  Google Scholar 

  8. Fellbaum, C.: (ed.). WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1998)

    Google Scholar 

  9. Gentner, D.: Structure-mapping: a theoretical framework. Cogn. Sci. 7, 155–170 (1983)

    Article  Google Scholar 

  10. Hearst, M.: Automatic acquisition of hyponyms from large text corpora. In: Proceedings of the 14th International Conference on Computational Linguistics, pp. 539–545 (1992)

    Google Scholar 

  11. Hofstadter, D.R.: Tracking sentiment in mail: how genders differ on emotional axes. Fluid Concepts and Creative Analogies: Computer Models of the Fundamental Mechanisms of Thought. Basic Books, New York (1995)

    Google Scholar 

  12. Holyoak, K.J., Thagard, P.: Mental Leaps: Analogy in Creative Thought. Basic Books, Cambridge (1995)

    Google Scholar 

  13. Karypis, G.: CLUTO: a clustering toolkit. Technical Report 02–017. University of Minnesota. http://www-users.cs.umn.edu/karypis/cluto/ (2002)

  14. Lenat, D., Guha, R.V.: Building Large Knowledge-Based Systems. Addison Wesley, New York (1990)

    Google Scholar 

  15. Liu, H., Singh, P.: ConceptNet: a practical commonsense reasoning toolkit. BT Tech. J. 22(4), 211–226 (2004)

    Article  MathSciNet  Google Scholar 

  16. Miller, G.A., Charles, W.G.: Contextual correlates of semantic similarity. Lang. Cogn. Process. 6(1), 1–28 (1991)

    Article  Google Scholar 

  17. Nakov, P., Hearst, M.: Using verbs to characterize noun-noun relations. In: Artificial Intelligence: Methodology, Systems, and Applications, pp. 233–244 (2006)

    Google Scholar 

  18. Seco, N., Veale, T., Hayes, J.: An intrinsic information content metric for semantic similarity in WordNet. In: Proceedings of ECAI-2004, the 16th Annual Meeting of the European Association for, Artificial Intelligence (2004)

    Google Scholar 

  19. Speer, R., Havasi, C., Lieberman, H.: AnalogySpace: reducing the dimensionality of common sense knowledge. In: Proceedings of the 23rd AAAI Conference on Artificial Intelligence (2008)

    Google Scholar 

  20. Singh, P.: The public acquisition of commonsense knowledge. In: Proceedings of AAAI Spring Symposium on Acquiring (and Using) Linguistic (and World) Knowledge for Information Access, Palo Alto, CA (2002)

    Google Scholar 

  21. Tong, S., Dean, J.: System and methods for automatically creating lists. US Patent 7,350,187 (granted to Google, March 25, 2008)

    Google Scholar 

  22. Turney, P.D.: Measuring semantic similarity by latent relational analysis. In: Proceedings of IJCAI-2005, the 19th International Joint Conference on Artificial Intelligence, Edinburgh, pp. 1136–1141 (2005)

    Google Scholar 

  23. Turney, P.D.: A uniform approach to analogies, synonyms, antonyms, and associations. In: Proceedings of the 22nd International Conference on Computational Linguistics (COLING 2008), pp. 905–912, Manchester, UK (2008)

    Google Scholar 

  24. Veale, T., Keane, M.T.: The competence of sub-optimal structure mapping on ‘Hard Analogies’. In: Proceedings of ICJAI-1997, the 15th International Joint Conference on Artificial Intelligence, Nagoya, Japan (1997)

    Google Scholar 

  25. Veale, T.: WordNet sits the SAT: a knowledge-based approach to lexical analogy. In: Proceedings of ECAI-2004, the 16th Annual Meeting of the European Association for, Artificial Intelligence (2004)

    Google Scholar 

  26. Veale, T., Li, G.: Ontological cliques-analogy as an organizing principle in ontology construction. In: Proceedings of the International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, Madeira (2009)

    Google Scholar 

  27. Veale, T., Li, G.: Creating similarity: lateral thinking for vertical similarity judgments. In: Proceedings of ACL 2013, the 51st Annual Meeting of the Association for Computational Linguistics, Sofia, Bulgaria (2009)

    Google Scholar 

  28. Weeds, J., Weir, D.: Co-occurrence retrieval: a flexible framework for lexical distributional similarity. Comput. Linguist. 31(4), 433–475 (2005)

    Article  Google Scholar 

  29. Weigher, J.C., Zerbst, R.H.: The externalities of neighborhood parks: an empirical investigation. Land Econ. 49(1), 99–105 (1973)

    Article  Google Scholar 

Download references

Acknowledgments

This research was supported by the WCU (World Class University) program under the National Research Foundation of Korea, and funded by the Ministry of Education, Science and Technology of Korea (Project No: R31-30007).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tony Veale .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Veale, T., Li, G. (2014). Analogy as an Organizational Principle in the Construction of Large Knowledge-Bases. In: Prade, H., Richard, G. (eds) Computational Approaches to Analogical Reasoning: Current Trends. Studies in Computational Intelligence, vol 548. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54516-0_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-54516-0_4

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-54515-3

  • Online ISBN: 978-3-642-54516-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics