Abstract
A capacity for analogy is an excellent acid test for the quality of a knowledge-base. A good knowledge-base should be balanced and coherent, so that its high-level generalities are systematically reflected in a variety of lower-level specializations. As such, we can expect a rich, well-structured knowledge-base to support a greater diversity of analogies than one that is imbalanced, disjoint or impoverished. We argue here that the converse is also true: when choosing from a large pool of candidate propositions, in which many propositions are invalid because they are extracted automatically from corpora or volunteered by untrained web-users, we should prefer those that are most likely to enhance the analogical productivity of the knowledge-base. We present a simple and efficient means of finding potential analogies within a large knowledge-base, using a corpus-constrained notion of pragmatic comparability rather than the typically less-constrained notion of semantic similarity. This allows us to empirically demonstrate, in the context of a substantial knowledge-base of simple generalizations automatically extracted from the Google n-grams, that knowledge acquisition proceeds at a significantly faster pace when candidate additions are prioritized according to their analogical potential.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Almuhareb, A., Massimo, P.: Attribute-based and value-based clustering: an evaluation. In: Proceedings of EMNLP, Empirical Methods in NLP, pp. 158–165 (2004)
Barsalou, L.W.: Ad hoc categories. Mem. Cogn. 11, 211–227 (1983)
Brants, T., Franz, A.: Web 1T 5-gram Version 1. Linguistic Data Consortium, Philadelphia (2006)
Bron, C., Kerbosch, J.: Algorithm 457: finding all cliques of an undirected graph. Commun. ACM. 16(9), 575–577 (1973)
Budanitsky, A., Hirst, G.: Evaluating WordNet-based measures of lexical semantic relatedness. Comput. Linguist. 32(1), 13–47 (2006)
Carlson, G.N., Pelletier, F. (eds.): The Generic Book. University of Chicago Press, Chicago (1995)
Falkenhainer, B., Forbus, K.D., Gentner, D.: Structure-mapping engine: algorithm and examples. Artif. Intell. 41, 1–63 (1989)
Fellbaum, C.: (ed.). WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1998)
Gentner, D.: Structure-mapping: a theoretical framework. Cogn. Sci. 7, 155–170 (1983)
Hearst, M.: Automatic acquisition of hyponyms from large text corpora. In: Proceedings of the 14th International Conference on Computational Linguistics, pp. 539–545 (1992)
Hofstadter, D.R.: Tracking sentiment in mail: how genders differ on emotional axes. Fluid Concepts and Creative Analogies: Computer Models of the Fundamental Mechanisms of Thought. Basic Books, New York (1995)
Holyoak, K.J., Thagard, P.: Mental Leaps: Analogy in Creative Thought. Basic Books, Cambridge (1995)
Karypis, G.: CLUTO: a clustering toolkit. Technical Report 02–017. University of Minnesota. http://www-users.cs.umn.edu/karypis/cluto/ (2002)
Lenat, D., Guha, R.V.: Building Large Knowledge-Based Systems. Addison Wesley, New York (1990)
Liu, H., Singh, P.: ConceptNet: a practical commonsense reasoning toolkit. BT Tech. J. 22(4), 211–226 (2004)
Miller, G.A., Charles, W.G.: Contextual correlates of semantic similarity. Lang. Cogn. Process. 6(1), 1–28 (1991)
Nakov, P., Hearst, M.: Using verbs to characterize noun-noun relations. In: Artificial Intelligence: Methodology, Systems, and Applications, pp. 233–244 (2006)
Seco, N., Veale, T., Hayes, J.: An intrinsic information content metric for semantic similarity in WordNet. In: Proceedings of ECAI-2004, the 16th Annual Meeting of the European Association for, Artificial Intelligence (2004)
Speer, R., Havasi, C., Lieberman, H.: AnalogySpace: reducing the dimensionality of common sense knowledge. In: Proceedings of the 23rd AAAI Conference on Artificial Intelligence (2008)
Singh, P.: The public acquisition of commonsense knowledge. In: Proceedings of AAAI Spring Symposium on Acquiring (and Using) Linguistic (and World) Knowledge for Information Access, Palo Alto, CA (2002)
Tong, S., Dean, J.: System and methods for automatically creating lists. US Patent 7,350,187 (granted to Google, March 25, 2008)
Turney, P.D.: Measuring semantic similarity by latent relational analysis. In: Proceedings of IJCAI-2005, the 19th International Joint Conference on Artificial Intelligence, Edinburgh, pp. 1136–1141 (2005)
Turney, P.D.: A uniform approach to analogies, synonyms, antonyms, and associations. In: Proceedings of the 22nd International Conference on Computational Linguistics (COLING 2008), pp. 905–912, Manchester, UK (2008)
Veale, T., Keane, M.T.: The competence of sub-optimal structure mapping on ‘Hard Analogies’. In: Proceedings of ICJAI-1997, the 15th International Joint Conference on Artificial Intelligence, Nagoya, Japan (1997)
Veale, T.: WordNet sits the SAT: a knowledge-based approach to lexical analogy. In: Proceedings of ECAI-2004, the 16th Annual Meeting of the European Association for, Artificial Intelligence (2004)
Veale, T., Li, G.: Ontological cliques-analogy as an organizing principle in ontology construction. In: Proceedings of the International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, Madeira (2009)
Veale, T., Li, G.: Creating similarity: lateral thinking for vertical similarity judgments. In: Proceedings of ACL 2013, the 51st Annual Meeting of the Association for Computational Linguistics, Sofia, Bulgaria (2009)
Weeds, J., Weir, D.: Co-occurrence retrieval: a flexible framework for lexical distributional similarity. Comput. Linguist. 31(4), 433–475 (2005)
Weigher, J.C., Zerbst, R.H.: The externalities of neighborhood parks: an empirical investigation. Land Econ. 49(1), 99–105 (1973)
Acknowledgments
This research was supported by the WCU (World Class University) program under the National Research Foundation of Korea, and funded by the Ministry of Education, Science and Technology of Korea (Project No: R31-30007).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Veale, T., Li, G. (2014). Analogy as an Organizational Principle in the Construction of Large Knowledge-Bases. In: Prade, H., Richard, G. (eds) Computational Approaches to Analogical Reasoning: Current Trends. Studies in Computational Intelligence, vol 548. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54516-0_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-54516-0_4
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-54515-3
Online ISBN: 978-3-642-54516-0
eBook Packages: EngineeringEngineering (R0)