Abstract
The purpose of this chapter is to show how it is possible to efficiently extract the structure of a set of objects by use of the notion of proportional analogy. As a proportional analogy involves four objects, the very naïve approach to the problem, has basically a complexity of \(O(n^4)\) for a given set of \(n\) objects. We show, under some conditions on proportional analogy, how to reduce this complexity to \(O(n^2)\) by considering an equivalent problem, that of enumerating analogical clusters that are informative and not redundant. We further show how some improvements make the task tractable. We illustrate our technique with a task related with natural language processing, that of clustering Chinese characters. In this way, we re-discover the graphical structure of these characters.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.$$\begin{aligned} {\text {5,000}}^4 \times 0.8~\text {ms}/8&> 5^4 \times 10^9 \times 0.1\,\text {s} \\&> 125 \times 10^{8}\,\text {s} \\&> 1250 \times 10^{7}/( 3.1563 \times 10^7 )\;\text {years} \\&> 394.2\,\text {years}. \\ \end{aligned}$$
- 2.
This comes from the fact that some analogies between strings of characters admit multiple solutions. When this is the case, then, there is not transitivity for : : in the general case for the objects considered (see [18, p. 113]).
- 3.
Font designed by Nagao Sadakazu (snagao@tkb.att.ne.jp). We use version 1.1 of 1999.
- 4.
We use a machine with 4 Gb memory equipped with an Intel Core i5 processor at 1.7 GHz.
- 5.
- 6.
- 7.
Reference [25] is the first mention of the edit distance constraint in terms of similarities; [2] gives the equivalent expression with edit distances; [17] is the published form of the proceedings in which [2] appeared, with few years delay. The edit distance constraint is necessary between strings of symbols to avoid many spurious analogies that would be formed without it.
References
Gentner, D.: Structure mapping: a theoretical model for analogy. Cogn. Sci. 7(2), 155–170 (1983)
Lepage, Y.: Analogy and formal languages. In: Proceedings of FG/MOL 2001, Helsinki, pp. 1–12 (2001)
Yvon, F., Stroppa, N., Miclet, L., Delhay, A.: Solving analogical equations on words. Rapport Technique ENST2004D005, ENST (2004)
Hoffman, R.R.: Monster analogies. AI Mag. 11, 11–35 (1995)
Lepage, Y.: Lower and higher estimates of the number of “true analogies” between sentences contained in a large multilingual corpus. In: Proceedings of COLING-2004, vol. 1, pp. 736–742. Geneva (2004)
Lepage, Y., Migeot, J., Guillerm, E.: A corpus study on the number of true proportional analogies between chunks in two typologically different languages. In: Proceedings of the Seventh International Symposium on Natural Language Processing (SNLP 2007), pp. 117-122. Kasetsart University, Pattaya, Thailand, ISBN 978-974-623-062-9 (2007)
Lepage, Y., Migeot, J., Guillerm, E.: A measure of the number of true analogies between chunks in Japanese. Lect. Notes Artif. Intell. 5603, 154–164 (2009)
Veale, T., Chen, S.: Learning to extract semantic content from the orthographic structure of Chinese words. In: Proceedings of the 17th Irish Conference on Artificial Intelligence and Cognitive Science (AICS2006) (2006)
Paul, H.: Prinzipien der Sprachgeschichte. Niemayer, Tübingen (1920)
Varro, M.T.: De lingua latina. Coll. Belles-lettres. Trad. J. Collart., Paris (1954)
Turney, P.D., Littman, M.L.: Corpus-based learning of analogies and semantic relations. Mach. Learn. 60(1–3), 251–278 (2005)
Turney, P.D.: Similarity of semantic relations. Comput. Linguist. 32(2), 379–416 (2006)
Turney, P.: A uniform approach to analogies, synonyms, antonyms, and associations. In: Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008), pp. 905–912. Coling 2008 Organizing Committee, Manchester, UK (2008)
Itkonen, E.: Iconicity, analogy, and universal grammar. J. Pragmat. 22(1), 37–53 (1994)
Yencken, L., Baldwin, T.: Measuring and predicting orthographic associations: modelling the similarity of Japanese kanji. In: Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008), pp. 1041–1048. Coling 2008 Organizing Committee, Manchester, UK (2008)
Matsushita, K., Lepage, Y.: Rediscovering the structure of Chinese characters using analogy-based methods (in Japanese). In: Proceedings of the 18th Japanese National Conference in Natural Language Processing, pp. 438–441. Nagoya (2013)
Lepage, Y.: Analogy and formal languages. Electron. Notes Theor. Comput. Sci. 53, 180–191 (2004)
Lepage, Y.: Of that kind of analogies capturing linguistic commutations (in French). Habilitation thesis, Joseph Fourier Grenoble University (2003)
Lepage, Y., Goh, C.: Towards automatic acquisition of linguistic features. In: Jokinen, K., Bick, E. (eds.) Proceedings of the 17th Nordic Conference on Computational Linguistics (NODALIDA 2009), pp. 118–125. Odense (2009)
Langlais, P., Yvon, F.: Scaling up analogical learning. In: Coling 2008: Companion Volume: Posters, pp. 51–54. Coling 2008 Organizing Committee, Manchester, UK (2008)
Finkel, R., Bentley, J.: Quad trees: a data structure for retrieval on composite keys. Acta Informatica 4(1), 1–9 (1974)
Lepage, Y., Gosme, J., Lardilleux, A.: Estimating the proximity between languages by their commonality in vocabulary structures. In: Lecture Notes in Artificial Intelligence Human Language Technology—Challenges for Computer Science and Linguistics, pp. 127–138. (2011)
Lepage, Y., Denoual, E.: Purest ever example-based machine translation: detailed presentation and assessment. Mach. Transl. 19, 251–282 (2005)
Lepage, Y., Denoual, E.: Automatic generation of paraphrases to be used as translation references in objective evaluation measures of machine translation. In: Proceedings of the Third International Workshop on Paraphrasing (IWP 2005), pp. 57–64. Jeju (2005)
Lepage, Y.: Languages of analogical strings. In: Proceedings of COLING-2000, vol. 1, pp. 488–494. Saarbrücken (2000)
Croft, W.: Radical Construction Grammar: Syntactic Theory in Typological Perspective. Oxford Linguistics. Oxford University Press, Oxford (2001)
Itkonen, E.: Analogy as structure and process: approaches in linguistics, cognitive psychology and philosophy of science. In: Dascal, M., Gibbs, R.W., Nuyts, J. (eds.) Human Cogntive Processing, vol. 14, p. 250. John Benjamins Publishing Company, Amsterdam/Philadelphia (2005)
Hofstadter, D.: The Fluid Analogies Research Group: Fluid Concepts and Creative Analogies. Basic Books, New York (1994)
Lepage, Y.: Solving analogies on words: an algorithm. In: Proceedings of COLING-ACL’98, vol. I, pp. 728–735. Montréal (1998)
Correa, W., Prade, H., Richard, G.: When intelligence is just a matter of copying. In: Proceedings of the 20th European Conference on Artificial Intelligence (ECAI 2012), pp. 276–281 (2012)
Matsushita, K.: Data processing of Chinese Hanzi by proportional analogy and verification of learning efficiency by subjects (in Japanese). Master’s thesis, Graduate School of Information, Production and Systems, Waseda University (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Lepage, Y. (2014). Analogies Between Binary Images: Application to Chinese Characters. In: Prade, H., Richard, G. (eds) Computational Approaches to Analogical Reasoning: Current Trends. Studies in Computational Intelligence, vol 548. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54516-0_2
Download citation
DOI: https://doi.org/10.1007/978-3-642-54516-0_2
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-54515-3
Online ISBN: 978-3-642-54516-0
eBook Packages: EngineeringEngineering (R0)