Abstract
This study relates to the assessment of the argument of the poverty of the stimulus in that we conducted a measure of the number of true proportional analogies between chunks in a language with case markers, Japanese. On a bicorpus of 20,000 sentences, we show that at least 96% of the analogies of form between chunks are also analogies of meaning, thus reporting the presence of at least two million true analogies between chunks in this corpus. As the number of analogies between chunks overwhelmingly surpasses the number of analogies between sentences by three orders of magnitude for this size of corpora, we conclude that proportional analogy is an efficient and undeniable structuring device between Japanese chunks.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Turney, P.D., Littman, M.: Corpus-based learning of analogies and semantic relations. Machine Learning 60(1–3), 251–278 (2005)
Turney, P.D.: A uniform approach to analogies, synonyms, antonyms, and associations. In: Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008), Manchester, UK, August 2008, pp. 905–912. Coling 2008 Organizing Committee (2008)
Stroppa, N., Yvon, F.: An analogical learner for morphological analysis. In: Proceedings of the 9th Conference on Computational Natural Language Learning (CoNLL 2005), Ann Arbor, MI, June 2005, pp. 120–127 (2005)
Claveau, V., L’Homme, M.C.: Terminology by analogy-based machine learning. In: Proceedings of the 7th International Conference on Terminology and Knowledge Engineering (TKE 2005), Copenhagen (2005)
Lepage, Y., Denoual, E.: Purest ever example-based machine translation: detailed presentation and assessment. Machine Translation 19, 251–282 (2005), http://dx.doi.org/10.1007/s10590-006-9010-x
Langlais, P., Patry, A.: Translating unknown words by analogical learning. In: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), pp. 877–886 (2006)
Anttila, R.: Analogy. In: Mouton – Trends in linguistics: state of the art reports 10, The Hague (1977)
Pullum, G.K., Scholtz, B.C.: Empirical assessment of stimulus poverty arguments. The Linguistic Review 19, 9–50 (2002)
Legate, J.A., Yang, C.D.: Empirical re-assessment of stimulus poverty arguments. The Linguistic Review 19, 151–162 (2002)
Scholtz, B.C., Pullum, G.K.: Searching for arguments to support linguistic nativism. The Linguistic Review 19, 185–224 (2002)
Chouinard, M.M., Clark, E.E.: Adult reformulations of child errors as negative evidence. Journal of Child Language 30, 637–669 (2003)
Pullum, G.K.: Generative grammar, pp. 340–343. The MIT Press, Cambridge (1999)
Eck, T., Hori, C.: Overview of the IWSLT 2005 evaluation campaign. In: Carnegie Mellon University (ed.): Proc. of the International Workshop on Spoken Language Translation, pp. 1–22 (2005)
Kudo, T., Matsumoto, Y.: Fast methods for kernel-based text analysis. In: Proceedings of ACL 2003, Sapporo, July 2003, pp. 24–31 (2003)
Lepage, Y.: Lower and higher estimates of the number of “true analogies” between sentences contained in a large multilingual corpus. In: Proceedings of COLING-2004, Geneva, August 2004, vol. 1, pp. 736–742 (2004)
Delhay, A., Miclet, L.: Analogical equations in sequences: Definition and resolution. In: Paliouras, G., Sakakibara, Y. (eds.) ICGI 2004. LNCS (LNAI), vol. 3264, pp. 127–138. Springer, Heidelberg (2004)
Allison, L., Dix, T.I.: A bit string longest common subsequence algorithm. Information Processing Letter 23, 305–310 (1986)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lepage, Y., Migeot, J., Guillerm, E. (2009). A Measure of the Number of True Analogies between Chunks in Japanese. In: Vetulani, Z., Uszkoreit, H. (eds) Human Language Technology. Challenges of the Information Society. LTC 2007. Lecture Notes in Computer Science(), vol 5603. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04235-5_14
Download citation
DOI: https://doi.org/10.1007/978-3-642-04235-5_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04234-8
Online ISBN: 978-3-642-04235-5
eBook Packages: Computer ScienceComputer Science (R0)