Abstract
To better understand the distribution of words in all kinds of syntactic structures, the paper calculates the word distribution in syntactic structures of both English and Chinese. On the basis of the calculation, the article presents the definition of the words’ syntactic distribution complexity. After arranging the Chinese and English words according to their own syntactic distribution complexity, respectively, the Lotka phenomenon can be clearly attested by the results. The discovery made in the paper reveals the law of the words’ syntactic distribution in linguistic studies on one hand and the statistically proven fact that Chinese words’ syntax is much more complex than that of the English after comparing the Lotka phenomenon of both Chinese and English words’ syntactic distribution complexity on the other hand.
References
Chen, Y. (1988). Analysis of Lotka’s law: The Simon-Yule approach. Information Processing & Management, 25(05), 527–544.
Chen, X. (1999). Chinese words’ classes from the perspective of automatic syntactic analysis. Language Teaching and Research, 3, 63–72.
Dik, S. (1989). The theory of functional grammar. Dordrecht: Floris.
Egghe, L. (1987). An exact calculation of Price’s law for the law of Lotka. Scientometrics, 11(1–2), 81–97.
Egghe, L. (2005). Power laws in the information production process: Lotkaian informetrics. Bradford: Emerald Group Publishing Limited.
Egghe, L., & Rousseau, R. (1990). Introduction to Informetrics: quantitative methods in library, documentation and information science. Amsterdam: Elsevier.
Huber, J. (1997). The underlying process generating Lotka’sLaw and the statistics of exceedances. Information Processing & Management, 34(04), 471–487.
Kalampakas, A. (2007). The syntactic complexity of Eulerian graphs. Lecture Notes in Computer Science, 4728, 208–217.
Kretschmer, H. (1983). The reflection of LOTKA’s law in the structure of citations of a journal. Scientometric, 5(2), 85–92.
Kretschmera, H., & Kretschmerb, T. (2007). Lotka’s distribution and distribution of co-author pairs’ frequencies. Journal of Informetrics, 1(04), 308–337.
Lotka, A.-J. (1926). The frequency distribution of scientific productivity. Journal of the Washington Academy of Sciences, 12, 317–324.
Maienborn, C. (2001). On the position and interpretation of locative modifiers. Natural Language Semantics, 9(2), 191–240.
Pao, M. (1985). Lotka’s law: A testing procedure. Information Processing & Management, 21(04), 305–320.
Quirk, R., Sidney, G., et al. (1985). A comprehensive grammar of the English language. London: Longman.
Rao, R., & Kedage, I. (1988). Probability distributions and inequality measures for analyses of circulation data. Amsterdam: Elsevier.
Rousseau, R. (1990). Relations between continuous versions of bibliometric laws. Journal of the American Society for Information, 41(03), 197–203.
Rousseau, R., & Zhang, Q. (1992). Zipf’s data on the frequency of Chinese words revisited. Scientometrics, 124(2), 201–220.
Stepanov, A., & Tsa, W. D. (2008). Cartography and licensing of wh- adjuncts: a cross-linguistic perspective. Natural Language & Linguistic Theory, 26(3), 589–638.
Uzuner, Ö., & Katz, B. (2005). A comparative study of language models for book and author recognition. Lecture Notes in Computer Science, 3651, 969–980.
Zipf, G.-K. (1949). Human behaviour and the principle of least-effort. Cambridge, MA: Addison-Wesley.
Acknowledgments
This study was supported in part by a grant from the Research of Knowledge Mining Technology and application Based on Intelligent Information Process (Grant No. 08JJD870225) which is supported by the Foundation from Ministry of Education of China and the research of Automatic Acquisition of English–Chinese Parallel Pairs from Websites (Grant No. 2010CW02) which is Supported by the Scientific Research Foundation of Graduate School of Nanjing University. We would like to thank Dr Bin Li of the Department of Computer Science and Technology of Nanjing University, Professor Xiaohe Chen of the Department of Language Technology of Nanjing Normal University and Professor Boran Zhang and Xiangqing Wei of Center for Bilingual Dictionary Research of Nanjing University, for their data, academic insight and valuable comments.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Wang, D., Zhu, D. & Su, X. Lotka phenomenon in the words’ syntactic distribution complexity. Scientometrics 90, 483–498 (2012). https://doi.org/10.1007/s11192-011-0546-z
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11192-011-0546-z