Lotka phenomenon in the words’ syntactic distribution complexity

Wang, Dongbo; Zhu, Danhao; Su, Xinning

doi:10.1007/s11192-011-0546-z

Published: 08 November 2011

Volume 90, pages 483–498, (2012)

Scientometrics

Abstract

To better understand how words are distributed across syntactic structures, the paper calculates the distribution of words over the syntactic structures of both English and Chinese. On the basis of this calculation, it defines the words’ syntactic distribution complexity. When the Chinese words and the English words are each ranked by their syntactic distribution complexity, the results clearly exhibit the Lotka phenomenon. The finding reveals a law governing the words’ syntactic distribution and, by comparing the Lotka phenomenon in the two languages, shows statistically that the syntax of Chinese words is much more complex than that of English words.
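
The abstract does not reproduce the paper’s definition of syntactic distribution complexity or its fitting procedure, so the following is only a minimal sketch under stated assumptions. It assumes, hypothetically, that a word’s complexity is the number of distinct syntactic structures it occurs in, and it checks for a Lotka-type pattern by fitting f(x) = C / x^alpha, where f(x) is the number of words with complexity x, using ordinary least squares on the log-log counts. The function names, the structure labels and the synthetic data are illustrative and are not taken from the paper.

```python
import math
from collections import Counter, defaultdict


def distribution_complexity(observations):
    """Map each word to the number of distinct structures it occurs in.

    `observations` is an iterable of (word, structure_label) pairs.
    This count is an assumed proxy, not the paper's own definition.
    """
    structures = defaultdict(set)
    for word, structure in observations:
        structures[word].add(structure)
    return {word: len(labels) for word, labels in structures.items()}


def fit_lotka(complexities):
    """Fit log f(x) = log C - alpha * log x by least squares.

    f(x) is the number of words whose complexity equals x.
    Returns the pair (alpha, C).
    """
    counts = Counter(complexities.values())
    if len(counts) < 2:
        raise ValueError("need at least two distinct complexity values")
    xs = [math.log(x) for x in counts]
    ys = [math.log(counts[x]) for x in counts]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return -slope, math.exp(intercept)


def synthetic_observations():
    """Build made-up data whose counts follow 36 / x**2 exactly."""
    table = [(1, 36), (2, 9), (3, 4), (6, 1)]  # (complexity, word count)
    observations = []
    for x, n_words in table:
        for i in range(n_words):
            for s in range(x):
                observations.append((f"w{x}_{i}", f"S{s}"))
    return observations


if __name__ == "__main__":
    complexities = distribution_complexity(synthetic_observations())
    alpha, c = fit_lotka(complexities)
    # The synthetic counts were built with alpha = 2 and C = 36.
    print(f"alpha = {alpha:.3f}, C = {c:.3f}")
```

On real corpus data a log-log least-squares fit is only a rough first check; the estimated exponent would then be compared between the Chinese and the English word lists, which is the kind of comparison the abstract describes.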

Acknowledgments

This study was supported in part by the project Research of Knowledge Mining Technology and Application Based on Intelligent Information Process (Grant No. 08JJD870225), funded by the Foundation of the Ministry of Education of China, and by the project Automatic Acquisition of English–Chinese Parallel Pairs from Websites (Grant No. 2010CW02), funded by the Scientific Research Foundation of the Graduate School of Nanjing University. We would like to thank Dr Bin Li of the Department of Computer Science and Technology of Nanjing University, Professor Xiaohe Chen of the Department of Language Technology of Nanjing Normal University, and Professor Boran Zhang and Xiangqing Wei of the Center for Bilingual Dictionary Research of Nanjing University for their data, academic insight and valuable comments.

Author information

Authors and Affiliations

Department of Information Management, Nanjing University, Nanjing, China
Dongbo Wang, Danhao Zhu & Xinning Su

Corresponding author

Correspondence to Dongbo Wang.

About this article

Cite this article

Wang, D., Zhu, D. & Su, X. Lotka phenomenon in the words’ syntactic distribution complexity. Scientometrics 90, 483–498 (2012). https://doi.org/10.1007/s11192-011-0546-z

Received: 07 May 2011
Published: 08 November 2011
Issue Date: February 2012
DOI: https://doi.org/10.1007/s11192-011-0546-z
