Core-Periphery Organization of Graphemes in Written Sequences: Decreasing Positional Rigidity with Increasing Core Order

  • Conference paper
Computational Linguistics and Intelligent Text Processing (CICLing 2012)

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 7181)

Abstract

This paper analyzes the positional rigidity of graphemes (as well as of words treated as single units) in written sequences using complex network methodology. Specifically, information about the adjacent co-occurrence of graphemes in a corpus is used to construct a network whose nodes represent the distinct signs used. The core-periphery structure of this network is uncovered using a k-core decomposition technique suitably generalized for directed networks. This allows identification of a core signary or “graphem-ome” of the corresponding writing system, i.e., the group of frequently co-occurring graphemes. The distribution of the frequency with which such signs occur at different positions in a sequence (e.g., at the beginning, in the middle, or at the end) shows that while signs belonging to the periphery often occur only at specific positions, those in the innermost cores may occur at many different positions. This is quantified using a positional entropy measure, which shows a systematic increase with core order for the different databases used in this study (corpora of English, Chinese and Sumerian sentences, as well as a database of Indus civilization inscriptions).
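
The pipeline described in the abstract can be illustrated with a minimal Python sketch. The abstract does not specify how k-core decomposition is generalized to directed networks, nor the exact form of the positional entropy, so both are assumptions here: a node is kept in the k-core only if both its in-degree and out-degree within the remaining subgraph are at least k, and positional entropy is taken to be the Shannon entropy of a sign's occurrences over three relative position bins (beginning, middle, end). All function names are hypothetical and are not taken from the paper.

```python
from collections import Counter, defaultdict
from math import log2


def build_digraph(sequences):
    """Directed adjacency network: edge a -> b if b immediately follows a."""
    succ, pred = defaultdict(set), defaultdict(set)
    for seq in sequences:
        for a, b in zip(seq, seq[1:]):
            if a != b:  # self-loops are ignored in this sketch
                succ[a].add(b)
                pred[b].add(a)
    nodes = {sign for seq in sequences for sign in seq}
    return nodes, succ, pred


def core_order(nodes, succ, pred):
    """Assumed directed k-core: repeatedly peel nodes whose in-degree or
    out-degree (counted within the remaining subgraph) is below k."""
    remaining, order, k = set(nodes), {}, 0
    while remaining:
        k += 1
        changed = True
        while changed:
            changed = False
            for n in list(remaining):
                k_in = len(pred[n] & remaining)
                k_out = len(succ[n] & remaining)
                if min(k_in, k_out) < k:
                    remaining.discard(n)
                    order[n] = k - 1  # highest core the node belonged to
                    changed = True
    return order


def positional_entropy(sequences, n_bins=3):
    """Shannon entropy (bits) of each sign's relative-position distribution.
    The binning into n_bins equal slices is an assumption of this sketch."""
    counts = defaultdict(Counter)
    for seq in sequences:
        length = len(seq)
        for i, sign in enumerate(seq):
            counts[sign][min(i * n_bins // length, n_bins - 1)] += 1
    entropy = {}
    for sign, c in counts.items():
        total = sum(c.values())
        entropy[sign] = sum((v / total) * log2(total / v) for v in c.values())
    return entropy


def mean_entropy_by_core(order, entropy):
    """Average positional entropy of the signs sharing each core order."""
    groups = defaultdict(list)
    for sign, k in order.items():
        groups[k].append(entropy[sign])
    return {k: sum(vals) / len(vals) for k, vals in groups.items()}


# Toy data (not from the paper): each string is a sequence of signs.
toy = ["abc", "bca", "cab", "xab"]
nodes, succ, pred = build_digraph(toy)
order = core_order(nodes, succ, pred)
entropy = positional_entropy(toy)
means = mean_entropy_by_core(order, entropy)
```

On this toy input the sign `x`, which appears only at the start of one sequence, falls in the outermost shell with zero positional entropy, while `a`, `b` and `c` share a higher core order and are spread over several positions. This only illustrates the kind of trend the abstract reports; it says nothing about the paper's actual data or results.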

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ashraf, M.I., Sinha, S. (2012). Core-Periphery Organization of Graphemes in Written Sequences: Decreasing Positional Rigidity with Increasing Core Order. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2012. Lecture Notes in Computer Science, vol 7181. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28604-9_12

  • DOI: https://doi.org/10.1007/978-3-642-28604-9_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-28603-2

  • Online ISBN: 978-3-642-28604-9

  • eBook Packages: Computer Science, Computer Science (R0)
