Abstract
The positional rigidity of graphemes (as well as words considered as single units) in written sequences has been analyzed in this paper using complex network methodology. In particular, the information about adjacent co-occurrence of graphemes in a corpus has been used to construct a network, where the nodes represent the distinct signs used. Core-periphery structure of this network has been uncovered using k-core decomposition technique suitably generalized for directed networks. This allows identification of a core signary or “graphem-ome” of the corresponding writing system, i.e., the group of frequently co-occurring graphemes. The distribution of the frequency with which such signs occur at different positions in a sequence (e.g., at the beginning or at the end or in the middle) shows that while signs belonging to the periphery often occur only at specific positions, those in the innermost cores may occur at many different positions. This is quantified by using a positional entropy measure that shows a systematic increase with core order for the different databases used in this study (corpus of English, Chinese and Sumerian sentences as well as a database of Indus civilization inscriptions).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Biemann, C., Quasthoff, U.: Networks generated from natural language text. In: Ganguly, N., et al. (eds.) Dynamics on and of Complex Networks, pp. 167–185. Birkhauser, Boston (2009)
Chatterjee, N., Sinha, S.: Understanding the mind of a worm. Progress in Brain Research 168, 145–153 (2007)
Choudhury, M., Mukherjee, A.: The structure and dynamics of linguistic networks. In: Ganguly, N., et al. (eds.) Dynamics on and of Complex Networks, pp. 145–166. Birkhauser, Boston (2009)
Dorogovtsev, S.N., Mendes, J.F.: Language as an evolving word web. Proceedings of the Royal Society of London B 268(1485), 2603–2606 (2001)
Ferrer i Cancho, R., Sole, R.V.: The small world of human language. Proceedings of the Royal Society of London B 268(1482), 2261–2265 (2001)
Fuls, A.: Entwicklung einer geographisch-epigraphischen datenbank der indusschrift. In: Weisbruch, S., Kaden, R. (eds.) Entwicklerforum Geodäsie und Geoinformationstechnik 2010. Technische Universität, Berlin (2010)
Holme, P.: Core-periphery organization of complex networks. Physical Review E 72(4), 046111(1-4) (2005)
Lamb, S.M.: Linguistic and cognitive networks. In: Garvin, P. (ed.) Cognition: A Multiple View, pp. 195–222. Spartan Books, New York (1970)
Palaima, T.G., Pope, E.I., Kent Reilly, F.: Unlocking the secrets of ancient writing. Catalogue of an exhibition in conjunction with the 11th International Mycenological Colloqium. The University of Texas at Austin (2000)
Parpola, A.: Deciphering the Indus Script. Cambridge University Press, Cambridge (1994)
Saha Roy, R., Ganguly, N., Chowdhury, M., Singh, N.K.: Complex network analysis reveals kernel-periphery structure in web search queries. In: 2nd International ACM SIGIR Workshop on Query Representation and Understanding (QRU 2011), pp. 5–8 (2011)
Sinha, S., Izhar, A.M., Pan, R.K., Wells, B.K.: Network analysis of a corpus of undeciphered Indus civilization inscriptions indicates syntactic organization. Computer Speech and Language 25(3), 639–654 (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ashraf, M.I., Sinha, S. (2012). Core-Periphery Organization of Graphemes in Written Sequences: Decreasing Positional Rigidity with Increasing Core Order. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2012. Lecture Notes in Computer Science, vol 7181. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28604-9_12
Download citation
DOI: https://doi.org/10.1007/978-3-642-28604-9_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28603-2
Online ISBN: 978-3-642-28604-9
eBook Packages: Computer ScienceComputer Science (R0)