Abstract
An interesting way to analyse the authorship authenticity of a document, is the use of stylometry. However, the use of conventional features and classifiers has some disadvantages such as the automatic authorship decision, which usually gives us a speechless authorship classification without (often) any way to measure or interpret the consistency of the results.
In this paper, we present a visual analytics based approach for the task of authorship discrimination. A specific application is dedicated to the authorship comparison between two ancient religious books: the Quran and Hadith. In fact, an important raising question is: could these ancient books be written by the same Author?
Thus, seven types of features are combined and normalized by PCA reduction and three visual analytical clustering methods are employed and commented on, namely: Principal Component Analysis, Gaussian Mixture Models and Self Organizing Maps.
The new visual analytical approach appears interesting, since it does not only show the distinction between the author styles, but also sheds light on how consistent was that distinction (i.e. visually).
Concerning the discrimination application on the ancient religious books, the results have shown the appearance of two separated clusters: namely a Quran cluster and Hadith cluster. The clusters distinction corresponds to a clear authorship difference between the two investigated documents, which implies that the two books (i.e. Quran and Hadith) come from two different Authors.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Blascheck, T., John, M., Kurzhals, K., Koch, S., Ertl, T.: VA2: a visual analytics approach for evaluating visual analytics applications. IEEE Trans. Vis. Comput. Graph. 22(1), 61–70 (2016)
Sayoud, H.: Segmental analysis based authorship discrimination between the Holy Quran and Prophet’s statements. Digital Stud. J. 2014–2015 (2015)
Sayoud, H.: A visual analytics based investigation on the authorship of the Holy Quran. In: International Conference on Information Visualization Theory and Applications (IVAPP’2015), 11–14 March 2015, pp. 177–181 (2015)
Ibrahim, I.A.: A brief illustrated guide to understanding Islam. Library of Congress, Darussalam Publishers, Houston. www.islam-guide.com/contents-wide.htm
Sayoud, H.: Author discrimination between the Holy Quran and Prophet’s statements. Literary Linguist. Comput. 27(4), 427–444 (2012)
Norusis, M.: Cluster analysis. In: SPSS 17.0 Statistical Procedures Companion, Marija Norusis, pp. 361–391. Pearson editor (2008). Chap. 16
Ellis, G., Mansmann, F.: VisMaster, Visual Analytics. In: Mastering the Information Age. Scientific Coordinator of VisMaster. Daniel Keim Jörn Kohlhammer (2010). Chap. 2
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Sayoud, H. (2018). Visual Analytics Based Authorship Discrimination Using Gaussian Mixture Models and Self Organising Maps: Application on Quran and Hadith. In: Mouhoub, M., Sadaoui, S., Ait Mohamed, O., Ali, M. (eds) Recent Trends and Future Technology in Applied Intelligence. IEA/AIE 2018. Lecture Notes in Computer Science(), vol 10868. Springer, Cham. https://doi.org/10.1007/978-3-319-92058-0_15
Download citation
DOI: https://doi.org/10.1007/978-3-319-92058-0_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-92057-3
Online ISBN: 978-3-319-92058-0
eBook Packages: Computer ScienceComputer Science (R0)