Abstract
Automatically identifying the gender of a writer from a handwritten sample is an essential task in various domains, including historical document analysis, handwriting biometrics, and psychology. Technological advances in computer vision and image analysis have yielded various techniques suitable for this task, each with its own merits and limitations. However, a systematic survey of these techniques, which would provide researchers guidance on selecting the appropriate approach, is currently lacking. To address this gap, we used a predefined query to select and then analyze a selection of peer-reviewed studies published between 2012 and 2021 that presented automatic methods for the classification of gender from handwriting. Next, we describe and categorize the feature extraction methods applied and classifiers used in each study, overview the existing datasets, and compare the results across studies. Finally, based on these data, we specify yet-unresolved issues in the field and provide recommendations for the development of new and improved classification methods.
Similar content being viewed by others
Data Availability
Data sharing is not applicable to this article as no datasets were generated during the current study.
Notes
The initial version of the query also contained document* OR manuscript* OR (document* AND image*) OR (manuscript* AND image*), but we noticed that many relevant papers did not contain any of these words in their titles (nor in their abstract). Therefore, we decided to remove this condition, to avoid false negatives in our search.
Studies that explored both public and private datasets were counted in both groups.
References
Huber RA, Headrick AM (1999) Handwriting identification: facts and fundamentals. CRC press, Boca Raton. https://doi.org/10.1201/9781420048773
Goodenough FL (1945) Sex differences in judging the sex of handwriting. J Soc Psychol 22(1):61–68. https://doi.org/10.1080/00224545.1945.9714182
Hamid S, Loewenthal KM (1996) Inferring gender from handwriting in Urdu and English. J Soc Psychol 136(6):778–782. https://doi.org/10.1080/00224545.1996.9712254
Upadhyay S, Singh J, Shukla S (2017) Determination of sex through handwriting characteristics. Int J Cur Res Rev 9(13):11. https://doi.org/10.7324/IJCRR.2017.9133
Sesa-Nogueras E, Faundez-Zanuy M, Roure-Alcobé J (2016) Gender classification by means of online uppercase handwriting: a text-dependent allographic approach. Cogn Comput 8(1):15–29. https://doi.org/10.1007/s12559-015-9332-1
Topaloglu M, Ekmekci S (2017) Gender detection and identifying one’s handwriting with handwriting analysis. Expert Syst Appl 79:236–243. https://doi.org/10.1016/j.eswa.2017.03.001
Illouz E , David EO , Netanyahu NS (2018) Handwriting-based gender classification using end-to-end deep neural networks. In: International conference on artificial neural networks, Springer. pp 613–621. https://doi.org/10.1007/978-3-030-01424-7_60
Rabaev I, Litvak M, Asulin S, Tabibi OH (2021) Automatic gender classification from handwritten images: a case study. In: International conference on computer analysis of images and patterns. Springer pp 329–339. https://doi.org/10.1007/978-3-030-89131-2_30
Kitchenham B (2004) Procedures for performing systematic reviews. Keele, UK, Keele University 33(2004):1–26
Pautasso M (2013) Ten simple rules for writing a literature review. PLoS Comput Bio 9(7):1003149. https://doi.org/10.1371/journal.pcbi.1003149
Xue G, Liu S, Gong D, Ma Y (2021) ATP-DenseNet: a hybrid deep learning-based gender identification of handwriting. Neural Comput Applic 33:4611–4622. https://doi.org/10.1007/s00521-020-05237-3
Siddiqi I, Djeddi C, Raza A, Souici-meslati L (2015) Automatic analysis of handwriting for gender classification. Pattern Anal Applic 18:887–899. https://doi.org/10.1007/s10044-014-0371-0
Maken P, Gupta A (2021) A method for automatic classification of gender based on text-independent handwriting. Multimed Tools Appl. https://doi.org/10.1007/s11042-021-10837-9
Bi N, Suen CY, Nobile N, Tan J (2019) A multi-feature selection approach for gender identification of handwriting based on kernel mutual information. Pattern Recogn Lett 121:123–132. https://doi.org/10.1016/j.patrec.2018.05.005
Navya B, Shivakumara P, Shwetha G, Roy S, Guru D, Pal U, Lu T (2018) Adaptive multi-gradient kernels for handwritting based gender identification. In: Proceedings of international conference on frontiers in handwriting recognition, ICFHR, pp 392–397. https://doi.org/10.1109/ICFHR-2018.2018.00075
Moetesum M, Siddiqi I, Djeddi C, Hannad Y, Al-Maadeed S (2018) Data driven feature extraction for gender classification using multi-script handwritten texts. In: Proceedings of international conference on frontiers in handwriting recognition, ICFHR, pp 564–569. https://doi.org/10.1109/ICFHR-2018.2018.00104
Mirza A, Moetesum M, Siddiqi I, Djeddi C (2016) Gender classification from offline handwriting images using textural features. In: 2016 15th international conference on frontiers in handwriting recognition (ICFHR). IEEE, pp 395–398 https://doi.org/10.1109/ICFHR.2016.0080
Gattal A, Djeddi C, Siddiqi I, Chibani Y (2018) Gender classification from offline multi-script handwriting images using oriented Basic Image Features (oBIFs). Expert Syst Appl 99:155–167. https://doi.org/10.1016/j.eswa.2018.01.038
Hassaïne A, Al Maadeed S, Aljaam J, Jaoua A (2013) ICDAR 2013 competition on gender prediction from handwriting. In: International conference on document analysis and recognition pp 1417–1421. https://doi.org/10.1109/ICDAR.2013.286
Djeddi C, Al-Maadeed S, Gattal A, Siddiqi I, Souici-Meslati L, El Abed H (2015) ICDAR 2015 competition on multi-script writer identification and gender classification using ‘QUWI’ database. In: International conference on document analysis and recognition, pp 1191–1195. https://doi.org/10.1109/ICDAR.2015.7333949
Ahmed M, Rasool AG, Afzal H, Siddiqi I (2017) Improving handwriting based gender classification using ensemble classifiers. Expert Syst Appl 85:158–168. https://doi.org/10.1016/j.eswa.2017.05.033
Tan J, Bi N, Suen CY, Nobile N (2016) Multi-feature selection of handwriting for gender identification using mutual information. In: 2016 15th international conference on frontiers in handwriting recognition (ICFHR), pp 578–583. https://doi.org/10.1109/ICFHR.2016.0111
Navya BJ, Swetha GC, Shivakumara P, Roy S, Guru DS, Pal U, Lu T (2018) Multi-gradient directional features for gender identification. In: 2018 24th international conference on pattern recognition (ICPR), pp 3657–3662. https://doi.org/10.1109/ICPR.2018.8546033
Akbari Y, Nouri K, Sadri J, Djeddi C, Siddiqi I (2017) Wavelet-based gender detection on off-line handwritten documents using probabilistic finite state automata. Image Vis Comput 59:17–30. https://doi.org/10.1016/j.imavis.2016.11.017
Alaei F, Alaei A (2021) Gender detection based on spatial pyramid matching. In: International conference on document analysis and recognition, vol 12824, LNCS. pp 305–317. https://doi.org/10.1007/978-3-030-86337-1_21
Bouadjenek N, Nemmour H, Chibani Y (2017) Fuzzy integrals for combining multiple svm and histogram features for writer’s gender prediction. IET Biometrics 6:429–437. https://doi.org/10.1049/iet-bmt.2016.0140
Djeddi C, Al-Maadeed S, Gattal A, Siddiqi I, Ennaji A, El Abed H (2016) ICFHR2016 competition on multi-script writer demographics classification using “QUWI” database. In: 2016 15th international conference on frontiers in handwriting recognition (ICFHR). IEEE, pp 602–606. https://doi.org/10.1109/ICFHR.2016.0115
Bouadjenek N, Nemmour H, Chibani Y (2015) Age, gender and handedness prediction from handwriting using gradient features. In: 2015 13th international conference on document analysis and recognition (ICDAR), pp 1116–1120. https://doi.org/10.1109/ICDAR.2015.7333934
Morera Á, Sánchez Á, Vélez JF, Moreno AB (2018) Gender and handedness prediction from offline handwriting using convolutional neural networks. Complexity 2018. https://doi.org/10.1155/2018/3891624
Rahmanian M, Shayegan MA (2021) Handwriting-based gender and handedness classification using convolutional neural networks. Multimed Tools Appl 80(28):35341–35364. https://doi.org/10.1007/s11042-020-10170-7
Bouadjenek N, Nemmour H, Chibani Y (2016) Robust soft-biometrics prediction from off-line handwriting analysis. Appl Soft Comput 46:980–990. https://doi.org/10.1016/j.asoc.2015.10.021
Marzinotto G, Rosales JC, El-Yacoubi MA, Garcia-Salicetti S (2015) Age and gender characterization through a two layer clustering of online handwriting. In: International conference on advanced concepts for intelligent vision systems. Springer, pp 428–439 https://doi.org/10.1007/978-3-319-25903-1_37
Gornale SS, Kumar S, Patil A, Hiremath PS (2021) Behavioral biometric data analysis for gender classification using feature fusion and machine learning. Frontiers in Robotics and AI 8:685966. https://doi.org/10.3389/frobt.2021.685966
Dargan S, Kumar M, Tuteja S (2021) Pca-based gender classification system using hybridization of features and classification techniques. Soft Comput 25(24):15281–15295. https://doi.org/10.1007/s00500-021-06118-0
Al Maadeed S, Ayouby W, Hassaine A, Aljaam JM (2012) QUWI: an Arabic and English handwriting dataset for offline writer identification. In: 2012 international conference on frontiers in handwriting recognition. IEEE, pp 746–751. https://doi.org/10.1109/ICFHR.2012.256
Djeddi C, Gattal A, Souici-Meslati L, Siddiqi I, Chibani Y, El Abed H (2014) LAMIS-MSHD: a multi-script offline handwriting database. In: 2014 14th international conference on frontiers in handwriting recognition. IEEE pp 93–97. https://doi.org/10.1109/ICFHR.2014.23
Liwicki M, Bunke H (2005) IAM-OnDB-an on-line English sentence database acquired from handwritten text on a whiteboard. In: Eighth international conference on document analysis and recognition (ICDAR’05). IEEE, pp 956–9612, https://doi.org/10.1109/ICDAR.2005.132
Mahmoud SA, Ahmad I, Alshayeb M, Al-Khatib WG, Parvez MT, Fink GA, Märgner V, El Abed H (2012) KHATT: Arabic offline handwritten text database. In: 2012 international conference on frontiers in handwriting recognition. IEEE, pp 449–454. https://doi.org/10.1109/ICFHR.2012.224
Mahmoud SA, Ahmad I, Al-Khatib WG, Alshayeb M, Parvez MT, Märgner V, Fink GA (2014) KHATT: an open Arabic offline handwritten text database. Pattern Recognit 47(3):1096–1112. https://doi.org/10.1016/j.patcog.2013.08.009
Djeddi C, Al-Maadeed S, Gattal A, Siddiqi I, Souici-Meslati L, Abed HE (2015) ICDAR2015 competition on multi-script writer identification and gender classification using ’QUWI’ database. Proc Int Conf Doc Anal Recognit, ICDAR:1191–1195. https://doi.org/10.1109/ICDAR.2015.7333949
Armi L, Fekri-Ershad S (2019) Texture image analysis and texture classification methods—a review. arXiv preprint. arXiv:1904.06554
Zhang D, Lu G (2004) Review of shape representation and description techniques. Pattern Recognit 37(1):1–19. https://doi.org/10.1016/j.patcog.2003.07.008
Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297. https://doi.org/10.1007/BF00994018
Wu X, Kumar V, Quinlan JR, Ghosh J, Yang Q, Motoda H, McLachlan GJ, Ng A, Liu B, Philip SY (2008) Top 10 algorithms in data mining. Knowl Inf Syst 14(1):1–37. https://doi.org/10.1007/s10115-007-0114-2
Ho TK (1995) Random decision forests. In: Proceedings of 3rd international conference on document analysis and recognition, vol 1. IEEE, pp 278–282. https://doi.org/10.1109/ICDAR.1995.598994
Freund Y, Schapire R, Abe N (1999) A short introduction to boosting. Journal-Japanese Society For Artificial Intelligence 14(771-780):1612
McCulloch WS, Pitts W (1943) A logical calculus of the ideas immanent in nervous activity. Bull Math Biophys 5(4):115–133. https://doi.org/10.1007/BF02478259
Altman NS (1992) An introduction to kernel and nearest-neighbor nonparametric regression. Am Stat 46(3):175–185. https://doi.org/10.1080/00031305.1992.10475879
Fisher RA (1938) The statistical utilization of multiple measurements. Ann Eugen 8(4):376–386
Rayens WS (1993) Discriminant analysis and statistical pattern recognition. Technometrics 35 (3):324–326. https://doi.org/10.1080/00401706.1993.10485331
McLachlan GJ (2005) Discriminant analysis and statistical pattern recognition. John Wiley & Sons, Hoboken
Liwicki M, Schlapbach A, Loretan P, Bunke H (2007) Automatic detection of gender and handedness from on-line handwriting. In: Proceedings of the 13th biennial conference of the international graphonomics society (IGS2007). Citeseer, pp 179–183. https://boris.unibe.ch/id/eprint/26482
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of Interests
On behalf of all authors, the corresponding author states that there is no conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Irina Rabaev and Marina Litvak are contributed equally to this work.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Rabaev, I., Litvak, M. Automated gender classification from handwriting: a systematic survey. Appl Intell 53, 17154–17177 (2023). https://doi.org/10.1007/s10489-022-04347-w
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-022-04347-w