Skip to main content
Log in

Automated gender classification from handwriting: a systematic survey

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Automatically identifying the gender of a writer from a handwritten sample is an essential task in various domains, including historical document analysis, handwriting biometrics, and psychology. Technological advances in computer vision and image analysis have yielded various techniques suitable for this task, each with its own merits and limitations. However, a systematic survey of these techniques, which would provide researchers guidance on selecting the appropriate approach, is currently lacking. To address this gap, we used a predefined query to select and then analyze a selection of peer-reviewed studies published between 2012 and 2021 that presented automatic methods for the classification of gender from handwriting. Next, we describe and categorize the feature extraction methods applied and classifiers used in each study, overview the existing datasets, and compare the results across studies. Finally, based on these data, we specify yet-unresolved issues in the field and provide recommendations for the development of new and improved classification methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Data Availability

Data sharing is not applicable to this article as no datasets were generated during the current study.

Notes

  1. The initial version of the query also contained document* OR manuscript* OR (document* AND image*) OR (manuscript* AND image*), but we noticed that many relevant papers did not contain any of these words in their titles (nor in their abstract). Therefore, we decided to remove this condition, to avoid false negatives in our search.

  2. Studies that explored both public and private datasets were counted in both groups.

References

  1. Huber RA, Headrick AM (1999) Handwriting identification: facts and fundamentals. CRC press, Boca Raton. https://doi.org/10.1201/9781420048773

  2. Goodenough FL (1945) Sex differences in judging the sex of handwriting. J Soc Psychol 22(1):61–68. https://doi.org/10.1080/00224545.1945.9714182

    Article  Google Scholar 

  3. Hamid S, Loewenthal KM (1996) Inferring gender from handwriting in Urdu and English. J Soc Psychol 136(6):778–782. https://doi.org/10.1080/00224545.1996.9712254

    Article  Google Scholar 

  4. Upadhyay S, Singh J, Shukla S (2017) Determination of sex through handwriting characteristics. Int J Cur Res Rev 9(13):11. https://doi.org/10.7324/IJCRR.2017.9133

    Article  Google Scholar 

  5. Sesa-Nogueras E, Faundez-Zanuy M, Roure-Alcobé J (2016) Gender classification by means of online uppercase handwriting: a text-dependent allographic approach. Cogn Comput 8(1):15–29. https://doi.org/10.1007/s12559-015-9332-1

    Article  Google Scholar 

  6. Topaloglu M, Ekmekci S (2017) Gender detection and identifying one’s handwriting with handwriting analysis. Expert Syst Appl 79:236–243. https://doi.org/10.1016/j.eswa.2017.03.001

    Article  Google Scholar 

  7. Illouz E , David EO , Netanyahu NS (2018) Handwriting-based gender classification using end-to-end deep neural networks. In: International conference on artificial neural networks, Springer. pp 613–621. https://doi.org/10.1007/978-3-030-01424-7_60

  8. Rabaev I, Litvak M, Asulin S, Tabibi OH (2021) Automatic gender classification from handwritten images: a case study. In: International conference on computer analysis of images and patterns. Springer pp 329–339. https://doi.org/10.1007/978-3-030-89131-2_30

  9. Kitchenham B (2004) Procedures for performing systematic reviews. Keele, UK, Keele University 33(2004):1–26

    Google Scholar 

  10. Pautasso M (2013) Ten simple rules for writing a literature review. PLoS Comput Bio 9(7):1003149. https://doi.org/10.1371/journal.pcbi.1003149

    Article  Google Scholar 

  11. Xue G, Liu S, Gong D, Ma Y (2021) ATP-DenseNet: a hybrid deep learning-based gender identification of handwriting. Neural Comput Applic 33:4611–4622. https://doi.org/10.1007/s00521-020-05237-3

    Article  Google Scholar 

  12. Siddiqi I, Djeddi C, Raza A, Souici-meslati L (2015) Automatic analysis of handwriting for gender classification. Pattern Anal Applic 18:887–899. https://doi.org/10.1007/s10044-014-0371-0

    Article  MathSciNet  Google Scholar 

  13. Maken P, Gupta A (2021) A method for automatic classification of gender based on text-independent handwriting. Multimed Tools Appl. https://doi.org/10.1007/s11042-021-10837-9

  14. Bi N, Suen CY, Nobile N, Tan J (2019) A multi-feature selection approach for gender identification of handwriting based on kernel mutual information. Pattern Recogn Lett 121:123–132. https://doi.org/10.1016/j.patrec.2018.05.005

    Article  Google Scholar 

  15. Navya B, Shivakumara P, Shwetha G, Roy S, Guru D, Pal U, Lu T (2018) Adaptive multi-gradient kernels for handwritting based gender identification. In: Proceedings of international conference on frontiers in handwriting recognition, ICFHR, pp 392–397. https://doi.org/10.1109/ICFHR-2018.2018.00075

  16. Moetesum M, Siddiqi I, Djeddi C, Hannad Y, Al-Maadeed S (2018) Data driven feature extraction for gender classification using multi-script handwritten texts. In: Proceedings of international conference on frontiers in handwriting recognition, ICFHR, pp 564–569. https://doi.org/10.1109/ICFHR-2018.2018.00104

  17. Mirza A, Moetesum M, Siddiqi I, Djeddi C (2016) Gender classification from offline handwriting images using textural features. In: 2016 15th international conference on frontiers in handwriting recognition (ICFHR). IEEE, pp 395–398 https://doi.org/10.1109/ICFHR.2016.0080

  18. Gattal A, Djeddi C, Siddiqi I, Chibani Y (2018) Gender classification from offline multi-script handwriting images using oriented Basic Image Features (oBIFs). Expert Syst Appl 99:155–167. https://doi.org/10.1016/j.eswa.2018.01.038

    Article  Google Scholar 

  19. Hassaïne A, Al Maadeed S, Aljaam J, Jaoua A (2013) ICDAR 2013 competition on gender prediction from handwriting. In: International conference on document analysis and recognition pp 1417–1421. https://doi.org/10.1109/ICDAR.2013.286

  20. Djeddi C, Al-Maadeed S, Gattal A, Siddiqi I, Souici-Meslati L, El Abed H (2015) ICDAR 2015 competition on multi-script writer identification and gender classification using ‘QUWI’ database. In: International conference on document analysis and recognition, pp 1191–1195. https://doi.org/10.1109/ICDAR.2015.7333949

  21. Ahmed M, Rasool AG, Afzal H, Siddiqi I (2017) Improving handwriting based gender classification using ensemble classifiers. Expert Syst Appl 85:158–168. https://doi.org/10.1016/j.eswa.2017.05.033

    Article  Google Scholar 

  22. Tan J, Bi N, Suen CY, Nobile N (2016) Multi-feature selection of handwriting for gender identification using mutual information. In: 2016 15th international conference on frontiers in handwriting recognition (ICFHR), pp 578–583. https://doi.org/10.1109/ICFHR.2016.0111

  23. Navya BJ, Swetha GC, Shivakumara P, Roy S, Guru DS, Pal U, Lu T (2018) Multi-gradient directional features for gender identification. In: 2018 24th international conference on pattern recognition (ICPR), pp 3657–3662. https://doi.org/10.1109/ICPR.2018.8546033

  24. Akbari Y, Nouri K, Sadri J, Djeddi C, Siddiqi I (2017) Wavelet-based gender detection on off-line handwritten documents using probabilistic finite state automata. Image Vis Comput 59:17–30. https://doi.org/10.1016/j.imavis.2016.11.017

    Article  Google Scholar 

  25. Alaei F, Alaei A (2021) Gender detection based on spatial pyramid matching. In: International conference on document analysis and recognition, vol 12824, LNCS. pp 305–317. https://doi.org/10.1007/978-3-030-86337-1_21

  26. Bouadjenek N, Nemmour H, Chibani Y (2017) Fuzzy integrals for combining multiple svm and histogram features for writer’s gender prediction. IET Biometrics 6:429–437. https://doi.org/10.1049/iet-bmt.2016.0140

    Article  Google Scholar 

  27. Djeddi C, Al-Maadeed S, Gattal A, Siddiqi I, Ennaji A, El Abed H (2016) ICFHR2016 competition on multi-script writer demographics classification using “QUWI” database. In: 2016 15th international conference on frontiers in handwriting recognition (ICFHR). IEEE, pp 602–606. https://doi.org/10.1109/ICFHR.2016.0115

  28. Bouadjenek N, Nemmour H, Chibani Y (2015) Age, gender and handedness prediction from handwriting using gradient features. In: 2015 13th international conference on document analysis and recognition (ICDAR), pp 1116–1120. https://doi.org/10.1109/ICDAR.2015.7333934

  29. Morera Á, Sánchez Á, Vélez JF, Moreno AB (2018) Gender and handedness prediction from offline handwriting using convolutional neural networks. Complexity 2018. https://doi.org/10.1155/2018/3891624

  30. Rahmanian M, Shayegan MA (2021) Handwriting-based gender and handedness classification using convolutional neural networks. Multimed Tools Appl 80(28):35341–35364. https://doi.org/10.1007/s11042-020-10170-7

    Article  Google Scholar 

  31. Bouadjenek N, Nemmour H, Chibani Y (2016) Robust soft-biometrics prediction from off-line handwriting analysis. Appl Soft Comput 46:980–990. https://doi.org/10.1016/j.asoc.2015.10.021

    Article  Google Scholar 

  32. Marzinotto G, Rosales JC, El-Yacoubi MA, Garcia-Salicetti S (2015) Age and gender characterization through a two layer clustering of online handwriting. In: International conference on advanced concepts for intelligent vision systems. Springer, pp 428–439 https://doi.org/10.1007/978-3-319-25903-1_37

  33. Gornale SS, Kumar S, Patil A, Hiremath PS (2021) Behavioral biometric data analysis for gender classification using feature fusion and machine learning. Frontiers in Robotics and AI 8:685966. https://doi.org/10.3389/frobt.2021.685966

    Article  Google Scholar 

  34. Dargan S, Kumar M, Tuteja S (2021) Pca-based gender classification system using hybridization of features and classification techniques. Soft Comput 25(24):15281–15295. https://doi.org/10.1007/s00500-021-06118-0

    Article  Google Scholar 

  35. Al Maadeed S, Ayouby W, Hassaine A, Aljaam JM (2012) QUWI: an Arabic and English handwriting dataset for offline writer identification. In: 2012 international conference on frontiers in handwriting recognition. IEEE, pp 746–751. https://doi.org/10.1109/ICFHR.2012.256

  36. Djeddi C, Gattal A, Souici-Meslati L, Siddiqi I, Chibani Y, El Abed H (2014) LAMIS-MSHD: a multi-script offline handwriting database. In: 2014 14th international conference on frontiers in handwriting recognition. IEEE pp 93–97. https://doi.org/10.1109/ICFHR.2014.23

  37. Liwicki M, Bunke H (2005) IAM-OnDB-an on-line English sentence database acquired from handwritten text on a whiteboard. In: Eighth international conference on document analysis and recognition (ICDAR’05). IEEE, pp 956–9612, https://doi.org/10.1109/ICDAR.2005.132

  38. Mahmoud SA, Ahmad I, Alshayeb M, Al-Khatib WG, Parvez MT, Fink GA, Märgner V, El Abed H (2012) KHATT: Arabic offline handwritten text database. In: 2012 international conference on frontiers in handwriting recognition. IEEE, pp 449–454. https://doi.org/10.1109/ICFHR.2012.224

  39. Mahmoud SA, Ahmad I, Al-Khatib WG, Alshayeb M, Parvez MT, Märgner V, Fink GA (2014) KHATT: an open Arabic offline handwritten text database. Pattern Recognit 47(3):1096–1112. https://doi.org/10.1016/j.patcog.2013.08.009

    Article  Google Scholar 

  40. Djeddi C, Al-Maadeed S, Gattal A, Siddiqi I, Souici-Meslati L, Abed HE (2015) ICDAR2015 competition on multi-script writer identification and gender classification using ’QUWI’ database. Proc Int Conf Doc Anal Recognit, ICDAR:1191–1195. https://doi.org/10.1109/ICDAR.2015.7333949

  41. Armi L, Fekri-Ershad S (2019) Texture image analysis and texture classification methods—a review. arXiv preprint. arXiv:1904.06554

  42. Zhang D, Lu G (2004) Review of shape representation and description techniques. Pattern Recognit 37(1):1–19. https://doi.org/10.1016/j.patcog.2003.07.008

    Article  Google Scholar 

  43. Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297. https://doi.org/10.1007/BF00994018

    Article  MATH  Google Scholar 

  44. Wu X, Kumar V, Quinlan JR, Ghosh J, Yang Q, Motoda H, McLachlan GJ, Ng A, Liu B, Philip SY (2008) Top 10 algorithms in data mining. Knowl Inf Syst 14(1):1–37. https://doi.org/10.1007/s10115-007-0114-2

    Article  Google Scholar 

  45. Ho TK (1995) Random decision forests. In: Proceedings of 3rd international conference on document analysis and recognition, vol 1. IEEE, pp 278–282. https://doi.org/10.1109/ICDAR.1995.598994

  46. Freund Y, Schapire R, Abe N (1999) A short introduction to boosting. Journal-Japanese Society For Artificial Intelligence 14(771-780):1612

    Google Scholar 

  47. McCulloch WS, Pitts W (1943) A logical calculus of the ideas immanent in nervous activity. Bull Math Biophys 5(4):115–133. https://doi.org/10.1007/BF02478259

    Article  MathSciNet  MATH  Google Scholar 

  48. Altman NS (1992) An introduction to kernel and nearest-neighbor nonparametric regression. Am Stat 46(3):175–185. https://doi.org/10.1080/00031305.1992.10475879

    Article  MathSciNet  Google Scholar 

  49. Fisher RA (1938) The statistical utilization of multiple measurements. Ann Eugen 8(4):376–386

    Article  MATH  Google Scholar 

  50. Rayens WS (1993) Discriminant analysis and statistical pattern recognition. Technometrics 35 (3):324–326. https://doi.org/10.1080/00401706.1993.10485331

    Article  Google Scholar 

  51. McLachlan GJ (2005) Discriminant analysis and statistical pattern recognition. John Wiley & Sons, Hoboken

    MATH  Google Scholar 

  52. Liwicki M, Schlapbach A, Loretan P, Bunke H (2007) Automatic detection of gender and handedness from on-line handwriting. In: Proceedings of the 13th biennial conference of the international graphonomics society (IGS2007). Citeseer, pp 179–183. https://boris.unibe.ch/id/eprint/26482

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Irina Rabaev or Marina Litvak.

Ethics declarations

Conflict of Interests

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Irina Rabaev and Marina Litvak are contributed equally to this work.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Rabaev, I., Litvak, M. Automated gender classification from handwriting: a systematic survey. Appl Intell 53, 17154–17177 (2023). https://doi.org/10.1007/s10489-022-04347-w

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-022-04347-w

Keywords

Navigation