Abstract
Character segmentation is a challenging problem in the field of optical character recognition. Presence of touched characters make this dilemma more crucial. The goal of this paper is to provide major concepts and progress in domain of off-line cursive touched character segmentation. Accordingly, two broad classes of technique are identified. These include methods that perform explicit or implicit character segmentation. The basic methods used by each class of technique are presented and the contributions of individual algorithms within each class are discussed. It is the first survey that focuses on touched character segmentation and provides segmentation rates, descriptions of the test data for the approaches discussed. Finally, the main trends in the field of touched character segmentation are examined, important contributions are presented and future directions are also suggested.
Similar content being viewed by others
References
Alhajj R, Polat F, Elnagar A (2000) Employing multi-agents to identify touching of adjacent digits in handwritten Hindi numerals. In: Proceedings of the IEEE international conference on systems, man, and cybernetics, vol 4. pp 2725–2730
Bae JH, Jung KC, Kim JW, Kim HJ (1998) Segmentation of touching characters using an MLP. Pattern Recognit Lett 19: 701–709
Bansal V, Sinha RMK (2002) Segmentation of touching and fused Devanagari characters. Pattern Recognit 35(4): 875–893
Bayer T, Kressel U (1993) Cut classification for segmentation. Paper presented at the proceedings of international conference on document analysis and recognition. Tsukuba Science City, Japan, pp 565–568
Broumandnia A, Shanbehzadeh J, Nourani M (2007) Segmentation of printed Farsi/Arabic words. In: Proceedings of IEEE/ACS international conference on computer systems and applications. pp 761–766
Casey RG, Lecolinet E (1996) A survey of methods and strategies in character segmentation. IEEE Trans Pattern Anal Mach Intell 17: 690–706
Casey RG, Nagy G (1982) Recursive segmentation and classification of composite character patterns. Paper presented at the proceedings of the 6th international conference on pattern recognition, Munich, Germany
Chunheng W, Hotta Y, Suwa M, Naoi N (2004) Handwritten Chinese address recognition. In: Proceedings of the ninth international workshop on frontiers in handwriting recognition. pp 539–544
Dijkstra EW (1959) A note on two problems in connection with graphs. Numer Comput Math 1: 269–271
Elnagar A, Alhajj R (2003) Segmentation of connected handwritten numeral strings. Pattern Recognit 36(3): 625–634
Fujisawa H (2007) A view on the past and future of character and document recognition. In: Ninth international conference on document analysis and recognition. pp 3–7
Garain U, Chaudhuri BB (2002) Segmentation of touching characters in printed devnagari and bangla scripts using fuzzy multifactorial analysis. IEEE Trans Syst Man Cybern 32(4): 449–459
Han Z, Liu CP, Yin XC (2005) A two-stage handwritten character segmentation approach in mail address recognition. In: Proceedings of eighth international conference on document analysis and recognition, vol 1. pp 111–115
Hilditch CJ (1969) Linear skelton from square cupboards. Mach Intell 4: 403–420
Hoffman RL, Mccullough JW (1971) Segmentation methods for recognition of machine-printed characters. IBM J Res Dev 15, 153–165
Hong-Gang Z, Gang L, Wei-Ran X, Jun G (2002) An algorithm of handwritten digits segmentation based on multi-mould. In: Proceedings of international conference on machine learning and cybernetics, vol 2. pp 1081–1084
Jayarathna UKS, Bandara GEMDC (2006) New segmentation algorithm for off-line handwritten connected character segmentation. In: Proceedings of the first international conference on industrial and information systems. pp 540–546
Jianming H, Donggang Y, Hong Y (1999) Construction of partitioning paths for touching handwritten characters. Non-Linear Anal 20(3): 293–303
Jin-Hak B, Kee-Chul J, Jin-Wook K, Hang-Joon K (1998) Segmentation of touching characters using an MLP. Pattern Recognit Lett 19(8): 701–709
Jiqiang S, Zuo L, Lyu MR, Shijie C (2005) Recognition of merged characters based on forepart prediction, necessity-sufficiency matching, and character-adaptive masking. IEEE Trans Syst Man Cybern B 35(1): 2–11
Kahan S, Pavlidis T, Baird HS (1987) On the recognition of printed characters of any font and size. IEEE Trans Pattern Anal Mach Intell 9(2): 274–288
Kim KK, Kim JH, Suen CY (2002) Segmentation-based recognition of handwritten touching pairs of digits using structural features. Pattern Recognit Lett 23(1–3): 13–24
Kimura F, Shridhar M, Chen Z (1993) Improvements of a lexicon directed algorithm for recognition of unconstrained handwritten words. In: Proceedings of the second international conference on document analysis and recognition. pp 18–22
Kurniawan F, Rehman A, Dzulkifli M, Mariyam S (2009) Self organizing features map with improved segmentation to identify touching of adjacent characters in handwritten words (2011). In: Ninth international conference on hybrid intelligent systems (HIS 2009), vol 1. Shenyang, LiaoNing, China, pp 475–480
Kurniawan F, Rehman A, Dzulkifli M, Mariyam S, Sulong G (2011) Region-based touched character segmentation in handwritten words. Int J Innov Comput Inf Control (IJICIC) 7(6): 3107–3120
Lee H-J, Lee M-C (1993) Understanding mathematical expressions in a printed document. In: Proceedings of second international conference on document analysis and recognition. pp 502–505
Liang S, Shridhar M, Ahmadi M (1994) Segmentation of touching characters in printed document recognition. Pattern Recognit 27(6): 825–840
Lorenz O, Monagan G (1994) Retrieval of line drawings. In: Proceedings of the third annual symposium on document analysis and information retrieval. Las Vegas, Nevada
Lorigo L, Govindaraju V (2005) Segmentation and pre-recognition of arabic handwriting. In: Proceedings of international conference on document analysis and recognition, vol 2. pp 605–609
Lu Y (1995) Machine printed character segmentation: an overview. Pattern Recognit 28(1): 67–80
Masayuki O, Syougo S, Tadashi S (1999) Segmentation of touching characters in formulas. In: Proceedings of third IAPR workshop on document analysis systems theory and practice
Min-Chul J, Yong-Chul S, Srihari SN (1999) Machine printed character segmentation method using side profiles. In: Proceedings of IEEE SMC ’99 conference on systems, man, and cybernetics
Misako S, Satoshi N (2004) Segmentation of handwritten numerals by graph representation. In: Proceedings of the ninth international workshop on frontiers in handwriting recognition
Monagan G (1994) A procedure for segmenting touching numbers in cadastral maps. In: Proceedings of IAPR workshop on machine vision applications (MVA 1994). Kawasaki, Japan
Okamoto M, Sakaguchi S, Suzuki T (1999) Segmentation of touching characters in formulas. Lect Note Comput Sci 1655:151–156
Oliveira LES, Lethelier E, Bortolozzi F, Sabourin R (2000) A new segmentation approach for handwritten digits. In: Proceedings of the 15th international conference on pattern recognition, vol 2. pp 323–326
Ouchtati S, Bedda M, Lachouri A (2007) Segmentation and recognition of handwritten numeric chains. J Comput Sci 3(4): 242–248
Pal U, Belaid A, Choisy C (2001) Water reservoir based approach for touching numeral segmentation. In: Proceedings sixth international conference on document analysis and recognition. pp 892–896
Pal U, Belaid A, Choisy C (2003) Touching numeral segmentation using water reservoir concept. Pattern Recognit Lett 24(1–3): 261–272
Rao AVS, Subbarao M, Rao NV, Sastry ASCS, Reddy LP (2009) Segmentation of touching handwritten numerals and alphabets. In: Second international conference on computer and electrical engineering. pp 304–307
Rehman A, Saba T (2011) Performance analysis of segmentation approach for cursive handwritten word recognition on benchmark database. Digit Signal Process (Elsevier) 21: 486–490
Roy PP, Pal U, Llados J (2008) Recognition of multi-oriented touching characters in graphical documents. In: Proceedings of sixth Indian conference on computer vision, graphics & image processing. pp 297–304
Rui MA, Yingnan Z, Yongquan X, Yunyang Y (2008) A touching pattern-oriented strategy for handwritten digits segmentation. In: Proceeding of international conference on computational intelligence and security. pp 174–179
Saba T, Rehman A (2011) Off-line cursive script recognition: current advances, comparisons and remaining problems. Artif Intell Rev. doi:10.1007/s10462-011-9229-7
Saba T, Rehman A, Sulong G (2010a) Improved off-line connected script recognition based on hybrid strategy. Int J Eng Sci Technol 2(6): 1603–1611
Saba T, Rehman A, Sulong G (2010b) Non-linear segmentation of touched roman characters based on genetic algorithm. Int J Comput Sci Eng 2(6): 2167–2172
Saba T, Rehman A, Sulong G (2010c) Touched cursive script segmentation with neural confidence. Int J Innov Comput Inf Control 7(7): 1–10
Saba T, Sulong G, Rahim S, Rehman A (2010d) On the segmentation of multiple touched characters: a heuristics approach. Inf Commun Technol Lect Note Comput Sci (LNCS) Springer Verlag 101(3): 540–542. doi:10.1007/978-3-642-15766-0_91
Saba T, Sulong G, Rehman A (2011) Document image analysis: issues, comparison of methods and remaining problems. Artif Intell Rev 35(2): 101–118. doi:10.1007/s10462-010-9186-6
Seo W, Cho BJ (2004) Efficient segmentation path generation for unconstrained handwritten Hangul characters. Lect Note Artif Intell 3192: 438–446
Shi Z, Govindaraju V (1997) Segmentation and recognition of connected handwritten numeral strings. Pattern Recognit 30(9): 1501–1504
Strathy NW, Suen CY, Krzyzak A (1993) Segmentation of handwritten digits using contour features. In: Proceedings of the second international conference on document analysis and recognition. pp 577–580
Suwa M (2005) Segmentation of connected handwritten numerals by graph representation. In: Proceedings of eighth international conference on document analysis and recognition, vol 2. pp 750–754
Teruyuki Y, Shinji T, Tomohiro Y, Tsuyoshi S, Eiji M, Hisao O (2002) A segmentation system for touching handwritten Japanese characters. In: Proceedings of the eighth international workshop on frontiers in handwriting recognition (IWFHR’02). pp 407–412
Tian X, Zhang Y (2007) Segmentation of touching characters in mathematical expressions using contour feature technique. In: Eighth ACIS international conference on software engineering, artificial intelligence, networking, and parallel/distributed computing. pp 206–209
Tripathy N, Pal U (2004) Handwriting segmentation of un-constrained Oriya text. In: Proceedings of the international workshop on frontiers in handwriting recognition. pp 306–311
Tsujimoto S, Assada H (1991) Resolving ambiguity in segmenting touching characters. In: Proceedings of the first international conference on document analysis and recognition
Ventzislav A (2004) Using critical points in contours for segmentation of touching characters. In: Proceedings of the 5th international conference on computer systems and technologies
Wei X, Ma S (2005) Segmentation of touching Chinese character based on convex hull ratio feature. J Chin Inf Process 47: 91–96
Wei X, Ma S, Jin Y (2005) Segmentation of connected Chinese characters based on genetic algorithm. In: Proceedings of the eighth international conference on document analysis and recognition (ICDAR’05), vol 2. pp 645–649
Wshah S, Shi Z, Govindaraju V (2009a) Segmentation of Arabic handwriting based on both contour and skeleton segmentation. In: Proceedings of the tenth international conference on document analysis and recognition. pp 793–797
Wshah S, Shi Z, Govindaraju V (2009b) Segmentation of Arabic handwriting based on both contour and skeleton segmentation In: Proceedings of the 10th international conference on document analysis and recognition. pp 793–797
Xian W, Govindaraju V, Srihari S (1999) Multi-experts for touching digit string recognition. In: Proceedings of the fifth international conference on document analysis and recognition. pp 800–803
Xianghui W, Shaoping M, Yijiang J (2005) Segmentation of connected Chinese characters based on genetic algorithm. Paper presented at the proceedings of the eighth international conference on document analysis and recognition
Yamaguchi T, Yoshikawa T, Shinogi T, Tsuruoka S, Teramoto M (2001) A segmentation method for touching Japanese handwritten characters based on connecting condition of lines. In: Proceedings of sixth international conference on document analysis and recognition. pp 837–841
Yi-Kai C, Jhing-Fa W (2000) Segmentation of single or multiple-touching handwritten numeral string using background and foreground analysis. IEEE Trans Pattern Anal Mach Intell 22(11): 1304–1317
Yong G, Yan Z, Zhao H (2009a) Touching string segmentation using MRF. In: International conference on computational intelligence and security. pp 520–524
Yong G, Yan Z, Zhao H (2009b) Touching string segmentation using MRF. In: International conference on computational intelligence and security. pp 520–524
Yong G, Yan Z, Zhao H (2009c) Touching string segmentation using MRF. In: International conference on computational intelligence and security. pp 520–524
Yoon JJ, Kim G (2000) An approach for active segmentation of unconstrained handwritten Korean strings using run-length code. In: Proceedings of the seventh international workshop on frontiers in handwriting recognition. Amsterdam
Yu D, Yan H (2001) Separation of touching handwritten multi-numeral strings based on morphological structural features. Pattern Recognit 34(3): 587–599
Yun L, Liu CS, Ding XQ, Qiang F (2004) A recognition based system for segmentation of touching handwritten numeral strings. In: Proceedings of the ninth international workshop on frontiers in handwriting recognition. pp 294–299
Zhang JY, Ding XQ (2000) Multi-scale feature extraction and nested-subset classifier design for high accuracy handwritten character recognition. Paper presented at the, Barcelona, Spain
Zhang S, Karim MA (2002) A new impulse detector for switching median filters. IEEE Signal Process Lett 9: 360–363
Zhang LQ, Suen CY (2002) Recognition of courtesy amounts on bank checks based on a segmentation approach. In: Proceedings of eighth international workshop on handwriting recognition. pp 298–302
Zhang T, Wang X, Chen C, Liu J (2009) Connected numeral strings segmentation based on the combination of characteristic position and contour detecting. In: Proceedings of the first international conference on digital image processing. pp 81–84
Zhongkang L, Zheru C, Wan-Chi S, Pengfei S (1999) A background-thinnig-based approach for separating and recognizing connected handwritten digit strings. Pattern Recognit 32: 921–933
Zhu X, Yin X (2002) A new textual/non-textual classifier for document skew correction. In: Proceedings of the 16th international conference on pattern recognition (ICPR). pp 480–482
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Saba, T., Rehman, A. & Elarbi-Boudihir, M. Methods and strategies on off-line cursive touched characters segmentation: a directional review. Artif Intell Rev 42, 1047–1066 (2014). https://doi.org/10.1007/s10462-011-9271-5
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10462-011-9271-5