Skip to main content
Log in

A survey on camera-captured scene text detection and extraction: towards Gurmukhi script

  • Trends and Surveys
  • Published:
International Journal of Multimedia Information Retrieval Aims and scope Submit manuscript

Abstract

Owing to technological advancements and new inventions, the world is transforming into a digital world where cameras have replaced the scanners for converting the hard copy documents into their soft versions. And, nowadays, these gadgets are used to capture the images of any text, present anywhere in real-life scenes, and process it, hence making the area of camera-captured scene text extraction a key area of interest for researchers. This paper discusses the certain background concepts and provides a categorization of the major challenges faced while extracting the camera-captured scene text, and scope and applications of the scene text extraction. Moreover, it presents a comprehensive literature survey of different techniques and methods adopted by researchers to extract the text from the scene images captured via cameras with a special emphasis on Gurmukhi script. For better understanding, this paper classifies various techniques by three different parameters and discusses the available datasets along with performance metrics. To summarize, this survey provides a fundamental study and review of latest happenings in the area of camera-captured scene text extraction and offers researchers an overview of the work done so far along with certain future directions.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Notes

  1. Google Goggles available at https://play.google.com/store/apps.

  2. The voice for android available at http://www.artificialvision.com/android.htm.

  3. Automatic book scanners available at http://www.kirtas.com/kabisIII.php.

  4. Available at http://www.isical.ac.in/~ujjwal/download/database.html.

  5. Available at http://yann.lecun.com/exdb/mnist/.

  6. Available at: http://prir.ustb.edu.cn/TexStar/MOMV-text-detection/: USTB-SV.

References

  1. Chen D, Luettin J, Shearer K (2000) A survey of text detection and recognition in images and videos. Technical Report, Institute Dalle Molled Intelligence Perceptive Research Report IDIAP, pp 0–38

  2. Jung K, Kim KI, Jain AK (2004) Text information extraction in images and video: a survey. Pattern Recognit 37(5):977–997. doi:10.1016/j.patcog.2003.10.012

    Article  Google Scholar 

  3. Liang J, Doermann D, Li H (2005) Camera-based analysis of text and documents: a survey. Int J Doc Anal Recognit 7(2–3):84–104. doi:10.1007/s10032-004-0138-z

    Article  Google Scholar 

  4. Zhu Y, Yao C, Bai X (2015) Scene text detection and recognition: recent advances and future trends. Front Comput Sci. doi:10.1007/s11704-015-4488-0

    Google Scholar 

  5. Ye Q, Doermann D (2015) Text detection and recognition in imagery? A survey. IEEE Trans Pattern Anal Mach Intell 8828(7):1–20. doi:10.1109/TPAMI.2014.2366765

    Google Scholar 

  6. Zhang H, Zhao K, Song YZ, Guo J (2013) Text extraction from natural scene image: a survey. Neurocomputing 122:310–323. doi:10.1016/j.neucom.2013.05.037

    Article  Google Scholar 

  7. Karatzas D, Shafait F, Uchida S, Iwamura M, Gomez, L (2013) ICDAR 2013 robust reading competition. In: Proceedings of international conference on document analysis and recognition, pp 1484–1493. doi:10.1109/ICDAR.2013.221

  8. Karatzas D, Gomez-Bigorda L, Nicolaou A et al (2015) ICDAR 2015 competition on robust reading. In: Proceedings of international conference on document analysis and recognition (ICDAR), pp 1156–1160. doi:10.1109/ICDAR.2015.7333942

  9. Kasar T (2011) Camera-captured document image analysis. Dissertation, Indian Institute of Science, Bangalore

  10. Simon C, Williem Park IK (2015) Correcting geometric and photometric distortion of document images on a smartphone. J Electron Imaging 24(1):1–14. doi:10.1117/1.JEI.24.1.013038

    Article  Google Scholar 

  11. Simon C, Cho S, Park I K (2014) Fast and robust perspective rectification of document images on a smartphone. In: IEEE conference on computer vision and pattern recognition workshops, pp 196–197. doi:10.1109/CVPRW.2014.37

  12. Phan TQ, Shivakumara P, Tian S, Tan CL (2013) Recognizing text with perspective distortion in natural scenes. In: IEEE international conference on computer vision (ICCV), pp 569–576. doi:10.1109/ICCV.2013.76

  13. Merino-Gracia C, Mirmehdi M, Sigut J, González-Mora JL (2013) Fast perspective recovery of text in natural scenes. Image Vis Comput 31(10):714–724. doi:10.1016/j.imavis.2013.07.002

    Article  Google Scholar 

  14. Yao C, Bai X, Liu W, Ma Y, Tu Z (2012) Detecting texts of arbitrary orientations in natural images. IEEE Conf Comput Vis Pattern Recognit (CVPR) 8:1083–1090. doi:10.1109/CVPR.2012.6247787

    Google Scholar 

  15. Yin XC, Pei W-Y, Zhang J, Hao H-W (2015) Multi-orientation scene text detection with adaptive clustering. IEEE Trans Pattern Anal Mach Intell 37(9):1930–1937. doi:10.1109/TPAMI.2014.2388210

    Article  Google Scholar 

  16. Black M, Berard F, Jepson A, Newman W, Saund E, Socher G, Taylor M (1998) The digital office: overview. In: Proceedings of the aaai spring symposium on intelligent environments, pp 1–6

  17. Arth C, Limberger F, Bischof H (2007) Real-time license plate recognition on an embedded DSP-platform. In: IEEE conference on computer vision and pattern recognition, pp 1–8. doi:10.1109/CVPR.2007.38341

  18. Sermanet P, Chintala S, LeCun Y (2012) Convolutional neural networks applied to house numbers digit classification. In: Proceedings of international conference on pattern recognition ICPR, pp 10–13

  19. Lee C-M, Kankanhalli A (1995) Automatic extraction of characters in complex scene images. Int J Pattern Recognit Artif Intell 9(1):67–82. doi:10.1142/S0218001495000043

    Article  Google Scholar 

  20. Liu X, Samarabandu J (2005) An edge-based text region extraction algorithm for indoor mobile robot navigation. IEEE Inte Conf Mech Autom 2(7):2008–2015. doi:10.1109/ICMA.2005.1626635

    Google Scholar 

  21. Ye Q, Huang Q, Gao W, Zhao D (2005) Fast and robust text detection in images and video frames. Image Vis Comput 23(6):565–576. doi:10.1016/j.imavis.2005.01.004

    Article  Google Scholar 

  22. Frames V (2003) Robust text detection algorithm in images and video frames. ICICS PCM 2:802–806. doi:10.1109/ICICS.2003.1292567

    Google Scholar 

  23. Jain AK, Yu B (1998) Automatic text location in images and video frames. Proceedings of fourteenth international conference on pattern recognition 2:3–5. doi:10.1109/ICPR.1998.711990

    Google Scholar 

  24. Lee CW, Jung K, Kim HJ (2003) Automatic text detection and removal in video sequences. Pattern Recognit Lett 24(15):2607–2623. doi:10.1016/S0167-8655(03)00105-3

    Article  Google Scholar 

  25. Tehsin S, Masood A, Kausar S, Javed Y (2015) A caption text detection method from images/videos for efficient indexing and retrieval of multimedia data. Int J Pattern Recognit Artif Intell 29(1):1–23. doi:10.1142/S0218001415550034

    Article  Google Scholar 

  26. Shivakumara P, Phan TQ, Tan CL (2011) A Laplacian approach to multi-oriented text detection in video. IEEE Trans Pattern Anal Mach Intell 33(2):412–419. doi:10.1109/TPAMI.2010.166

    Article  Google Scholar 

  27. Lienhart R, Wernicke A (2002) Localizing and segmenting text in images and videos. IEEE Trans Circuits Syst Video Technol 12(4):256–268. doi:10.1109/76.999203

    Article  Google Scholar 

  28. Ohya J, Shio A, Akamatsu S (1994) Recognizing characters in scene images. IEEE Trans Pattern Anal Mach Intell 16(2):214–220. doi:10.1109/34.273729

    Article  Google Scholar 

  29. Kim KI, Jung K, Kim JH (2003) Texture-based approach for text detection in images using support vector machines and continuously adaptive mean shift algorithm. IEEE Trans Pattern Anal Mach Intell 25(12):1631–1639. doi:10.1109/TPAMI.2003.1251157

    Article  Google Scholar 

  30. Chen X, Yuille AL (2004) Detecting and reading text in natural scenes. In: Proceedings of IEEE computer society conference on computer vision and pattern recognition (CVPR), vol 2, pp 366–373. doi:10.1109/CVPR.2004.1315187

  31. Hanif SM, Prevost L, Negri PA, Paris UU, Syst I, Robotique D (2008) A cascade detector for text detection in natural scene images. In: Proceedings of international conference on pattern recognition (ICPR), pp 1–4. doi:10.1109/ICPR.2008.4761536

  32. Zhou G, Liu Y, Meng Q, Zhang Y (2011) Detecting multilingual text in natural scene. In: IEEE international symposium on access spaces (IEEE-ISAS), pp 116–120. doi:10.1109/ISAS.2011.5960931

  33. Gllavata J, Ewerth R, Freisleben B (2004) Text detection in images based on unsupervised classification of high-frequency wavelet coefficients. In: Proceedings of international conference on pattern recognition (ICPR), vol 1, pp 425–428. doi:10.1109/ICPR.2004.1334146

  34. Saoi T, Goto H (2005) Text detection in color scene images based on unsupervised clustering of multi-channel wavelet features. In: Proceedings of the eight international conference on document analysis and recognition (ICDAR), vol 2, pp 690–694. doi:10.1109/ICDAR.2005.227

  35. Ji R, Xu P, Yao H, Zhang Z, Sun X, Liu T (2008) Directional correlation analysis of local Haar binary pattern for text detection. In: IEEE international conference on multimedia and expo, pp 885–888. doi:10.1109/ICME.2008.4607577

  36. Pan Y-F, Cheng-Lin L, Hou X (2010) Fast scene text localization by learning-based filtering and verification. In: Proceedings IEEE international conference on image processing, pp 2269–2272. doi:10.1109/ICIP.2010.5651862

  37. Sanketi P, Shen H,Coughlan JM (2011) Localizing blurry and low-resolution text in natural images. In: IEEE workshop on applications of computer vision (WACV), pp 503–510. doi:10.1109/WACV.2011.5711546

  38. Angadi SA, Kodabagi MM (2009) A texture based methodology for text region extraction from low resolution natural scene images. Int J Image Process (IJIP) 3(5):229–245

    Google Scholar 

  39. Li C, Ding X, Wu Y (2001) Automatic text location in natural scene images. In: Proceedings of sixth international conference on document analysis and recognition, pp 1069–1073. doi:10.1109/ICDAR.2001.953950

  40. Wang H, Kangas J (2001) Character-like region verification for extracting text in scene images. In: Proceedings of the international conference on document analysis and recognition (ICDAR), pp 957–962. doi:10.1109/ICDAR.2001.953927

  41. Wang K, Kangas JA (2003) Character location in scene images from digital camera. Pattern Recognit 36(10):2287–2299. doi:10.1016/S0031-3203(03)00082-7

    Article  MATH  Google Scholar 

  42. Yi C, Tian Y (2011) Text string detection from natural scenes by structure-based partition and grouping. IEEE Trans Image Process 20(9):2594–2605. doi:10.1109/TIP.2011.2126586

    Article  MathSciNet  Google Scholar 

  43. Tang P, Yuan Y, Fang J, Zhao Y (2015) A novel similar background components connection algorithm for colorful text detection in natural images. In: IEEE international conference on signal processing, communications and computing (ICSPCC), pp 1-5. doi:10.1109/ICSPCC.2015.7338913

  44. Gatos B, Pratikakis I (2005) Text detection in indoor/outdoor scene images. In: Proceedings of first workshop of camera-based document analysis and recognition, pp 127–132

  45. Zongyi L,Sarkar S (2008) Robust outdoor text detection using text intensity and shape features. In: Proceedings of international conference on pattern recognition, pp 1–4. doi:10.1109/ICPR.2008.4761432

  46. Karaoglu S, Fernando B, Tremeau A (2010) A novel algorithm for text detection and localization in natural scene images. In: International conference on digital image computing: techniques and applications (DICTA), pp 635–642. doi:10.1109/DICTA.2010.115

  47. Kim E, Lee S, Kim J (2009) Scene text extraction using focus of mobile camera. In: Proceedings of the international conference on document analysis and recognition (ICDAR), pp 166–170. doi:10.1109/ICDAR.2009.21

  48. Epshtein B, Ofek E, Wexler Y (2010) Detecting text in natural scenes with stroke width transform. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition, pp 2963–2970. doi:10.1109/CVPR.2010.5540041

  49. Liu X, Lu Z, Li J, Jiang W (2014) Detection and segmentation text from natural scene images based on graph model. WSEAS Trans Signal Process 10:124–135

    Google Scholar 

  50. Huang W, Lin Z, Yang J, Wang J (2013) Text localization in natural images using stroke feature transform and text covariance descriptors. In: IEEE international conference on computer vision, pp 1241–1248. doi:10.1109/ICCV.2013.157

  51. Neumann L, Matas J (2011) A method for text localization and recognition in real-world images. In: Kimmel R (ed) Computer vision, ACCV. Springer, Berlin, pp 770–783

    Google Scholar 

  52. Chen H, Tsai SS, Schroth G, Chen DM, Grzeszczuk R, Girod B (2011) Robust text detection in natural images with edge-enhanced maximally stable extremal regions. In: IEEE international conference on image processing, pp 2609–2612. doi:10.1109/ICIP.2011.6116200

  53. Neumann L, Matas J (2011) Text localization in real-world images using efficiently pruned exhaustive search. In: Proceedings of the international conference on document analysis and recognition (ICDAR), pp 687–691. doi:10.1109/ICDAR.2011.144

  54. Yin X-C, Huang K, Hao H-W (2013) Robust text detection in natural scene images. IEEE Trans Pattern Anal Mach Intell 36(5):1–14

    Google Scholar 

  55. He T, Huang W, Qiao Y, Yao J (2015) Text-attentional convolutional neural networks for scene text detection. In: IEEE transactions on image processing, pp 1–10

  56. Neumann L, Matas J (2012) Real-time scene text localization and recognition. In: IEEE conference on computer vision and pattern recognition CVPR, pp 3538–3545. doi:10.1109/CVPR.2012.6248097

  57. Sun L, Huo Q, Jia W, Chen K (2015) A robust approach for text detection from natural scene images. Pattern Recognit 48(9):2906–2920. doi:10.1016/j.patcog.2015.04.002

    Article  Google Scholar 

  58. Gomez L, Karatzas D (2013) Multi-script text extraction from natural scenes. In: Proceedings of the international conference on document analysis and recognition (ICDAR), pp 467–471. doi:10.1109/ICDAR.2013.100

  59. Zhang H, Liu C, Yang C, Ding X, Wang K (2011) An improved scene text extraction method using conditional random field and optical character recognition. In: International conference on document analysis and recognition, pp 708–712. doi:10.1109/ICDAR.2011.148

  60. Zhou G, Jia Z, Liu Y, Xu L (2015) Scene text detection method based on the hierarchical model. IET Comput Vis 9(4):500–510. doi:10.1049/iet-cvi.2014.0297

    Article  Google Scholar 

  61. Kumar M, Lee G (2010) Automatic text location from complex natural scene images. In: International conference on computer and automation engineering (ICCAE), vol 3, pp 594–597. doi:10.1109/ICCAE.2010.5451808

  62. Lu S, Chen T, Tian S, Lim J-H, Tan C-L (2015) Scene text extraction based on edges and support vector regression. Int J Doc Anal Recognit (IJDAR) 18(2):125–135. doi:10.1007/s10032-015-0237-z

    Article  Google Scholar 

  63. Zhu A, Wang G, Dong Y (2015) Detecting natural scenes text via auto image partition, two-stage grouping and two-layer classification. Pattern Recognit Lett 67:153–162. doi:10.1016/j.patrec.2015.06.009

    Article  Google Scholar 

  64. Yu C, Song Y, Zhang Y (2016) Scene text localization using edge analysis and feature pool. Neurocomputing 175:652–661. doi:10.1016/j.neucom.2015.10.105

    Article  Google Scholar 

  65. Liu Y, Member S, Goto S, Ikenaga T (2006) A contour-based robust algorithm for text detection in color. IEICE Trans Inf Syst 89(3):1221–1230. doi:10.1093/ietisy/e89

    Article  Google Scholar 

  66. Pan Y-F, Hou X, Liu C-L (2009) Text localization in natural scene images based on conditional random field. In: International conference on document analysis and recognition, pp 6–10. doi:10.1109/ICDAR.2009.97

  67. Petter M, Fragoso V, Turk M, Baur C (2011) Automatic text detection for mobile augmented reality translation. In: IEEE international conference on computer vision workshops (ICCV), pp 48–55. doi:10.1109/ICCVW.2011.6130221

  68. Pan Y-F, Hou X, Liu C-L (2011) A hybrid approach to detect and localize texts in natural scene images. IEEE Trans Image Process 20(3):800–813. doi:10.1109/TIP.2010.2070803

    Article  MathSciNet  Google Scholar 

  69. Shi C, Wang C, Xiao B, Zhang Y, Gao S (2013) Scene text detection using graph model built upon maximally stable extremal regions. Pattern Recognit Lett 34(2):107–116. doi:10.1016/j.patrec.2012.09.019

    Article  Google Scholar 

  70. Huang W, Qiao Y, Tang X (2014) Robust scene text detection with convolution neural network induced MSER trees. In: Fleet D (ed) Computer vision–ECCV. Springer International Publishing, New York, pp 497–511

    Google Scholar 

  71. Wang R, Sang N, Gao C (2015) Text detection approach based on confidence map and context information. Neurocomputing 157:153–165. doi:10.1016/j.neucom.2015.01.023

    Article  Google Scholar 

  72. Zhao Z, Fang C, Lin Z, Wu Y (2015) A robust hybrid method for text detection in natural scenes by learning-based partial differential equations. Neurocomputing 168:23–34. doi:10.1016/j.neucom.2015.06.019

    Article  Google Scholar 

  73. Zhong Y, Karu K, Jain AK (1995) Locating text in complex color images. Proceedings of the third international conference on document analysis and recognition 1:146–149. doi:10.1109/ICDAR.1995.598963

    Article  Google Scholar 

  74. Wang K, Serge B, (2010) Word spotting in the wild ECCV, (2010) In: Daniilidis K (ed) Computer vision–ECCV. Springer, Berlin, pp 591–604

  75. Silapachote P, Weinman J, Hanson A, Weiss R, Mattar MA (2005) Automatic sign detection and recognition in natural scenes. In: IEEE computer society conference on computer vision and pattern recognition (CVPR), pp 27–34. doi:10.1109/CVPR.2005.417

  76. Nguyen MH, Choi H, Lee G (2015) Text detection in scene images based on feature detection and tensor voting. In: Proceedings of the international conference on ubiquitous information management and communication (IMCOM), pp 1–6. doi:10.1145/2701126.2701170

  77. Lahari KS, Haritha M, Krishna P (2015) Text detection from natural image using MSER and BOW. Int J Emerg Eng Res Technol 3(11):152–156

    Google Scholar 

  78. Huang X, Shen T, Wang R, Gao C (2015) Text detection and recognition in natural scene images. In: International conference on communication and signal processing (ICCSP), pp 1068–1072. doi:10.1109/ICCSP.2014.6950011

  79. Greenhalgh J, Mirmehdi M (2014) Recognizing text-based traffic signs. IEEE Trans Intell Transp Syst 16(3):1360–1369. doi:10.1109/TITS.2014.2363167

    Article  Google Scholar 

  80. Talukder KH, Mallick T (2014) Connected component based approach for text extraction from color image. In: International conference on computer and information technology (ICCIT), pp 204–209. doi:10.1109/ICCITechn.2014.7073114

  81. Gonzalez A, Bergasa LM (2013) A text reading algorithm for natural images. Image Vis Comput 31(3):255–274. doi:10.1016/j.imavis.2013.01.003

    Article  Google Scholar 

  82. Novikova T, Barinova O, Kohli P, Lempitsky V (2012) Large-lexicon attribute-consistent text recognition in natural images. Proc Eur Conf Comput Vis ECCV 6:752–765. doi:10.1007/978-3-642-33783-3_54

    Google Scholar 

  83. Zhao M, Li S, Kwok J (2010) Text detection in images using sparse representation with discriminative dictionaries. Image Vis Comput 28(12):1590–1599. doi:10.1016/j.imavis.2010.04.002

    Article  Google Scholar 

  84. Lu S, Tan CL (2006) Camera text recognition based on perspective invariants. In: International conference on pattern recognition (ICPR), vol 2, pp 2–5. doi:10.1109/ICPR.2006.351

  85. Qi Z, Kimachi M (2005) Using adaboost to detect and segment characters from natural scenes. In: Proceedings of CBDAR, pp 52–59

  86. Ezaki N, Bulacu M, Schomaker L (2004) Text detection from natural scene images: towards a system for visually impaired persons. Proceedings of the international conference on pattern recognition 2:2–5. doi:10.1109/ICPR.2004.1334351

    Google Scholar 

  87. Minetto R, Thome N, Cord M, Stolfi J, Precioso F, Guyomard J, Leite N (2011) Text detection and recognition in urban scenes. In: IEEE international conference on computer vision workshops (ICCV) (1), pp 227–234. doi:10.1109/ICCVW.2011.6130247

  88. Kong LY (2012) Detection and recognition for text in traffic sign images. In: Proceedings of the international conference on industrial control and electronics engineering, pp 2043–2045. doi:10.1109/ICICEE.2012.543

  89. Otsu N (1979) A threshold selection method from gray-level histograms. IEEE Trans Syst Man Cybern (SMC) 9(1):62–66. doi:10.1109/TSMC.1979.4310076

    Article  Google Scholar 

  90. Bawa RK, Sethi GK (2012) A review on binarization algorithms for camera based natural scene images. In: Proceedings of the international conference on advances in computing, communications and informatics (ICACCI), pp 873–878. doi:10.1145/2345396.2345537

  91. Fernando B, Etienne S, Etienne S, Tremeau A (2011) Extreme value theory based text binarization in documents and natural scenes. In: International conference on machine vision (ICMV), pp 144–151

  92. Sauvola J, Pietika M (2000) Adaptive document image binarization. Pattern Recognit 33(2):225–236. doi:10.1016/S0031-3203(99)00055-2

    Article  Google Scholar 

  93. Kasar T, Kumar J, Ramakrishnan AG (2007) Font and background color independent text binarization. Camera based document analysis and recognition international workshop ICDAR 2(1):3–9

    Google Scholar 

  94. Song Y, Liu A, Pang L, Lin S, Zhang Y, Tang S (2008) A novel image text extraction method based on K-means clustering. In: IEEE international conference on computer and information science (ICIS), pp 185–190. doi:10.1109/ICIS.2008.31

  95. Gosselin B (2007) Color text extraction with selective metric-based clustering, vol 107, pp 97–107. doi:10.1016/j.cviu.2006.11.010

  96. Mishra A, Alahari K, Jawahar CV (2011) An MRF model for binarization of natural scene text. In: Proceedings of the international conference on document analysis and recognition ICDAR, pp 11–16. doi:10.1109/ICDAR.2011.12

  97. Chaudhuri B, Pal U (1998) A complete printed Bangla OCR system. Pattern Recognit 31(5):531–549. doi:10.1016/S0031-3203(97)00078-2

    Article  Google Scholar 

  98. Jayadevan R, Kolhe SR, Patil PM, Pal U (2011) Offline recognition of Devanagari script: a survey. IEEE Trans Syst Man Cybern C Appl Rev 41(6):782–796. doi:10.1109/TSMCC.2010.2095841

    Article  Google Scholar 

  99. Gaikwad-Patil S (2014) A neural network based handwritten character recognition for Marathi script. Dissertation, Shivaji University

  100. Mittal R (2015) Detection and segmentation of text in handwritten Hindi documents. Dissertation, Thapar University

  101. Desai AA (2010) Gujarati handwritten numeral optical character reorganization through neural network. Pattern Recognit 43(7):2582–2589. doi:10.1016/j.patcog.2010.01.008

    Article  MATH  Google Scholar 

  102. Kulkarni SA, Borde PR, Manza RL, Yannawar P (2015) Review on recent advances in automatic handwritten MODI script recognition. Int J Comput Appl 115(19):5–12. doi:10.5120/20257-2636

    Google Scholar 

  103. Gupta V, Rathna GN, Ramakrishnan KR (2008) Automatic Kannada text extraction from camera captured images. MSDES

  104. Bhattacharya U, Parui SK, Mondal S (2009) Devanagari and Bangla text extraction from natural scene images. In: Proceedings of the international conference on document analysis and recognition, pp 171–175. doi:10.1109/ICDAR.2009.178

  105. Angadi SA (2010) Text region extraction from low resolution natural scene images using texture features. In: IEEE Int Adv Comput Conf, pp 121–128

  106. Raj H, Ghosh R (2014) Devanagari text extraction from natural scene images. In: International conference on advances in computing, communications and informatics (ICACCI), pp 513–517

  107. Angadi SA, Kodabagi M (2014) A robust segmentation technique for line, word and character extraction from Kannada text in low resolution display board images. In: International conference on signal and image processing, pp 42–49. doi:10.1109/ICSIP.2014.11

  108. Kaur TP, Garg N (2015) A survey on machine printed Gurmukhi text recognition. Int J Eng Res Technol (IJERT) 4(10):360–366

    Google Scholar 

  109. Lehal GS (2001) Text segmentation of machine printed Gurmukhi script. Doc Recognit Retr 4307:223–231. doi:10.1117/12.410840

    Google Scholar 

  110. Lehal GS, Singh C (1999) Feature extraction and classification for OCR of Gurmukhi script. Vivek 12:2–12

    Google Scholar 

  111. Lehal GS, Singh C (2000) A Gurmukhi script recognition system. International conference on pattern recognition 2:557–560. doi:10.1109/ICPR.2000.906135

    Article  MATH  Google Scholar 

  112. Lehal GS (2009) A complete machine printed Gurmukhi OCR system. In: Govindaraju V (ed) Guide to OCR for Indic scripts. Springer, London, pp 43–71

    Chapter  Google Scholar 

  113. Jindal MK, Lehal GS, Sharma RK (2006) Segmentation problems and solutions in printed degraded Gurmukhi script. Int J Signal Process 2(9):258–267

  114. Jindal MK (2007) A study of different kinds of degradation in printed Gurmukhi script. In: International conference on computing: theory and applications, pp 538–544. doi:10.1109/ICCTA.2007.19

  115. Jindal MK, Sharma RK, Lehal GS (2008) Structural features for recognizing degraded printed Gurmukhi script. In: Fifth international conference on information technology: new generations, pp 668–673. doi:10.1109/ITNG.2008.223

  116. Singh G (2009) Optical character recognition of Gurmukhi script using multiple classifiers. In: Proceedings of the international workshop on multilingual OCR. doi:10.1145/1577802.1577810

  117. Singla G, Kumar P (2013) Extract the Punjabi word from machine printed document images. Int J Eng Res Appl 3(5):343–348

    Google Scholar 

  118. Arora S, Sharma D, Arora S (2014) Recognition of Gurmukhi text from sign board images captured from mobile camera. Int J Inf Comput Technol 4(17):1839–1845

    Google Scholar 

  119. Sharma DV, Singh S (2014) Gurmukhi text detection and localization in natural scene images. Int J Adv Res Comput Sci Softw Eng 4(5):442–458

    Google Scholar 

  120. Lucas SM, Panaretos A, Sosa L, Tang A, Wong S, Young R (2003) ICDAR 2003 robust reading competitions. In: Proceedings of international conference on document analysis and recognition, pp 682-687. doi:10.1109/ICDAR.2003.1227749

  121. Lucas SM (2005) ICDAR 2005 text locating competition results. International conference on document analysis and recognition 1:80–84. doi:10.1109/ICDAR.2005.231

    Google Scholar 

  122. Karatzas D, Robles Mestre S, Mas J, Nourbakhsh F, Roy P (2011) ICDAR 2011 robust reading competition challenge 1? Reading text in born-digital images (Web and Email). In: International conference on document analysis and recognition, pp 1485–1490. doi:10.1109/ICDAR.2011.295

  123. Shahab A, Shafait F, Dengel A (2011) ICDAR 2011 robust reading competition challenge 2: reading text in scene images. In: International conference on document analysis and recognition, pp 1491–1496. doi:10.1109/ICDAR.2011.296

  124. Nagy R, Dicker A, Meyer-wegener K (2011) NEOCR? A configurable dataset for natural image text recognition. Proceedings of international workshop on camera-based document analysis and recognition 6:150–163. doi:10.1007/978-3-642-29364-1_12

    Google Scholar 

  125. Lee S, Cho MS, Jung K, Kim JH (2010) Scene text extraction with edge constraint and text collinearity. In: Proceedings of international conference on pattern recognition, pp 3983–3986. doi:10.1109/ICPR.2010.969

  126. Netzer Y, Wang T (2011) Reading digits in natural images with unsupervised feature learning. In: Proceedings of NIPS workshop on deep learning and unsupervised feature learning, pp 1–9

  127. Mishra A, Alahari K, Jawahar C (2012) Scene text recognition using higher order language priors. Proceedings of the British machine vision conference 127:1–11. doi:10.5244/C.26.127

    Google Scholar 

  128. De Campos TE, Babu BR, Varma M (2013) Character recognition in natural images. Document analysis and recognition, pp 1–8

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Amandeep Kaur.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kaur, A., Dhir, R. & Lehal, G.S. A survey on camera-captured scene text detection and extraction: towards Gurmukhi script. Int J Multimed Info Retr 6, 115–142 (2017). https://doi.org/10.1007/s13735-016-0116-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13735-016-0116-5

Keywords

Navigation