A survey on camera-captured scene text detection and extraction: towards Gurmukhi script

Kaur, Amandeep; Dhir, Renu; Lehal, Gurpreet Singh

doi:10.1007/s13735-016-0116-5

A survey on camera-captured scene text detection and extraction: towards Gurmukhi script

Trends and Surveys
Published: 19 January 2017

Volume 6, pages 115–142, (2017)
Cite this article

International Journal of Multimedia Information Retrieval Aims and scope Submit manuscript

Amandeep Kaur¹,
Renu Dhir¹ &
Gurpreet Singh Lehal²

687 Accesses
10 Citations
Explore all metrics

Abstract

Owing to technological advancements and new inventions, the world is transforming into a digital world where cameras have replaced the scanners for converting the hard copy documents into their soft versions. And, nowadays, these gadgets are used to capture the images of any text, present anywhere in real-life scenes, and process it, hence making the area of camera-captured scene text extraction a key area of interest for researchers. This paper discusses the certain background concepts and provides a categorization of the major challenges faced while extracting the camera-captured scene text, and scope and applications of the scene text extraction. Moreover, it presents a comprehensive literature survey of different techniques and methods adopted by researchers to extract the text from the scene images captured via cameras with a special emphasis on Gurmukhi script. For better understanding, this paper classifies various techniques by three different parameters and discusses the available datasets along with performance metrics. To summarize, this survey provides a fundamental study and review of latest happenings in the area of camera-captured scene text extraction and offers researchers an overview of the work done so far along with certain future directions.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The First Robust Mongolian Text Reading Dataset CSIMU-MTR

Downtown Osaka Scene Text Dataset

An Evaluation and Validation of Contemporary Approaches in Scene Text Extraction and Recognition

Notes

Google Goggles available at https://play.google.com/store/apps.
The voice for android available at http://www.artificialvision.com/android.htm.
Automatic book scanners available at http://www.kirtas.com/kabisIII.php.
Available at http://www.isical.ac.in/~ujjwal/download/database.html.
Available at http://yann.lecun.com/exdb/mnist/.
Available at: http://prir.ustb.edu.cn/TexStar/MOMV-text-detection/: USTB-SV.

References

Chen D, Luettin J, Shearer K (2000) A survey of text detection and recognition in images and videos. Technical Report, Institute Dalle Molled Intelligence Perceptive Research Report IDIAP, pp 0–38
Jung K, Kim KI, Jain AK (2004) Text information extraction in images and video: a survey. Pattern Recognit 37(5):977–997. doi:10.1016/j.patcog.2003.10.012
Article Google Scholar
Liang J, Doermann D, Li H (2005) Camera-based analysis of text and documents: a survey. Int J Doc Anal Recognit 7(2–3):84–104. doi:10.1007/s10032-004-0138-z
Article Google Scholar
Zhu Y, Yao C, Bai X (2015) Scene text detection and recognition: recent advances and future trends. Front Comput Sci. doi:10.1007/s11704-015-4488-0
Google Scholar
Ye Q, Doermann D (2015) Text detection and recognition in imagery? A survey. IEEE Trans Pattern Anal Mach Intell 8828(7):1–20. doi:10.1109/TPAMI.2014.2366765
Google Scholar
Zhang H, Zhao K, Song YZ, Guo J (2013) Text extraction from natural scene image: a survey. Neurocomputing 122:310–323. doi:10.1016/j.neucom.2013.05.037
Article Google Scholar
Karatzas D, Shafait F, Uchida S, Iwamura M, Gomez, L (2013) ICDAR 2013 robust reading competition. In: Proceedings of international conference on document analysis and recognition, pp 1484–1493. doi:10.1109/ICDAR.2013.221
Karatzas D, Gomez-Bigorda L, Nicolaou A et al (2015) ICDAR 2015 competition on robust reading. In: Proceedings of international conference on document analysis and recognition (ICDAR), pp 1156–1160. doi:10.1109/ICDAR.2015.7333942
Kasar T (2011) Camera-captured document image analysis. Dissertation, Indian Institute of Science, Bangalore
Simon C, Williem Park IK (2015) Correcting geometric and photometric distortion of document images on a smartphone. J Electron Imaging 24(1):1–14. doi:10.1117/1.JEI.24.1.013038
Article Google Scholar
Simon C, Cho S, Park I K (2014) Fast and robust perspective rectification of document images on a smartphone. In: IEEE conference on computer vision and pattern recognition workshops, pp 196–197. doi:10.1109/CVPRW.2014.37
Phan TQ, Shivakumara P, Tian S, Tan CL (2013) Recognizing text with perspective distortion in natural scenes. In: IEEE international conference on computer vision (ICCV), pp 569–576. doi:10.1109/ICCV.2013.76
Merino-Gracia C, Mirmehdi M, Sigut J, González-Mora JL (2013) Fast perspective recovery of text in natural scenes. Image Vis Comput 31(10):714–724. doi:10.1016/j.imavis.2013.07.002
Article Google Scholar
Yao C, Bai X, Liu W, Ma Y, Tu Z (2012) Detecting texts of arbitrary orientations in natural images. IEEE Conf Comput Vis Pattern Recognit (CVPR) 8:1083–1090. doi:10.1109/CVPR.2012.6247787
Google Scholar
Yin XC, Pei W-Y, Zhang J, Hao H-W (2015) Multi-orientation scene text detection with adaptive clustering. IEEE Trans Pattern Anal Mach Intell 37(9):1930–1937. doi:10.1109/TPAMI.2014.2388210
Article Google Scholar
Black M, Berard F, Jepson A, Newman W, Saund E, Socher G, Taylor M (1998) The digital office: overview. In: Proceedings of the aaai spring symposium on intelligent environments, pp 1–6
Arth C, Limberger F, Bischof H (2007) Real-time license plate recognition on an embedded DSP-platform. In: IEEE conference on computer vision and pattern recognition, pp 1–8. doi:10.1109/CVPR.2007.38341
Sermanet P, Chintala S, LeCun Y (2012) Convolutional neural networks applied to house numbers digit classification. In: Proceedings of international conference on pattern recognition ICPR, pp 10–13
Lee C-M, Kankanhalli A (1995) Automatic extraction of characters in complex scene images. Int J Pattern Recognit Artif Intell 9(1):67–82. doi:10.1142/S0218001495000043
Article Google Scholar
Liu X, Samarabandu J (2005) An edge-based text region extraction algorithm for indoor mobile robot navigation. IEEE Inte Conf Mech Autom 2(7):2008–2015. doi:10.1109/ICMA.2005.1626635
Google Scholar
Ye Q, Huang Q, Gao W, Zhao D (2005) Fast and robust text detection in images and video frames. Image Vis Comput 23(6):565–576. doi:10.1016/j.imavis.2005.01.004
Article Google Scholar
Frames V (2003) Robust text detection algorithm in images and video frames. ICICS PCM 2:802–806. doi:10.1109/ICICS.2003.1292567
Google Scholar
Jain AK, Yu B (1998) Automatic text location in images and video frames. Proceedings of fourteenth international conference on pattern recognition 2:3–5. doi:10.1109/ICPR.1998.711990
Google Scholar
Lee CW, Jung K, Kim HJ (2003) Automatic text detection and removal in video sequences. Pattern Recognit Lett 24(15):2607–2623. doi:10.1016/S0167-8655(03)00105-3
Article Google Scholar
Tehsin S, Masood A, Kausar S, Javed Y (2015) A caption text detection method from images/videos for efficient indexing and retrieval of multimedia data. Int J Pattern Recognit Artif Intell 29(1):1–23. doi:10.1142/S0218001415550034
Article Google Scholar
Shivakumara P, Phan TQ, Tan CL (2011) A Laplacian approach to multi-oriented text detection in video. IEEE Trans Pattern Anal Mach Intell 33(2):412–419. doi:10.1109/TPAMI.2010.166
Article Google Scholar
Lienhart R, Wernicke A (2002) Localizing and segmenting text in images and videos. IEEE Trans Circuits Syst Video Technol 12(4):256–268. doi:10.1109/76.999203
Article Google Scholar
Ohya J, Shio A, Akamatsu S (1994) Recognizing characters in scene images. IEEE Trans Pattern Anal Mach Intell 16(2):214–220. doi:10.1109/34.273729
Article Google Scholar
Kim KI, Jung K, Kim JH (2003) Texture-based approach for text detection in images using support vector machines and continuously adaptive mean shift algorithm. IEEE Trans Pattern Anal Mach Intell 25(12):1631–1639. doi:10.1109/TPAMI.2003.1251157
Article Google Scholar
Chen X, Yuille AL (2004) Detecting and reading text in natural scenes. In: Proceedings of IEEE computer society conference on computer vision and pattern recognition (CVPR), vol 2, pp 366–373. doi:10.1109/CVPR.2004.1315187
Hanif SM, Prevost L, Negri PA, Paris UU, Syst I, Robotique D (2008) A cascade detector for text detection in natural scene images. In: Proceedings of international conference on pattern recognition (ICPR), pp 1–4. doi:10.1109/ICPR.2008.4761536
Zhou G, Liu Y, Meng Q, Zhang Y (2011) Detecting multilingual text in natural scene. In: IEEE international symposium on access spaces (IEEE-ISAS), pp 116–120. doi:10.1109/ISAS.2011.5960931
Gllavata J, Ewerth R, Freisleben B (2004) Text detection in images based on unsupervised classification of high-frequency wavelet coefficients. In: Proceedings of international conference on pattern recognition (ICPR), vol 1, pp 425–428. doi:10.1109/ICPR.2004.1334146
Saoi T, Goto H (2005) Text detection in color scene images based on unsupervised clustering of multi-channel wavelet features. In: Proceedings of the eight international conference on document analysis and recognition (ICDAR), vol 2, pp 690–694. doi:10.1109/ICDAR.2005.227
Ji R, Xu P, Yao H, Zhang Z, Sun X, Liu T (2008) Directional correlation analysis of local Haar binary pattern for text detection. In: IEEE international conference on multimedia and expo, pp 885–888. doi:10.1109/ICME.2008.4607577
Pan Y-F, Cheng-Lin L, Hou X (2010) Fast scene text localization by learning-based filtering and verification. In: Proceedings IEEE international conference on image processing, pp 2269–2272. doi:10.1109/ICIP.2010.5651862
Sanketi P, Shen H,Coughlan JM (2011) Localizing blurry and low-resolution text in natural images. In: IEEE workshop on applications of computer vision (WACV), pp 503–510. doi:10.1109/WACV.2011.5711546
Angadi SA, Kodabagi MM (2009) A texture based methodology for text region extraction from low resolution natural scene images. Int J Image Process (IJIP) 3(5):229–245
Google Scholar
Li C, Ding X, Wu Y (2001) Automatic text location in natural scene images. In: Proceedings of sixth international conference on document analysis and recognition, pp 1069–1073. doi:10.1109/ICDAR.2001.953950
Wang H, Kangas J (2001) Character-like region verification for extracting text in scene images. In: Proceedings of the international conference on document analysis and recognition (ICDAR), pp 957–962. doi:10.1109/ICDAR.2001.953927
Wang K, Kangas JA (2003) Character location in scene images from digital camera. Pattern Recognit 36(10):2287–2299. doi:10.1016/S0031-3203(03)00082-7
Article MATH Google Scholar
Yi C, Tian Y (2011) Text string detection from natural scenes by structure-based partition and grouping. IEEE Trans Image Process 20(9):2594–2605. doi:10.1109/TIP.2011.2126586
Article MathSciNet Google Scholar
Tang P, Yuan Y, Fang J, Zhao Y (2015) A novel similar background components connection algorithm for colorful text detection in natural images. In: IEEE international conference on signal processing, communications and computing (ICSPCC), pp 1-5. doi:10.1109/ICSPCC.2015.7338913
Gatos B, Pratikakis I (2005) Text detection in indoor/outdoor scene images. In: Proceedings of first workshop of camera-based document analysis and recognition, pp 127–132
Zongyi L,Sarkar S (2008) Robust outdoor text detection using text intensity and shape features. In: Proceedings of international conference on pattern recognition, pp 1–4. doi:10.1109/ICPR.2008.4761432
Karaoglu S, Fernando B, Tremeau A (2010) A novel algorithm for text detection and localization in natural scene images. In: International conference on digital image computing: techniques and applications (DICTA), pp 635–642. doi:10.1109/DICTA.2010.115
Kim E, Lee S, Kim J (2009) Scene text extraction using focus of mobile camera. In: Proceedings of the international conference on document analysis and recognition (ICDAR), pp 166–170. doi:10.1109/ICDAR.2009.21
Epshtein B, Ofek E, Wexler Y (2010) Detecting text in natural scenes with stroke width transform. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition, pp 2963–2970. doi:10.1109/CVPR.2010.5540041
Liu X, Lu Z, Li J, Jiang W (2014) Detection and segmentation text from natural scene images based on graph model. WSEAS Trans Signal Process 10:124–135
Google Scholar
Huang W, Lin Z, Yang J, Wang J (2013) Text localization in natural images using stroke feature transform and text covariance descriptors. In: IEEE international conference on computer vision, pp 1241–1248. doi:10.1109/ICCV.2013.157
Neumann L, Matas J (2011) A method for text localization and recognition in real-world images. In: Kimmel R (ed) Computer vision, ACCV. Springer, Berlin, pp 770–783
Google Scholar
Chen H, Tsai SS, Schroth G, Chen DM, Grzeszczuk R, Girod B (2011) Robust text detection in natural images with edge-enhanced maximally stable extremal regions. In: IEEE international conference on image processing, pp 2609–2612. doi:10.1109/ICIP.2011.6116200
Neumann L, Matas J (2011) Text localization in real-world images using efficiently pruned exhaustive search. In: Proceedings of the international conference on document analysis and recognition (ICDAR), pp 687–691. doi:10.1109/ICDAR.2011.144
Yin X-C, Huang K, Hao H-W (2013) Robust text detection in natural scene images. IEEE Trans Pattern Anal Mach Intell 36(5):1–14
Google Scholar
He T, Huang W, Qiao Y, Yao J (2015) Text-attentional convolutional neural networks for scene text detection. In: IEEE transactions on image processing, pp 1–10
Neumann L, Matas J (2012) Real-time scene text localization and recognition. In: IEEE conference on computer vision and pattern recognition CVPR, pp 3538–3545. doi:10.1109/CVPR.2012.6248097
Sun L, Huo Q, Jia W, Chen K (2015) A robust approach for text detection from natural scene images. Pattern Recognit 48(9):2906–2920. doi:10.1016/j.patcog.2015.04.002
Article Google Scholar
Gomez L, Karatzas D (2013) Multi-script text extraction from natural scenes. In: Proceedings of the international conference on document analysis and recognition (ICDAR), pp 467–471. doi:10.1109/ICDAR.2013.100
Zhang H, Liu C, Yang C, Ding X, Wang K (2011) An improved scene text extraction method using conditional random field and optical character recognition. In: International conference on document analysis and recognition, pp 708–712. doi:10.1109/ICDAR.2011.148
Zhou G, Jia Z, Liu Y, Xu L (2015) Scene text detection method based on the hierarchical model. IET Comput Vis 9(4):500–510. doi:10.1049/iet-cvi.2014.0297
Article Google Scholar
Kumar M, Lee G (2010) Automatic text location from complex natural scene images. In: International conference on computer and automation engineering (ICCAE), vol 3, pp 594–597. doi:10.1109/ICCAE.2010.5451808
Lu S, Chen T, Tian S, Lim J-H, Tan C-L (2015) Scene text extraction based on edges and support vector regression. Int J Doc Anal Recognit (IJDAR) 18(2):125–135. doi:10.1007/s10032-015-0237-z
Article Google Scholar
Zhu A, Wang G, Dong Y (2015) Detecting natural scenes text via auto image partition, two-stage grouping and two-layer classification. Pattern Recognit Lett 67:153–162. doi:10.1016/j.patrec.2015.06.009
Article Google Scholar
Yu C, Song Y, Zhang Y (2016) Scene text localization using edge analysis and feature pool. Neurocomputing 175:652–661. doi:10.1016/j.neucom.2015.10.105
Article Google Scholar
Liu Y, Member S, Goto S, Ikenaga T (2006) A contour-based robust algorithm for text detection in color. IEICE Trans Inf Syst 89(3):1221–1230. doi:10.1093/ietisy/e89
Article Google Scholar
Pan Y-F, Hou X, Liu C-L (2009) Text localization in natural scene images based on conditional random field. In: International conference on document analysis and recognition, pp 6–10. doi:10.1109/ICDAR.2009.97
Petter M, Fragoso V, Turk M, Baur C (2011) Automatic text detection for mobile augmented reality translation. In: IEEE international conference on computer vision workshops (ICCV), pp 48–55. doi:10.1109/ICCVW.2011.6130221
Pan Y-F, Hou X, Liu C-L (2011) A hybrid approach to detect and localize texts in natural scene images. IEEE Trans Image Process 20(3):800–813. doi:10.1109/TIP.2010.2070803
Article MathSciNet Google Scholar
Shi C, Wang C, Xiao B, Zhang Y, Gao S (2013) Scene text detection using graph model built upon maximally stable extremal regions. Pattern Recognit Lett 34(2):107–116. doi:10.1016/j.patrec.2012.09.019
Article Google Scholar
Huang W, Qiao Y, Tang X (2014) Robust scene text detection with convolution neural network induced MSER trees. In: Fleet D (ed) Computer vision–ECCV. Springer International Publishing, New York, pp 497–511
Google Scholar
Wang R, Sang N, Gao C (2015) Text detection approach based on confidence map and context information. Neurocomputing 157:153–165. doi:10.1016/j.neucom.2015.01.023
Article Google Scholar
Zhao Z, Fang C, Lin Z, Wu Y (2015) A robust hybrid method for text detection in natural scenes by learning-based partial differential equations. Neurocomputing 168:23–34. doi:10.1016/j.neucom.2015.06.019
Article Google Scholar
Zhong Y, Karu K, Jain AK (1995) Locating text in complex color images. Proceedings of the third international conference on document analysis and recognition 1:146–149. doi:10.1109/ICDAR.1995.598963
Article Google Scholar
Wang K, Serge B, (2010) Word spotting in the wild ECCV, (2010) In: Daniilidis K (ed) Computer vision–ECCV. Springer, Berlin, pp 591–604
Silapachote P, Weinman J, Hanson A, Weiss R, Mattar MA (2005) Automatic sign detection and recognition in natural scenes. In: IEEE computer society conference on computer vision and pattern recognition (CVPR), pp 27–34. doi:10.1109/CVPR.2005.417
Nguyen MH, Choi H, Lee G (2015) Text detection in scene images based on feature detection and tensor voting. In: Proceedings of the international conference on ubiquitous information management and communication (IMCOM), pp 1–6. doi:10.1145/2701126.2701170
Lahari KS, Haritha M, Krishna P (2015) Text detection from natural image using MSER and BOW. Int J Emerg Eng Res Technol 3(11):152–156
Google Scholar
Huang X, Shen T, Wang R, Gao C (2015) Text detection and recognition in natural scene images. In: International conference on communication and signal processing (ICCSP), pp 1068–1072. doi:10.1109/ICCSP.2014.6950011
Greenhalgh J, Mirmehdi M (2014) Recognizing text-based traffic signs. IEEE Trans Intell Transp Syst 16(3):1360–1369. doi:10.1109/TITS.2014.2363167
Article Google Scholar
Talukder KH, Mallick T (2014) Connected component based approach for text extraction from color image. In: International conference on computer and information technology (ICCIT), pp 204–209. doi:10.1109/ICCITechn.2014.7073114
Gonzalez A, Bergasa LM (2013) A text reading algorithm for natural images. Image Vis Comput 31(3):255–274. doi:10.1016/j.imavis.2013.01.003
Article Google Scholar
Novikova T, Barinova O, Kohli P, Lempitsky V (2012) Large-lexicon attribute-consistent text recognition in natural images. Proc Eur Conf Comput Vis ECCV 6:752–765. doi:10.1007/978-3-642-33783-3_54
Google Scholar
Zhao M, Li S, Kwok J (2010) Text detection in images using sparse representation with discriminative dictionaries. Image Vis Comput 28(12):1590–1599. doi:10.1016/j.imavis.2010.04.002
Article Google Scholar
Lu S, Tan CL (2006) Camera text recognition based on perspective invariants. In: International conference on pattern recognition (ICPR), vol 2, pp 2–5. doi:10.1109/ICPR.2006.351
Qi Z, Kimachi M (2005) Using adaboost to detect and segment characters from natural scenes. In: Proceedings of CBDAR, pp 52–59
Ezaki N, Bulacu M, Schomaker L (2004) Text detection from natural scene images: towards a system for visually impaired persons. Proceedings of the international conference on pattern recognition 2:2–5. doi:10.1109/ICPR.2004.1334351
Google Scholar
Minetto R, Thome N, Cord M, Stolfi J, Precioso F, Guyomard J, Leite N (2011) Text detection and recognition in urban scenes. In: IEEE international conference on computer vision workshops (ICCV) (1), pp 227–234. doi:10.1109/ICCVW.2011.6130247
Kong LY (2012) Detection and recognition for text in traffic sign images. In: Proceedings of the international conference on industrial control and electronics engineering, pp 2043–2045. doi:10.1109/ICICEE.2012.543
Otsu N (1979) A threshold selection method from gray-level histograms. IEEE Trans Syst Man Cybern (SMC) 9(1):62–66. doi:10.1109/TSMC.1979.4310076
Article Google Scholar
Bawa RK, Sethi GK (2012) A review on binarization algorithms for camera based natural scene images. In: Proceedings of the international conference on advances in computing, communications and informatics (ICACCI), pp 873–878. doi:10.1145/2345396.2345537
Fernando B, Etienne S, Etienne S, Tremeau A (2011) Extreme value theory based text binarization in documents and natural scenes. In: International conference on machine vision (ICMV), pp 144–151
Sauvola J, Pietika M (2000) Adaptive document image binarization. Pattern Recognit 33(2):225–236. doi:10.1016/S0031-3203(99)00055-2
Article Google Scholar
Kasar T, Kumar J, Ramakrishnan AG (2007) Font and background color independent text binarization. Camera based document analysis and recognition international workshop ICDAR 2(1):3–9
Google Scholar
Song Y, Liu A, Pang L, Lin S, Zhang Y, Tang S (2008) A novel image text extraction method based on K-means clustering. In: IEEE international conference on computer and information science (ICIS), pp 185–190. doi:10.1109/ICIS.2008.31
Gosselin B (2007) Color text extraction with selective metric-based clustering, vol 107, pp 97–107. doi:10.1016/j.cviu.2006.11.010
Mishra A, Alahari K, Jawahar CV (2011) An MRF model for binarization of natural scene text. In: Proceedings of the international conference on document analysis and recognition ICDAR, pp 11–16. doi:10.1109/ICDAR.2011.12
Chaudhuri B, Pal U (1998) A complete printed Bangla OCR system. Pattern Recognit 31(5):531–549. doi:10.1016/S0031-3203(97)00078-2
Article Google Scholar
Jayadevan R, Kolhe SR, Patil PM, Pal U (2011) Offline recognition of Devanagari script: a survey. IEEE Trans Syst Man Cybern C Appl Rev 41(6):782–796. doi:10.1109/TSMCC.2010.2095841
Article Google Scholar
Gaikwad-Patil S (2014) A neural network based handwritten character recognition for Marathi script. Dissertation, Shivaji University
Mittal R (2015) Detection and segmentation of text in handwritten Hindi documents. Dissertation, Thapar University
Desai AA (2010) Gujarati handwritten numeral optical character reorganization through neural network. Pattern Recognit 43(7):2582–2589. doi:10.1016/j.patcog.2010.01.008
Article MATH Google Scholar
Kulkarni SA, Borde PR, Manza RL, Yannawar P (2015) Review on recent advances in automatic handwritten MODI script recognition. Int J Comput Appl 115(19):5–12. doi:10.5120/20257-2636
Google Scholar
Gupta V, Rathna GN, Ramakrishnan KR (2008) Automatic Kannada text extraction from camera captured images. MSDES
Bhattacharya U, Parui SK, Mondal S (2009) Devanagari and Bangla text extraction from natural scene images. In: Proceedings of the international conference on document analysis and recognition, pp 171–175. doi:10.1109/ICDAR.2009.178
Angadi SA (2010) Text region extraction from low resolution natural scene images using texture features. In: IEEE Int Adv Comput Conf, pp 121–128
Raj H, Ghosh R (2014) Devanagari text extraction from natural scene images. In: International conference on advances in computing, communications and informatics (ICACCI), pp 513–517
Angadi SA, Kodabagi M (2014) A robust segmentation technique for line, word and character extraction from Kannada text in low resolution display board images. In: International conference on signal and image processing, pp 42–49. doi:10.1109/ICSIP.2014.11
Kaur TP, Garg N (2015) A survey on machine printed Gurmukhi text recognition. Int J Eng Res Technol (IJERT) 4(10):360–366
Google Scholar
Lehal GS (2001) Text segmentation of machine printed Gurmukhi script. Doc Recognit Retr 4307:223–231. doi:10.1117/12.410840
Google Scholar
Lehal GS, Singh C (1999) Feature extraction and classification for OCR of Gurmukhi script. Vivek 12:2–12
Google Scholar
Lehal GS, Singh C (2000) A Gurmukhi script recognition system. International conference on pattern recognition 2:557–560. doi:10.1109/ICPR.2000.906135
Article MATH Google Scholar
Lehal GS (2009) A complete machine printed Gurmukhi OCR system. In: Govindaraju V (ed) Guide to OCR for Indic scripts. Springer, London, pp 43–71
Chapter Google Scholar
Jindal MK, Lehal GS, Sharma RK (2006) Segmentation problems and solutions in printed degraded Gurmukhi script. Int J Signal Process 2(9):258–267
Jindal MK (2007) A study of different kinds of degradation in printed Gurmukhi script. In: International conference on computing: theory and applications, pp 538–544. doi:10.1109/ICCTA.2007.19
Jindal MK, Sharma RK, Lehal GS (2008) Structural features for recognizing degraded printed Gurmukhi script. In: Fifth international conference on information technology: new generations, pp 668–673. doi:10.1109/ITNG.2008.223
Singh G (2009) Optical character recognition of Gurmukhi script using multiple classifiers. In: Proceedings of the international workshop on multilingual OCR. doi:10.1145/1577802.1577810
Singla G, Kumar P (2013) Extract the Punjabi word from machine printed document images. Int J Eng Res Appl 3(5):343–348
Google Scholar
Arora S, Sharma D, Arora S (2014) Recognition of Gurmukhi text from sign board images captured from mobile camera. Int J Inf Comput Technol 4(17):1839–1845
Google Scholar
Sharma DV, Singh S (2014) Gurmukhi text detection and localization in natural scene images. Int J Adv Res Comput Sci Softw Eng 4(5):442–458
Google Scholar
Lucas SM, Panaretos A, Sosa L, Tang A, Wong S, Young R (2003) ICDAR 2003 robust reading competitions. In: Proceedings of international conference on document analysis and recognition, pp 682-687. doi:10.1109/ICDAR.2003.1227749
Lucas SM (2005) ICDAR 2005 text locating competition results. International conference on document analysis and recognition 1:80–84. doi:10.1109/ICDAR.2005.231
Google Scholar
Karatzas D, Robles Mestre S, Mas J, Nourbakhsh F, Roy P (2011) ICDAR 2011 robust reading competition challenge 1? Reading text in born-digital images (Web and Email). In: International conference on document analysis and recognition, pp 1485–1490. doi:10.1109/ICDAR.2011.295
Shahab A, Shafait F, Dengel A (2011) ICDAR 2011 robust reading competition challenge 2: reading text in scene images. In: International conference on document analysis and recognition, pp 1491–1496. doi:10.1109/ICDAR.2011.296
Nagy R, Dicker A, Meyer-wegener K (2011) NEOCR? A configurable dataset for natural image text recognition. Proceedings of international workshop on camera-based document analysis and recognition 6:150–163. doi:10.1007/978-3-642-29364-1_12
Google Scholar
Lee S, Cho MS, Jung K, Kim JH (2010) Scene text extraction with edge constraint and text collinearity. In: Proceedings of international conference on pattern recognition, pp 3983–3986. doi:10.1109/ICPR.2010.969
Netzer Y, Wang T (2011) Reading digits in natural images with unsupervised feature learning. In: Proceedings of NIPS workshop on deep learning and unsupervised feature learning, pp 1–9
Mishra A, Alahari K, Jawahar C (2012) Scene text recognition using higher order language priors. Proceedings of the British machine vision conference 127:1–11. doi:10.5244/C.26.127
Google Scholar
De Campos TE, Babu BR, Varma M (2013) Character recognition in natural images. Document analysis and recognition, pp 1–8

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Dr. B.R. Ambedkar National Institute of Technology, Jalandhar, Punjab, India
Amandeep Kaur & Renu Dhir
Department of Computer Science and Engineering, Punjabi University, Patiala, Punjab, India
Gurpreet Singh Lehal

Authors

Amandeep Kaur
View author publications
You can also search for this author in PubMed Google Scholar
Renu Dhir
View author publications
You can also search for this author in PubMed Google Scholar
Gurpreet Singh Lehal
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Amandeep Kaur.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kaur, A., Dhir, R. & Lehal, G.S. A survey on camera-captured scene text detection and extraction: towards Gurmukhi script. Int J Multimed Info Retr 6, 115–142 (2017). https://doi.org/10.1007/s13735-016-0116-5

Download citation

Received: 21 September 2016
Revised: 18 December 2016
Accepted: 24 December 2016
Published: 19 January 2017
Issue Date: June 2017
DOI: https://doi.org/10.1007/s13735-016-0116-5

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A survey on camera-captured scene text detection and extraction: towards Gurmukhi script

Abstract

Access this article

Similar content being viewed by others

The First Robust Mongolian Text Reading Dataset CSIMU-MTR

Downtown Osaka Scene Text Dataset

An Evaluation and Validation of Contemporary Approaches in Scene Text Extraction and Recognition

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A survey on camera-captured scene text detection and extraction: towards Gurmukhi script

Abstract

Access this article

Similar content being viewed by others

The First Robust Mongolian Text Reading Dataset CSIMU-MTR

Downtown Osaka Scene Text Dataset

An Evaluation and Validation of Contemporary Approaches in Scene Text Extraction and Recognition

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation