ABSTRACT
In recent years, many script recognition methods have emerged since they were studied as a front-end technique of OCR. These methods generally have a pleasing effect on a particular script, but they are not suitable for all languages. In this paper, we utilize the block finite ridgelet transform(BFRT) and discrete curvelet transform(DCT) and propose a fusion method in series for a total of 10,000 document images of 10 scripts including English, Chinese, Uyghur, Tibetan, Arabic, Turkish, Mongolian, Russian, Kazakhstan, Kyrgyzstan. The experimental results show that average accuracy is 99.35% in the classifier of linear discriminant analysis. Comparative experiments showed that the recognition rates of single BFRT and DCT were 89.03% and 86.3%, respectively. It demonstrates the effectiveness of the proposed method than the sole method. The validity of this method is proved by comparing it with some existing methods.
- S. Ben Moussa, A. Zahour, A. Benabdelhafid, and A. M. Alimi, "Fractal- based system for Arabic/Latin, printed/handwritten script identification," in Proc. ICPR, Tampa, FL, USA, Dec. 2008, pp. 1–4Google Scholar
- S. Ben Moussa, A. Zahour, A. Benabdelhafid, and A. M. Alimi, "Fractal- based system for Arabic/Latin, printed/handwritten script identification," in Proc. ICPR, Tampa, FL, USA, Dec. 2008, pp. 1–4..Google Scholar
- S. Chanda, U. Pal, and F. Kimura, "Identification of Japanese and English script from a single document page," in Proc. IEEE-CIT, Oct. 2007, pp. 656–661.Google Scholar
- A. Busch, W. W. Boles, and S. Sridharan, "Texture for script identification," IEEE Trans. Pattern Anal. Mach. Intell., vol. 27, no. 11, pp. 1720–1732, Nov. 2005.Google ScholarDigital Library
- S. Chanda, S. Pal, and U. Pal, "Word-wise Sinhala Tamil and English script identification using Gaussian kernel SVM," in Proc. ICPR, Tampa, FL, USA, Dec. 2008, pp. 1–4.Google Scholar
- S. Chanda and U. Pal, "English, Devanagari, and Urdu text identification," in Proc. Int. Conf. Cognit. Recognit., 2005, pp. 538–546.Google Scholar
- S. Chanda, U. Pal, K. Franke, and F. Kimura, "Script identification — A han and roman script perspective," in Proc. ICPR, Istanbul, Turkey, Aug. 2010, pp. 2708–2711.Google Scholar
- B. B. Chaudhuri, "On multi-script OCR system evaluation," in Proc.Int.Workshop Perform. Eval. Issues Multi-Lingual (OCR), 1999, p. 1. [Online].Google Scholar
- U. Pal, N. Sharma, T. Wakabayashi, and F. Kimura, "Handwritten numeral recognition of six popular Indian scripts," in Proc. ICDAR, Parana, Sep. 2007, pp. 749–753.Google Scholar
- D.K. Vishwakarma, Prachi Rawat, Rajiv Kapoor, Human Activity Recognition Using Gabor Wavelet Transform and Ridgelet Transform, Procedia Computer Science, Volume 57,2015.Google ScholarCross Ref
- S.Arivazhagan,L.Ganesan,T.G.SubashKumar,Texture classification using ridgelet transform,Pattern Recognition Letters,Volume 27, Issue 16,2006Google ScholarCross Ref
- S. B. Nikam and S. Agarwal, "Fingerprint Anti-Spoofing Using Ridgelet Transform," 2008 IEEE Second International Conference on Biometrics: Theory, Applications, and Systems, Arlington, VA, 2008, pp. 1-6.Google Scholar
- M. N. Do and M. Vetterli, "The finite ridgelet transform for image representation," in IEEE Transactions on Image Processing, vol. 12, no. 1, pp. 16-28, Jan. 2003.Google ScholarDigital Library
- LI Shun, Mutelep · Mamut, Hornisa · Mamat, Alim · Aysa, Kurban · Ubul. 'recognition of multilingual document images based on discrete quad transform [J]'. Computer engineering and design,2019,40(05):1376-1382. (in Chinese)Google Scholar
- M. N. Do and M. Vetterli, "Orthogonal finite ridgelet transform for image compression", IEEE International Conference on Image Processing (ICIP), Vancouver, Canada, September 2000.Google Scholar
- M. N. Do and M. Vetterli, "Image denoising using orthonormal finite ridgelet transform", Proc. of SPIE Conf. on Wavelet Applications in Signal and Image Processing VIII, San Diego, USA, August 2000.Google ScholarCross Ref
- M. N. Do and M. Vetterli, "The contourlet transform: an efficient directional multiresolution image representation," in IEEE Transactions on Image Processing, vol. 14, no. 12, pp. 2091-2106, Dec. 2005.Google ScholarDigital Library
- F. Matus and J. Flusser, "Image representation via a finite Radon transform," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 15, no. 10, pp. 996-1006, Oct. 1993.Google ScholarDigital Library
- C. M. Brislawn, "Classification of nonexpansive symmetric extension transforms for multirate filter banks," Appl. Comput. Harmon. Anal., vol. 3, pp. 337–357, 1996.Google ScholarCross Ref
- E.J. Candes, L. Demanet, D. Donoho, and L. Ying. Fast discrete curvelet transforms. SIAM Multiscale Model. Simul., 5/3:861–899, 2006.Google ScholarCross Ref
- E.J.Candes and D. L. Donoho. New tight frames of curvelets and optimal representations of objects with smooth singularities. Technical report, Statistics, Stanford University, 2002.Google Scholar
- S. J. Lu and C. L. Tan, "Automatic Detection of Document Script and Orientation," in Document Analysis and Recognition, 2007. ICDAR'07, 7th International Conference on, vol. 1, pp. 237-241, 2007.Google Scholar
- Buagaguri Migitti, kuerban Ubul, Nurbiya Yadikal, Turgen ibulain, Alimjan Aysa. Texture feature weighted fusion for document image identification of multiple languages in central Asia [J]. Computer engineering and applications,2017,53(20):187-194.Google Scholar
- U. Pal, S. Sinha, and B. B. Chaudhuri, "Multi-Script Line identification from Indian Documents," in Document Analysis and Recognition, 2003. ICDAR' 03. 3th International Conference on, vol. 3163, pp. 880-884, 2003.Google Scholar
- Li shun, Mutelipu Mamuti, Urnisha Mamuti, Alimjan Aysa, kuerban Ubul. Recognition of multilingual document images based on discrete curvilinear transform [J]. Computer engineering and design,2019,40(05):1376-1382.Google Scholar
- P. S. Hiremath and S. Shivashankar, "Wavelet based co-occurrence histogram features for texture classification," Pattern Recognition Letters, vol.29, pp. 1182–1189, 2008.Google ScholarDigital Library
- Han xingkun, alimujiang aisha, nurbiya yadikar, zhu yali, kuerban wubuli. Recognition of central Asian languages with texture feature fusion in NSCT subregion [J]. Computer engineering and design, 2018,39(09):2848-2855.Google Scholar
- M. A. Ferrer, A. Morales, and U. Pal, "LBP Based Line-Wise Script Identification," in Document Analysis and Recognition, 2013. ICDAR'12. 12th International Conference on, pp. 369-373, 2013.Google Scholar
Recommendations
A 4-quadrant Curvelet Transform for Denoising Digital Images
The conventional discrete wavelet transform (DWT) introduces artifacts during denoising of images containing smooth curves. Finite ridgelet transform (FRIT) solved this problem by mapping the curves in terms of small curved ridges. However, blind ...
Radon transform and dynamic programming for the Persian handwritten zip code recognition
Pattern recognition is one of the major research areas in computer sciences. Optical character recognition OCR as one of the pattern recognition topics has specifically attracted the interests of many researchers. This paper presents a method for ...
Local features-based script recognition from printed bilingual document images
Classification and identification of language in a biscript document is one of the important steps in the design of an OCR system for successful analysis and recognition. This paper presents architecture for script recognition of bilingual document ...
Comments