skip to main content
10.1145/3421558.3421572acmotherconferencesArticle/Chapter ViewAbstractPublication PagesipmvConference Proceedingsconference-collections
research-article

Script Identification of Multilingual Document Images Based on Block Finite Ridgelet Transform and Discrete Curvelet Transform

Published:25 November 2020Publication History

ABSTRACT

In recent years, many script recognition methods have emerged since they were studied as a front-end technique of OCR. These methods generally have a pleasing effect on a particular script, but they are not suitable for all languages. In this paper, we utilize the block finite ridgelet transform(BFRT) and discrete curvelet transform(DCT) and propose a fusion method in series for a total of 10,000 document images of 10 scripts including English, Chinese, Uyghur, Tibetan, Arabic, Turkish, Mongolian, Russian, Kazakhstan, Kyrgyzstan. The experimental results show that average accuracy is 99.35% in the classifier of linear discriminant analysis. Comparative experiments showed that the recognition rates of single BFRT and DCT were 89.03% and 86.3%, respectively. It demonstrates the effectiveness of the proposed method than the sole method. The validity of this method is proved by comparing it with some existing methods.

References

  1. S. Ben Moussa, A. Zahour, A. Benabdelhafid, and A. M. Alimi, "Fractal- based system for Arabic/Latin, printed/handwritten script identification," in Proc. ICPR, Tampa, FL, USA, Dec. 2008, pp. 1–4Google ScholarGoogle Scholar
  2. S. Ben Moussa, A. Zahour, A. Benabdelhafid, and A. M. Alimi, "Fractal- based system for Arabic/Latin, printed/handwritten script identification," in Proc. ICPR, Tampa, FL, USA, Dec. 2008, pp. 1–4..Google ScholarGoogle Scholar
  3. S. Chanda, U. Pal, and F. Kimura, "Identification of Japanese and English script from a single document page," in Proc. IEEE-CIT, Oct. 2007, pp. 656–661.Google ScholarGoogle Scholar
  4. A. Busch, W. W. Boles, and S. Sridharan, "Texture for script identification," IEEE Trans. Pattern Anal. Mach. Intell., vol. 27, no. 11, pp. 1720–1732, Nov. 2005.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. S. Chanda, S. Pal, and U. Pal, "Word-wise Sinhala Tamil and English script identification using Gaussian kernel SVM," in Proc. ICPR, Tampa, FL, USA, Dec. 2008, pp. 1–4.Google ScholarGoogle Scholar
  6. S. Chanda and U. Pal, "English, Devanagari, and Urdu text identification," in Proc. Int. Conf. Cognit. Recognit., 2005, pp. 538–546.Google ScholarGoogle Scholar
  7. S. Chanda, U. Pal, K. Franke, and F. Kimura, "Script identification — A han and roman script perspective," in Proc. ICPR, Istanbul, Turkey, Aug. 2010, pp. 2708–2711.Google ScholarGoogle Scholar
  8. B. B. Chaudhuri, "On multi-script OCR system evaluation," in Proc.Int.Workshop Perform. Eval. Issues Multi-Lingual (OCR), 1999, p. 1. [Online].Google ScholarGoogle Scholar
  9. U. Pal, N. Sharma, T. Wakabayashi, and F. Kimura, "Handwritten numeral recognition of six popular Indian scripts," in Proc. ICDAR, Parana, Sep. 2007, pp. 749–753.Google ScholarGoogle Scholar
  10. D.K. Vishwakarma, Prachi Rawat, Rajiv Kapoor, Human Activity Recognition Using Gabor Wavelet Transform and Ridgelet Transform, Procedia Computer Science, Volume 57,2015.Google ScholarGoogle ScholarCross RefCross Ref
  11. S.Arivazhagan,L.Ganesan,T.G.SubashKumar,Texture classification using ridgelet transform,Pattern Recognition Letters,Volume 27, Issue 16,2006Google ScholarGoogle ScholarCross RefCross Ref
  12. S. B. Nikam and S. Agarwal, "Fingerprint Anti-Spoofing Using Ridgelet Transform," 2008 IEEE Second International Conference on Biometrics: Theory, Applications, and Systems, Arlington, VA, 2008, pp. 1-6.Google ScholarGoogle Scholar
  13. M. N. Do and M. Vetterli, "The finite ridgelet transform for image representation," in IEEE Transactions on Image Processing, vol. 12, no. 1, pp. 16-28, Jan. 2003.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. LI Shun, Mutelep · Mamut, Hornisa · Mamat, Alim · Aysa, Kurban · Ubul. 'recognition of multilingual document images based on discrete quad transform [J]'. Computer engineering and design,2019,40(05):1376-1382. (in Chinese)Google ScholarGoogle Scholar
  15. M. N. Do and M. Vetterli, "Orthogonal finite ridgelet transform for image compression", IEEE International Conference on Image Processing (ICIP), Vancouver, Canada, September 2000.Google ScholarGoogle Scholar
  16. M. N. Do and M. Vetterli, "Image denoising using orthonormal finite ridgelet transform", Proc. of SPIE Conf. on Wavelet Applications in Signal and Image Processing VIII, San Diego, USA, August 2000.Google ScholarGoogle ScholarCross RefCross Ref
  17. M. N. Do and M. Vetterli, "The contourlet transform: an efficient directional multiresolution image representation," in IEEE Transactions on Image Processing, vol. 14, no. 12, pp. 2091-2106, Dec. 2005.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. F. Matus and J. Flusser, "Image representation via a finite Radon transform," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 15, no. 10, pp. 996-1006, Oct. 1993.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. C. M. Brislawn, "Classification of nonexpansive symmetric extension transforms for multirate filter banks," Appl. Comput. Harmon. Anal., vol. 3, pp. 337–357, 1996.Google ScholarGoogle ScholarCross RefCross Ref
  20. E.J. Candes, L. Demanet, D. Donoho, and L. Ying. Fast discrete curvelet transforms. SIAM Multiscale Model. Simul., 5/3:861–899, 2006.Google ScholarGoogle ScholarCross RefCross Ref
  21. E.J.Candes and D. L. Donoho. New tight frames of curvelets and optimal representations of objects with smooth singularities. Technical report, Statistics, Stanford University, 2002.Google ScholarGoogle Scholar
  22. S. J. Lu and C. L. Tan, "Automatic Detection of Document Script and Orientation," in Document Analysis and Recognition, 2007. ICDAR'07, 7th International Conference on, vol. 1, pp. 237-241, 2007.Google ScholarGoogle Scholar
  23. Buagaguri Migitti, kuerban Ubul, Nurbiya Yadikal, Turgen ibulain, Alimjan Aysa. Texture feature weighted fusion for document image identification of multiple languages in central Asia [J]. Computer engineering and applications,2017,53(20):187-194.Google ScholarGoogle Scholar
  24. U. Pal, S. Sinha, and B. B. Chaudhuri, "Multi-Script Line identification from Indian Documents," in Document Analysis and Recognition, 2003. ICDAR' 03. 3th International Conference on, vol. 3163, pp. 880-884, 2003.Google ScholarGoogle Scholar
  25. Li shun, Mutelipu Mamuti, Urnisha Mamuti, Alimjan Aysa, kuerban Ubul. Recognition of multilingual document images based on discrete curvilinear transform [J]. Computer engineering and design,2019,40(05):1376-1382.Google ScholarGoogle Scholar
  26. P. S. Hiremath and S. Shivashankar, "Wavelet based co-occurrence histogram features for texture classification," Pattern Recognition Letters, vol.29, pp. 1182–1189, 2008.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Han xingkun, alimujiang aisha, nurbiya yadikar, zhu yali, kuerban wubuli. Recognition of central Asian languages with texture feature fusion in NSCT subregion [J]. Computer engineering and design, 2018,39(09):2848-2855.Google ScholarGoogle Scholar
  28. M. A. Ferrer, A. Morales, and U. Pal, "LBP Based Line-Wise Script Identification," in Document Analysis and Recognition, 2013. ICDAR'12. 12th International Conference on, pp. 369-373, 2013.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    IPMV '20: Proceedings of the 2020 2nd International Conference on Image Processing and Machine Vision
    August 2020
    194 pages
    ISBN:9781450388412
    DOI:10.1145/3421558

    Copyright © 2020 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 25 November 2020

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited
  • Article Metrics

    • Downloads (Last 12 months)8
    • Downloads (Last 6 weeks)1

    Other Metrics

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format