research-article

Script Identification of Multilingual Document Images Based on Block Finite Ridgelet Transform and Discrete Curvelet Transform

Authors:
Zheng-Jian Wu

Xinjiang University, China

Xinjiang University, China
View Profile

,
Reyihanguli Hasimu

Xinjiang University, China

Xinjiang University, China
View Profile

,
Hornisa Mamat

Xinjiang University, China

Xinjiang University, China
View Profile

,
Alimjan Aysa

Xinjiang University

Xinjiang University
View Profile

,
Kurban Ubul

Xinjiang University, China

Xinjiang University, China
View Profile

IPMV '20: Proceedings of the 2020 2nd International Conference on Image Processing and Machine VisionAugust 2020Pages 87–93https://doi.org/10.1145/3421558.3421572

Published:25 November 2020Publication History

IPMV '20: Proceedings of the 2020 2nd International Conference on Image Processing and Machine Vision

Pages 87–93

ABSTRACT

In recent years, many script recognition methods have emerged since they were studied as a front-end technique of OCR. These methods generally have a pleasing effect on a particular script, but they are not suitable for all languages. In this paper, we utilize the block finite ridgelet transform(BFRT) and discrete curvelet transform(DCT) and propose a fusion method in series for a total of 10,000 document images of 10 scripts including English, Chinese, Uyghur, Tibetan, Arabic, Turkish, Mongolian, Russian, Kazakhstan, Kyrgyzstan. The experimental results show that average accuracy is 99.35% in the classifier of linear discriminant analysis. Comparative experiments showed that the recognition rates of single BFRT and DCT were 89.03% and 86.3%, respectively. It demonstrates the effectiveness of the proposed method than the sole method. The validity of this method is proved by comparing it with some existing methods.

References

S. Ben Moussa, A. Zahour, A. Benabdelhafid, and A. M. Alimi, "Fractal- based system for Arabic/Latin, printed/handwritten script identification," in Proc. ICPR, Tampa, FL, USA, Dec. 2008, pp. 1–4Google Scholar
S. Ben Moussa, A. Zahour, A. Benabdelhafid, and A. M. Alimi, "Fractal- based system for Arabic/Latin, printed/handwritten script identification," in Proc. ICPR, Tampa, FL, USA, Dec. 2008, pp. 1–4..Google Scholar
S. Chanda, U. Pal, and F. Kimura, "Identification of Japanese and English script from a single document page," in Proc. IEEE-CIT, Oct. 2007, pp. 656–661.Google Scholar
A. Busch, W. W. Boles, and S. Sridharan, "Texture for script identification," IEEE Trans. Pattern Anal. Mach. Intell., vol. 27, no. 11, pp. 1720–1732, Nov. 2005.Google ScholarDigital Library
S. Chanda, S. Pal, and U. Pal, "Word-wise Sinhala Tamil and English script identification using Gaussian kernel SVM," in Proc. ICPR, Tampa, FL, USA, Dec. 2008, pp. 1–4.Google Scholar
S. Chanda and U. Pal, "English, Devanagari, and Urdu text identification," in Proc. Int. Conf. Cognit. Recognit., 2005, pp. 538–546.Google Scholar
S. Chanda, U. Pal, K. Franke, and F. Kimura, "Script identification — A han and roman script perspective," in Proc. ICPR, Istanbul, Turkey, Aug. 2010, pp. 2708–2711.Google Scholar
B. B. Chaudhuri, "On multi-script OCR system evaluation," in Proc.Int.Workshop Perform. Eval. Issues Multi-Lingual (OCR), 1999, p. 1. [Online].Google Scholar
U. Pal, N. Sharma, T. Wakabayashi, and F. Kimura, "Handwritten numeral recognition of six popular Indian scripts," in Proc. ICDAR, Parana, Sep. 2007, pp. 749–753.Google Scholar
D.K. Vishwakarma, Prachi Rawat, Rajiv Kapoor, Human Activity Recognition Using Gabor Wavelet Transform and Ridgelet Transform, Procedia Computer Science, Volume 57,2015.Google ScholarCross Ref
S.Arivazhagan,L.Ganesan,T.G.SubashKumar,Texture classification using ridgelet transform,Pattern Recognition Letters,Volume 27, Issue 16,2006Google ScholarCross Ref
S. B. Nikam and S. Agarwal, "Fingerprint Anti-Spoofing Using Ridgelet Transform," 2008 IEEE Second International Conference on Biometrics: Theory, Applications, and Systems, Arlington, VA, 2008, pp. 1-6.Google Scholar
M. N. Do and M. Vetterli, "The finite ridgelet transform for image representation," in IEEE Transactions on Image Processing, vol. 12, no. 1, pp. 16-28, Jan. 2003.Google ScholarDigital Library
LI Shun, Mutelep · Mamut, Hornisa · Mamat, Alim · Aysa, Kurban · Ubul. 'recognition of multilingual document images based on discrete quad transform [J]'. Computer engineering and design,2019,40(05):1376-1382. (in Chinese)Google Scholar
M. N. Do and M. Vetterli, "Orthogonal finite ridgelet transform for image compression", IEEE International Conference on Image Processing (ICIP), Vancouver, Canada, September 2000.Google Scholar
M. N. Do and M. Vetterli, "Image denoising using orthonormal finite ridgelet transform", Proc. of SPIE Conf. on Wavelet Applications in Signal and Image Processing VIII, San Diego, USA, August 2000.Google ScholarCross Ref
M. N. Do and M. Vetterli, "The contourlet transform: an efficient directional multiresolution image representation," in IEEE Transactions on Image Processing, vol. 14, no. 12, pp. 2091-2106, Dec. 2005.Google ScholarDigital Library
F. Matus and J. Flusser, "Image representation via a finite Radon transform," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 15, no. 10, pp. 996-1006, Oct. 1993.Google ScholarDigital Library
C. M. Brislawn, "Classification of nonexpansive symmetric extension transforms for multirate filter banks," Appl. Comput. Harmon. Anal., vol. 3, pp. 337–357, 1996.Google ScholarCross Ref
E.J. Candes, L. Demanet, D. Donoho, and L. Ying. Fast discrete curvelet transforms. SIAM Multiscale Model. Simul., 5/3:861–899, 2006.Google ScholarCross Ref
E.J.Candes and D. L. Donoho. New tight frames of curvelets and optimal representations of objects with smooth singularities. Technical report, Statistics, Stanford University, 2002.Google Scholar
S. J. Lu and C. L. Tan, "Automatic Detection of Document Script and Orientation," in Document Analysis and Recognition, 2007. ICDAR'07, 7th International Conference on, vol. 1, pp. 237-241, 2007.Google Scholar
Buagaguri Migitti, kuerban Ubul, Nurbiya Yadikal, Turgen ibulain, Alimjan Aysa. Texture feature weighted fusion for document image identification of multiple languages in central Asia [J]. Computer engineering and applications,2017,53(20):187-194.Google Scholar
U. Pal, S. Sinha, and B. B. Chaudhuri, "Multi-Script Line identification from Indian Documents," in Document Analysis and Recognition, 2003. ICDAR' 03. 3th International Conference on, vol. 3163, pp. 880-884, 2003.Google Scholar
Li shun, Mutelipu Mamuti, Urnisha Mamuti, Alimjan Aysa, kuerban Ubul. Recognition of multilingual document images based on discrete curvilinear transform [J]. Computer engineering and design,2019,40(05):1376-1382.Google Scholar
P. S. Hiremath and S. Shivashankar, "Wavelet based co-occurrence histogram features for texture classification," Pattern Recognition Letters, vol.29, pp. 1182–1189, 2008.Google ScholarDigital Library
Han xingkun, alimujiang aisha, nurbiya yadikar, zhu yali, kuerban wubuli. Recognition of central Asian languages with texture feature fusion in NSCT subregion [J]. Computer engineering and design, 2018,39(09):2848-2855.Google Scholar
M. A. Ferrer, A. Morales, and U. Pal, "LBP Based Line-Wise Script Identification," in Document Analysis and Recognition, 2013. ICDAR'12. 12th International Conference on, pp. 369-373, 2013.Google Scholar

Recommendations

A 4-quadrant Curvelet Transform for Denoising Digital Images

The conventional discrete wavelet transform (DWT) introduces artifacts during denoising of images containing smooth curves. Finite ridgelet transform (FRIT) solved this problem by mapping the curves in terms of small curved ridges. However, blind ...
Read More
Radon transform and dynamic programming for the Persian handwritten zip code recognition

Pattern recognition is one of the major research areas in computer sciences. Optical character recognition OCR as one of the pattern recognition topics has specifically attracted the interests of many researchers. This paper presents a method for ...
Read More
Local features-based script recognition from printed bilingual document images

Classification and identification of language in a biscript document is one of the important steps in the design of an OCR system for successful analysis and recognition. This paper presents architecture for script recognition of bilingual document ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

IPMV '20: Proceedings of the 2020 2nd International Conference on Image Processing and Machine Vision
August 2020
194 pages
ISBN:9781450388412
DOI:10.1145/3421558

Copyright © 2020 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 25 November 2020
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Curvelet transform
Feature fusion
Ridgelet transform
Script recognition
Qualifiers
- research-article
- Research
- Refereed limited
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 36
  Total Downloads
- Downloads (Last 12 months)8
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format

Script Identification of Multilingual Document Images Based on Block Finite Ridgelet Transform and Discrete Curvelet Transform

IPMV '20: Proceedings of the 2020 2nd International Conference on Image Processing and Machine Vision

ABSTRACT

References

Cited By

Recommendations

A 4-quadrant Curvelet Transform for Denoising Digital Images

Radon transform and dynamic programming for the Persian handwritten zip code recognition

Local features-based script recognition from printed bilingual document images

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

HTML Format

Caption

Script Identification of Multilingual Document Images Based on Block Finite Ridgelet Transform and Discrete Curvelet Transform

IPMV '20: Proceedings of the 2020 2nd International Conference on Image Processing and Machine Vision

ABSTRACT

References

Cited By

Recommendations

A 4-quadrant Curvelet Transform for Denoising Digital Images

Radon transform and dynamic programming for the Persian handwritten zip code recognition

Local features-based script recognition from printed bilingual document images

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

HTML Format

Share this Publication link

Share on Social Media