Loading [MathJax]/extensions/MathMenu.js
Bangla text processing and recognition based on Fuzzy unsupervised Feature Extraction and SVM | IEEE Conference Publication | IEEE Xplore

Bangla text processing and recognition based on Fuzzy unsupervised Feature Extraction and SVM


Abstract:

Optical character recognition (OCR) is a widely used technology to convert text images to editable text. Researchers already proposed many machine learning algorithms to ...Show More

Abstract:

Optical character recognition (OCR) is a widely used technology to convert text images to editable text. Researchers already proposed many machine learning algorithms to address this problem. However, Bangla text recognition is highly challenging job for its complicated writing style, compound characters and highly diversified fonts. To address the segmentation problem we have proposed an algorithm namely Blob-Labeled character Segmentation (BLCS) that initiates with an extensive preprocessing to extract the characters from text. Our novel character segmentation procedure extracts characters maintaining 97.5% accuracy. Unsupervised feature learning becomes a powerful tool in machine learning nowadays. To increase the recognition rate of the characters, we have introduced a fuzzy unsupervised feature learning algorithm to learn features of individual characters. We then use Artificial Neural Network (ANN) and Support Vector Machine (SVM) to classify the characters. The SVM provides 99.4% accuracy which outperforms all other approaches.
Date of Conference: 14-17 July 2013
Date Added to IEEE Xplore: 08 September 2014
Electronic ISBN:978-1-4799-0260-6

ISSN Information:

Conference Location: Tianjin, China

Contact IEEE to Subscribe

References

References is not available for this document.