Enhancement of ANN-Based Offline Hand Written Character Recognition Using Gradient and Geometric Feature Extraction Techniques

Joarder, Y. A.; Barman, Paresh Chandra; Islam, Md Zahidul

doi:10.1007/978-3-319-58750-9_20

Y. A. Joarder¹¹,
Paresh Chandra Barman¹¹ &
Md Zahidul Islam¹¹

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 713))

Included in the following conference series:

International Conference on Human-Computer Interaction

2207 Accesses

Abstract

Offline handwritten character recognition has been one of the foremost difficult analysis areas within the field of image processing and pattern recognition in the recent years. Handwritten character recognition is a terribly problematic analysis space, because writing styles might vary from one user to another. The main goal of this research is to recognize the characters from a given scanned document or an image file where Multilayered Feed Forward network with Back propagation algorithm including two feature extraction techniques have been implemented at the same system. We have considered parameters like number of Hidden Layer, size of Hidden Layer and Epochs and applied some basic algorithms for segmentation of characters and normalization of characters and thrown light on Gradient and Geometry based feature extraction techniques for feature extraction respectively, because Feature Extraction is an integral part of any recognition system as well as improves recognition rate and misclassification. We have described step by step procedure of character recognition using ANN and calculated the number of hidden layer as well.

You have full access to this open access chapter, Download conference paper PDF

A novel handwritten character recognition system using gradient based features and run length count

Article 12 August 2014

Automatic handwritten character recognition of Devanagari language: a hybrid training algorithm for neural network

Article 12 April 2021

Isarn Dharma Handwritten Character Recognition Using Neural Network and Support Vector Machine

Keywords

1 Introduction

The purpose of this research is to take handwritten English characters as input, process the character, train the neural network algorithm, to recognize the pattern and modify the character to a beautified version of the input as well as explain the mechanism of ANN including the calculation of number of hidden layer; by finding the number of hidden layer, it is easy to understand the complexity of the system. Though this work is restricted to English characters solely, this research is aimed toward developing software which can be useful in recognizing characters of English language. It can be additional developed to recognize the characters of various languages later. It engulfs the idea of neural network. One of the first suggests by which computers are dowered with human-like skills is through the utilization of a neural network. Neural networks are notably helpful for resolution issues that cannot be expressed as a series of steps, like recognizing patterns, classifying them into groups, series prediction and data mining. A neural network trained for classification is intended to take input samples and classify them into groups. These groups could also be fuzzy, while not clearly outlined boundaries. This project engages detecting free handwritten characters.

2 Related Work

Today Neural Networks are mostly used for Pattern Recognition. Optical character recognition (OCR) is widespread use of Neural Network. Different Models of Neural Network have been applied on the test set on each to find the accuracy of the respective Neural Network [1]. However, handwritten character and optical character are different format; optical character recognition is easy for recognition, because of its pattern which easy to recognition. On the contrary, handwritten character recognition is difficult because the large range of writing style from one person to another [2]. Feature extraction which improves recognition rate and misclassification is an integral part of any recognition system [2, 3]. The aim of feature extraction is to describe the pattern by means of minimum number of features that are effective in discriminating pattern classes. The gradient measures the magnitude and direction of the greatest change in intensity in a small neighborhood of each pixel where gradient refers to both the gradient magnitude and direct ion). Gradients are computed by means of the Sobel operator. Due to its logical simplicity, ease of use and high recognition rate, Gradient Features should be used for recognition purposes [3]. Recognition of Handwritten text has numerous applications which include, reading aid for blind and conversion of any hand written document into structural text form. To recognize handwritten characters by projecting them on different sized grids by using Mat lab Neural Network toolbox is the best way. The first step is image acquisition which acquires the scanned image followed by noise filtering, smoothing and normalization of scanned image, rendering image suitable for segmentation where image is decomposed into sub images. Character extraction and edge detection algorithm have been used for training the neural network to classify and recognize the handwritten characters [4]. This paper explores the existing ring based method (W.I. Reber 1987), the new sector based method and the combination of these, termed the Fusion method for the recognition of handwritten English capital letters. The variability associated with the characters is accounted for by way of considering a fixed number of concentric rings in the case of the ring based approach and a fixed number of sectors in the case of the sector approach. Structural features such as end points, junction points and the number of branches are used for the pre-classification of characters, the local features such as normalized vector lengths and angles derived from either ring or sector approaches are used in the training using the reference characters and subsequent recognition of the test characters. The recognition rates obtained are encouraging [5]. A geometry based technique for feature extraction is applicable to segmentation-based word recognition systems. It extracts the geometric features of the character contour. These features are based on the basic line types that form the character skeleton. The system gives a feature vector as its output. The feature vectors so generated from a training set were then used to train a pattern recognition engine based on Neural Networks so that the system can be benchmarked [6]. In computer vision research, object detection based on image processing is the task of identifying a designated object on a static image or a sequence of video frames. Projects based on such research works have been widely adapted to various industrial and social applications. The field to which those applications apply includes but not limited to, security surveillance, intelligent transportation system, automated manufacturing, and quality control and supply chain management. The popular computer vision methods have been extensively studied in various research papers and their significance to computer vision research has been proven by subsequent research works. In general, by categorizing those methods into to gradient-based and edge based feature extraction methods, depending on the low level features they use [7].

3 Proposed Approach

We have used two Feature Extraction techniques in the same system so that the system is more flexible; if any of them out of work for any technical issues, then other one work without any problem. We have used Gradient and Geometry based feature extraction techniques for feature extraction respectively because Feature Extraction is an integral part of any recognition system as well as improves recognition rate and misclassification. The proposed method comprises of 4 phases:

1.
Pre-processing
2.
Segmentation
3.
Feature Extraction
4.
Classification and Recognition

3.1 Pre-processing

In image representation one is concerned with the characterization of the number that every pixel represents. The number of pixels per unit area i.e. sampling rate must be massive enough to preserve the helpful in-formation within the image.

3.2 Segmentation

In the segmentation stage, an image of sequence of characters is rotten into sub-images of individual character. The pre-processed input image is divided into isolated characters by distribution variety to every character employing a labeling method. This labeling provides info concerning range of characters within the image. Every individual character is uniformly resized into pixels. In normalization, we want to normalize the size of the characters. There are massive variations within the sizes of every Character hence we need a technique to normalize the size. For normalizing the size we have used Character Extraction Algorithm and Edge Detection Algorithm.

3.3 Feature Extraction

There are two Feature Extraction methods have been employed:

1.
Feature Extraction Using Gradient Feature
2.
Feature Extraction Based on Character Geometry

Feature Extraction Using Gradient Feature

The gradient measures the magnitude and direction of the best modification in intensity in an exceedingly tiny neighborhood of every pixel. (In what follows, “gradient” refers to each the gradient magnitude and direction) Gradients are computed by means that of the Sobel operator. The Sobel templates accustomed compute the horizontal (X) and vertical (Y) parts of the gradient are shown below (Table 1):

Table 1. Sobel masks for gradient (Source: [3])

Full size table

Given an input image of size G₁ × G₂, each pixel neighborhood is convolved with these templates to work out these X and Y parts, H_x and H_y, severally. Equations (1) and (2) represent their mathematical representation:

$$ \begin{array}{*{20}c} {{\text{H}}\left( {{\text{m}},{\text{n}}} \right) = {\text{I}}\left( {{\text{m}} - 1,{\text{n}} + 1} \right) + 2*{\text{I}}\left( {{\text{m}},{\text{n}} + 1} \right) + {\text{I}}\left( {{\text{m}} + 1,{\text{n}} + 1} \right) - {\text{I}}\left( {{\text{m}} - 1,{\text{n}} - 1} \right) - 2*{\text{I}}\left( {{\text{m}},{\text{n}} - 1} \right)} \\ { - {\text{I}}\left( {{\text{m}} + 1,{\text{n}} - 1} \right)} \\ \end{array} $$

(1)

$$ \begin{array}{*{20}c} {{\text{H}}\left( {{\text{m}},{\text{n}}} \right) = {\text{I}}\left( {{\text{m}} - 1,{\text{n}} - 1} \right) + 2*{\text{I}}\left( {{\text{m}} - 1,{\text{n}}} \right) + {\text{I}}\left( {{\text{m}} - 1,{\text{n}} + 1} \right){\text{y}} - {\text{I}}\left( {{\text{m}} + 1,{\text{n}} - 1} \right) - 2*{\text{I}}\left( {{\text{m + 1}},{\text{n}}} \right)} \\ { - {\text{I}}\left( {{\text{m}} + 1,{\text{n}} + 1} \right)} \\ \end{array} $$

(2)

Here, (m, n) range over the image rows (G₁) and columns (G₂), respectively. The gradient strength and direction can be computed from the gradient vector [H_x, H_y].

After getting gradient vector of every pixel, the gradient image is decomposed into four orientation planes or eight direction planes.

Feature Extraction Based on Character Geometry

It extracts completely different line sorts that form a selected character. It additionally concentrates on the point options of identical. The feature extraction technique explained was tested employing a Neural Network that was trained with the feature vectors obtained from the system proposed.

Universe of Discourse.

Shortest matrix that matches the whole character skeleton.

Zoning.

After the universe of discourse is chosen, the image is split into windows of equal size, and also the feature is completed on individual windows. The image was zoned into nine equal sized windows. Feature extraction was applied to individual zones, instead of the full image.

Starters.

Starters are those pixels with one neighbor within the character skeleton.

Intersections.

It ought to have over one neighbor.

3.4 Classification and Recognition

In Neural Network, each node perform some straight-forward computation and each affiliation conveys a proof from one node to a unique labelled by selection called the “connection strength” or weight indicating the extent thereto signal is amplified or diminished by the connection. Different selections for weight leads to totally different functions are being evaluated by the network. If in a given network whose weight are initial random and provided that we all know the task to be accomplished by the network, a learning algorithm should be accustomed verify the values of the weight which will reach the required task. Learning algorithm qualifies the computing system to be referred to as Artificial Neural Network (Fig. 1).

Hidden Layer Calculation

$$ {\text{T}}_{\text{h}} = {\text{T}}_{\text{s}} /\{ {\text{A}}*({\text{T}}_{\text{i}} + {\text{T}}_{\text{o}} )\} $$

(3)

Here,

T_s = Number of Samples in Training Data Set
T_h = Number of Hidden Layer
T_o = Number of Output Neuron
T_i = Number of Input Neuron
A = Arbitrary Scaling Factor 2-10

Sample Input and Output

The match pattern is obtained to get the associated character, once the network is trained (Fig. 2).

Test Result Comparison

The given line graph shows the variations of Gradient Feature and Character Geometry Feature Extraction methods on the basis of number of Epochs (Fig. 3).

4 Conclusion

The implementation of the absolutely connected Back propagation network gave affordable results toward recognizing characters. The two strategies specified for feature extraction yield desired and right smart accuracies for recognition. The foremost notable is that the proven fact that it cannot handle major variations in translation, rotation, or scale. Whereas some preprocessing steps is enforced so as to account for these variances, as we did generally they are tough to solve fully.

References

Patel, C.I., Patel, R., Patel, P.: Handwritten character recognition using neural networks. Int. J. Sci. Eng. Res. 2(5), 1–6 (2011)
Google Scholar
Rani, M., Meena, Y.K.: An efficient feature extraction method for handwritten character recognition. In: Panigrahi, B.K., Suganthan, P.N., Das, S., Satapathy, S.C. (eds.) SEMCCO 2011. LNCS, vol. 7077, pp. 302–309. Springer, Heidelberg (2011). doi:10.1007/978-3-642-27242-4_35
Chapter Google Scholar
Aggarwal, A., Rani, R., Dhir, R.: Handwritten character recognition using gradient features. Int. J. Adv. Res. Comput. Sci. Softw. Eng. 2(5), 234–240 (2012)
Google Scholar
Prasad, K., Nigam, D.C., Lakhotiya, A., Umre, D.: Character recognition using Matlab’s neural toolbox. Int. J. u- and e- Serv. Sci. Technol. 6(1), 13–20 (2013)
Google Scholar
Hanmandlu, M., Murali Mohan, K.R., Kumar, H.: Neural based handwritten character recognition. In: Proceeding of ICDAR 1999, Proceedings of 5th International Conference on Document Analysis and Recognition, p. 241 (1999)
Google Scholar
Dileep, D.: A Feature Extraction Technique Based on Character Geometry for Character Recognition
Google Scholar
Wang, S.: A review of gradient-based and edge-based feature extraction methods for object detection. In: 2011 IEEE 11th International Conference on Computer and Information Technology (CIT) (2011)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information and Communication Engineering, Islamic University, Kushtia, Bangladesh
Y. A. Joarder, Paresh Chandra Barman & Md Zahidul Islam

Authors

Y. A. Joarder
View author publications
You can also search for this author in PubMed Google Scholar
Paresh Chandra Barman
View author publications
You can also search for this author in PubMed Google Scholar
Md Zahidul Islam
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Y. A. Joarder .

Editor information

Editors and Affiliations

Foundation for Research & Technology – Hellas (FORTH), University of Crete, Heraklion, Crete, Greece
Constantine Stephanidis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Joarder, Y.A., Barman, P.C., Islam, M.Z. (2017). Enhancement of ANN-Based Offline Hand Written Character Recognition Using Gradient and Geometric Feature Extraction Techniques. In: Stephanidis, C. (eds) HCI International 2017 – Posters' Extended Abstracts. HCI 2017. Communications in Computer and Information Science, vol 713. Springer, Cham. https://doi.org/10.1007/978-3-319-58750-9_20

Download citation

DOI: https://doi.org/10.1007/978-3-319-58750-9_20
Published: 13 May 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-58749-3
Online ISBN: 978-3-319-58750-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics