A simple computational model for image retrieval with weighted multifeatures based on orthogonal polynomials and genetic algorithm

doi:10.1016/j.neucom.2012.05.030

Neurocomputing

Volume 116, 20 September 2013, Pages 165-181

https://doi.org/10.1016/j.neucom.2012.05.030 Get rights and content

Abstract

This paper proposes a simple and new image retrieval method with weighted multifeature set based on multiresolution enhanced orthogonal polynomials model and genetic algorithm. In the proposed method, initially the orthogonal polynomials model coefficients are computed and reordered into multiresolution subband like structure. Then the statistical, directional, perceptual and invariant texture, shape and color features are directly extracted from the subband coefficients. The extracted texture, shape and color features are integrated into linear multifeature set and the significance of each feature in the multifeature set is determined by assigning appropriate weight. This paper also proposes a method to compute the optimized weight for each feature in the integrated linear multifeature multi feature set using genetic algorithm. Then the obtained optimized weight is multiplied with the corresponding features in the multifeature set and the weighted Manhattan distance metric is used for retrieving similar images. The efficiency of the proposed method is experimented on the standard subset of Corel and Caltech database images. The performance of the proposed method is compared with other existing retrieval methods such as Haar wavelet and Contourlet Transform based retrieval schemes. The proposed method yields high average recall and precision of 92.6% and 71% for Corel database and 90.5% and 72.3% of Caltech database images when compared with other existing methods.

Introduction

With the rapid growth of digital and information technologies, more and more multimedia data are generated and made available in digital form. Searching and retrieving relevant images in this huge volume of data is a difficult task and has created an urgent need to develop new tools and techniques. One such solution is the Content Based Image Retrieval (CBIR). As the image databases grow larger, the traditional keyword-based approach for retrieving a particular image becomes inefficient and suffers from the following limitations: (i) vast amount of labor is required for manual image annotation and (ii) limited capacity for retrieving the visual content of the image and subjectivity of human perception. Hence, to overcome these difficulties of manual annotation approach, content based image retrieval has emerged. CBIR is a collection of techniques and algorithms which enable querying the image databases with low level image content such as color, texture, objects and their geometries rather than textual attributes such as image name or other keywords [1]. Many image retrieval systems have been developed using all or some of these features. It includes Chabot [2], Photobook [3], QBIC [4], Virage [5], VisualSeek [6], MARS [7], Netra [8] and Excalibur [9]. The extensive literature and the state of art methods about content based image retrieval can be found in [10], [11], [12], [13], [14], [15]. Though some of the image retrieval applications such as trademark retrieval, character recognition and leaf image retrieval [16], [17] are implemented based on single feature (either texture, shape or color), but the single feature is found to be insufficient for natural, web based image retrieval applications as it affects the retrieval performance. Hence recently general-purpose CBIR systems concentrate on multiple features such as color, texture and shape along with some domain specific features for improving the performance of the image retrieval.

Sklansky [18] defined the texture as a set of local properties in the image region with a constant, slowly varying or approximately periodic pattern and it is measured using its distinct properties such as periodicity, coarseness, directionality and pattern complexity for efficient image retrieval particularly on the aspects of orientation and scale [19], [20]. In a typical CBIR system, identification of the proper features that maximizes the differentiation of the texture is an important step. There are many categories of methods that exist for identifying and manipulating the texture: (i) statistical methods (Gray level Co occurrence matrix (GLCM) [21]), (ii) model based methods such as Markov Random Fields (MRF) [22], Simultaneous Auto Regression (SAR) [23], Wold decomposition [24] and (iii) signal processing methods (Gabor filters [25], Wavelet Transforms [26], [27]). Some of these techniques depend on the comparison values of second order statistics obtained from query and stored images [28], [27] for measuring the texture similarity.

The shape of an image is effectively perceived by the human eye than color or texture. Hence shape-based searching and retrieving has gained much attention in CBIR. Shape-based retrieval involves three primary issues: shape representation, shape similarity measure and shape indexing. Among them, shape representation is the most important issue in shape based image retrieval. The shape representation methods reported in the literature can be classified into two categories: region based and contour based [29]. Region based techniques have frequently used moment descriptors to obtain shape representation [30], [31], [32], [33] which considers all the pixels inside the shape to compute the shape features. Contour based shape representation [29] only exploits shape boundary information and are classified into continuous approach (global) and discrete approach (structural). Both region and contour based representation methods compute shape features either in spatial or frequency domain. Spatial domain descriptors are sensitive to noise and not robust. In addition, it requires intensive computation during similarity calculation, due to the hard normalization of rotation invariance [34]. As the result, these spatial representations need further processing using spectral transform such as Fourier transform and wavelet transform. Wavelet descriptors have the advantage over Fourier descriptors in that they achieve localization of shape features in joint-space, i.e., in both spatial and frequency domains.

Early color based retrieval systems have used the global RGB histogram information such as the Local Color histogram [35], histogram difference approach [36], histogram intersection [37], [38], [39] and quadratic histogram comparison [40]. Though the color histogram based approaches are extremely easy to compute and insensitive to small changes in viewing positions and partial occlusion, they do not capture local spatial color information. Hence this approach is liable to false positives and is not robust to large appearance changes. Several recent schemes viz, Color Coherence Vector [41], Color Correlogram [42] and Binary Color Set [43], [44] incorporate spatial correlation of color regions as well as the global distribution of local spatial correlation of colors to improve upon the histogram method. Though these techniques perform better than traditional histograms, they require intensive computation. Generally color spatial techniques are classified into three categories [45]: (i) partition based approach (ii) signature based approach and (iii) cluster based approach.

Saber and Tekalp [46] have introduced a region based algorithm for automatic image annotation and retrieval with color, edge, shape and texture features. The regions are extracted based on edge information and Bayesian techniques with texture and shape features are computed on these regions. Region-wise feature extraction becomes computationally intensive and degrades the retrieval performance. The combination of structure, color and texture features for image retrieval is reported in [47]. But in this method the automatic adjustment of weight for the features are missing and obtaining texture feature using gabor filter becomes computationally difficult. Howe and Huttenlocher [48] have used a technique that integrates diverse and expandable set of image properties such as color, texture and location in a retrieval framework. The retrieval performance of this method is fairly better (68.3%) compared to other methods and the substantial control is given to end user for retrieving relevant images. Wavelet transform based color, shape and texture feature extraction for image retrieval is reported in [49], [50]. The multifeatures such as color, shape and texture are extracted from low and high frequency band of wavelet transform and the weight for each feature is determined using relevance feedback technique. Katare et al. [51] have integrated the shape and color feature for multi object image retrieval. In this method, the object with different orientation could not be identified with active contour based shape feature representation and it degrades the retrieval performance. Xiaojuan et al. [52] have established a method for image retrieval based on multifeatures with color, shape and texture and also introduced a method for normalization of the multifeature set. In this work, the multifeature normalization is performed in two stages: (i) Internal Normalization and (ii) Exterior Normalization and the authors themselves claimed that it is computationally intensive. An online application called garment image retrieval has used multifeatures for retrieving similar garment images [53]. Since the physical meaning, importance and value ranges of each feature are different in the linear combination of multifeature set, the similarity score computation with single distance metric becomes a series problem and degrades the retrieval accuracy. This problem can be solved using two methods viz., (i) relevance feedback [54], [55], [56] and (ii) appropriate weight generation [57], [58]. Relevance feedback is computationally high demanding and difficult to incorporate human into the loop. The latter method can be viewed as an optimization problem and a suitable optimization technique has to be incorporated to generate the weight in an adaptive manner for effective discrimination and retrieval with less computational cost. Hence this paper proposes a method for optimized weight generation using genetic algorithm for multifeature representation with multiresolution enhanced orthogonal polynomials model for efficient image retrieval. This paper is organized as follows. The orthogonal polynomials model and reordering the transformed coefficients into multiresolution subband like structure have been described in Section 2. The multi feature extraction is presented in Section 3. The evolution and the process of genetic algorithm are presented in Section 4. The genetic algorithm based optimized weight generation process is presented in Section 5. The performance metric is described in Section 6. Experiments and results are discussed in Section 7 and conclusion is drawn in Section 8.

Section snippets

Multiresolution reordering with orthogonal polynomials model coefficients

In this section the orthogonal polynomials model and the reordering of the orthogonal polynomials coefficients into multiresolution subband structure is presented. Multiresolution analysis plays a vital role in Human Visual System (HVS) and the experimental studies have shown that the eye's sensitivity to a visual stimulus strongly depends upon the spatial frequency contents of this stimulus. Hence, HVS motivates the use of multiscale image decompositions as a front end to complex image

Proposed rotation invariant feature extraction

In this section, the feature extraction process of the image under analysis based on the orthogonal polynomials model with the multiresolution reordered subbands is presented. In the orthogonal polynomials model, it is observed that when the block of an image is rotated the coefficient's magnitude remains unaltered but their position and the sign vary. At the same time the absolute difference between the corresponding coefficients of the original and the rotated block in the zig-zag sequence is

Evolution of genetic algorithm

This section describes the general procedure for computing Genetic Algorithm (GA). The basic concept behind GA is the survival of the fittest and is inspired by and named after biological processes of inheritance, mutation, natural selection and the genetic crossover that occurs when parents mate to produce offspring [66]. GA searches the solution by maintaining a population of solutions from which better solutions are created, whereas the conventional non-linear optimization techniques making

Proposed GA based weight generation

In this section, GA based feature weight generation process for the proposed multifeature representation scheme using texture and shape is presented. In the proposed weight generation method, initially the database (DB) images considered for experimentation are subjected to (i) Training and (ii) Testing. In the training portion, training pair (TP) is generated as described below:

Training Pair: A training Pair (TP) is the pair which consists of query image I_Q and the user defined best matched

Performance measure

In this section the performance evaluation metric for the proposed method is presented. The performance of the proposed method is measured in terms of overall retrieval accuracy and accuracy of retrieval for a particular class. Accuracy of retrieval RetAcc_i for a particular class is the number of top images retrieved corresponding to the query image class: $R e t A c c_{i} = \frac{1}{| D B_{i} | ⁎ H} \sum_{j = 0}^{D B_{i}} \sum_{k = 0}^{H} h (D B_{j}^{i}, R_{k})$ where DB denotes the database and $D B_{j}^{i}$ denote the jth image belonging to ith class and H is the number

Experiments and results

The retrieval efficiency of the proposed multiresolution enhanced orthogonal polynomials with weighted multifeature using genetic algorithm is experimented with a subset of popular image databases COREL [69] and Caltech [70]. The experimental results are presented in this section. The images in these databases are of color images and are resized into (256×256). The databases contain various classes of images and some of the sample images from each database are shown in Fig. 5.

During

Conclusion

In this paper a new image retrieval method with weighted multi feature in multiresolution enhanced orthogonal polynomials model and genetic algorithm is proposed. The transformed coefficients are reordered into multiresolution subband like structure and the statistical and invariant texture, shape and color features are directly extracted from them. The extracted texture, shape and color features are integrated into linear multifeature set and the significance of each feature in the

References (70)

Y. Rui et al.
Image retrieval: current techniques, promising directions and open issues
J. Vis. Commun. Image Represent.
(1999)
X.F. Wang et al.
Classification of plant leaf images with complicated background
Appl. Math. Comput.
(2008)
J.X. Du et al.
Shape recognition based on neural networks trained by differential evolution algorithm
Neurocomputing
(2007)
J. Mao et al.
Texture classification and segmentation using multi resolution simultaneous autoregressive models
Pattern Recognition
(1992)
D. Zhang et al.
Review of shape representation and description techniques
Pattern Recognition
(2004)
H.W. Yoo et al.
Visual information retrieval system via content based approach
Pattern Recognition
(2002)
R. Krishnamoorthi
Transform coding of monochrome image with statistical design of experiments approach to separate noise
Pattern Recognition Lett.
(2007)
R. Krishnamoorthi et al.
A new integer image coding technique based on orthogonal polynomials
Image Vision Comput.
(2009)
Z. He et al.
Texture image retrieval based on non-tensor product wavelet filter banks
Signal Process.
(2009)
S.A. Rangachar Kasturi et al.
A survey on the use of pattern recognition methods for abstraction, indexing and retrieval of images and video
Pattern Recognition
(2002)

V.E. Ogle et al.

Chabot: retrieval from a relational database of images

IEEE Comput.

(1995)

A. Pentland et al.

Photobook : content based manipulation of image databases

Int. J. Comput. Vision

(1996)

M. Flickner et al.

Query by image and video content: the QBIC system

IEEE Comput.

(1995)

J.R. Batch, C. Fuller, A. Gupta, A. Hampapur, B. Horowitz, B. Horowitz, R. Humphery, R. Jain, C.F. Shu, The virage...

J.R. Smith et al.

Querying by Color Region using the VisualSEEK Content Based Visual Query System, Intelligent Multimedia Information Retrieval

(1997)

T.S. Huang, S. Mehrotra, K. Ramachandran, Multimedia Analysis and Retrieval System(MARS) Project, in: Proceedings of...

W.Y. Ma, B.S. Manjunath, Netra: a toolbox for navigating large image databases, in: Proceedings of IEEE International...

J. Feder

Towards image content based retrieval for world wide web

J. Adv. Imaging

(1997)

R. Datta, J. Li, J.Z. Wang, Content based image retrieval—approaches and trends of the new age, Proceedings of the 7th...

A. Smeulders et al.

Content based image retrieval at the end of the early years

IEEE Trans. Pattern Anal. Mach. Intell.

(2000)

M.L. Kherfi et al.

Image retrieval from the world wide web: issues, techniques and systems

ACM Comput. Surv.

(2004)

M.S. Lew et al.

Content-based multimedia information retrieval: state of the art and challenges

ACM Trans. Multimedia Comput. Commun. Appl.

(2006)

M. Kokare et al.

A survey on current content based image retrieval methods

IETE J. Res.

(2002)

J. Sklansky

Image segmentation and feature extraction

IEEE Trans. Syst. Man Cybern.

(1978)

H. Tamura et al.

Texture features corresponding to visual perception

IEEE Trans. Syst. Man Cybern.

(1976)

W. Niblack, R. Barber, W. Equitz, M.D. Flickner, E.H. Glasman, D. Petkovicr, C. Faloutsos, G. Taubin, The QBIC Project:...

R.M. Haralick, Statistical and structural approaches to texture, Proceedings of IEEE, vol. 67, no. 5, 1979, pp....

G. Cross et al.

Markov random field texture models

IEEE Trans. Pattern Anal. Mach. Intell.

(1983)

F. Liu et al.

Periodicity, directionality and randomness: wold features for image modeling and retrieval

IEEE Trans. Pattern Anal. Mach. Intell.

(1996)

A.K. Jain et al.

Unsupervised texture segmentation using Gabor filters

Pattern Recognition

(1991)

T. Chang et al.

Texture analysis and classification with tree structured wavelet transform

IEEE Trans. Image Process.

(1992)

A. Laine et al.

Texture classification by wavelet packet signatures

IEEE Trans. Pattern Anal. Mach. Intell.

(1993)

J.P. Eakins, M.E. Gratam, CBIR: A Report to the JISC Technology Applications Program....

C.H. Teh et al.

On image analysis by the methods of moments

IEEE Trans. Pattern Anal. Mach. Intell.

(1988)

G. Taubin, D.B. Cooper, Recognition and positioning of rigid objects using algebraic moment invariants, International...

Cited by (11)

Utilization of rotation-invariant uniform LBP histogram distribution and statistics of connected regions in automatic image annotation based on multi-label learning
2017, Neurocomputing
Citation Excerpt :
Caicedo and Jaafar BenAbdallah, etc. presented a method based on non-negative matrix factorization to generate multimodal image representations that integrate visual features and text information [1]. Krishnamoorthi et al. proposed a image retrieval method with weighted multifeature set based on multiresolution enhanced orthogonal polynomials model and genetic algorithm [18]. In order to reduce the semantics gap brought by the incompliance between the machine understanding of structural image difference by machine learning mechanism [24] and that of human beings perception of difference and similarity in semantic cognitions, accurate image annotation marked by semantic labels can work out the semantic gap puzzle [25] somewhat via whether professional labors or with the help of automatic image semantics annotation in which most of the time can be saved.
A method for automatic image annotation based on multi-feature fusion and multi-label learning algorithm was proposed in this paper. In the process of feature fusion, rotation-invariant uniform local binary pattern histogram distribution and counting of connected regions in image were extracted and utilized fully. Besides traditional n-order color moments and texture information, rotation-invariant uniform LBP histogram distribution, connected regions number, weighted histogram's integral were appended to image features which aided to improve the average precision. Based on multi-label learning k-nearest neighbor algorithm and Corel5 k image data set, comparisons among different dimensional features combinations were made to show that the proposed method outperformed that of traditional one with only basic color moments and texture distribution. The average precision was showed to be improved from 0.2898 to 0.3954 in automatic image annotation in our experimental results.
Quaternion polar complex exponential transform for invariant color image description
2015, Applied Mathematics and Computation
Citation Excerpt :
The second one decomposes the color image into three channels, and then calculates the moment invariants of these three channels separately. Obviously, these two approaches cannot capture the correlation among color channels, and can hardly produce the most compact and effective feature description of color image [15–19]. In recent years, quaternion has been utilized more and more in color image processing, and the use of quaternion-based moment functions to color image has been investigated [17–19].
Moments and moment invariants have been widely used as a basic feature descriptors in image analysis, pattern recognition, and image retrieval. However, they are mainly used to deal with the binary or gray-scale images, which lose some significant color information. Recently, quaternion techniques were introduced to conventional image moments (including Fourier–Mellin moments, Zernike/Pseudo Zernike moments, and Bessel–Fourier moments, etc.) for describing color images, and some quaternion moment and moment invariants were developed. But, the conventional image moments usually cannot effectively capture the image information, especially the edges. Besides, the kernel computation of them involves computation of a number of factorial terms, which inevitably cause the numerical stability of these moments. Based on effective polar complex exponential transform (PCET) and algebra of quaternions, we introduced the quaternion polar complex exponential transform (QPCET) for describing color images in this paper, which can be seen as the generalization of PCET for gray-level images. It is shown that the QPCETs can be obtained from the PCET of each color channel. We derived and analyzed the rotation, scaling, and translation (RST) invariant property of QPCET. We also discussed the problem of color image retrieval using QPCET. Experimental results are provided to illustrate the efficiency of the proposed color image descriptors.
Colour Image Retrieval Based on Adaptive Jseg Image Segmentation and Statistical Distance Measure
2022, SSRN
Quaternion Polar Complex Exponential Transform and Local Binary Pattern-Based Fusion Features for Content-Based Image Retrieval
2021, Lecture Notes in Electrical Engineering
Research on Image Retrieval with Multi-features
2019, Journal of Physics: Conference Series
Weighted feature voting technique for content-based image retrieval
2018, International Journal of Computational Vision and Robotics

View all citing articles on Scopus

Dr. R. Krishnamoorthi received his M.Tech in Computer Science and Engineering at IIT, Kanpur and Ph.D. from IIT, Kharagpur. He has 27 years of teaching and research experience and currently working as Professor and Dean, in Anna University of Technology Trichirappalli, Tamilnadu, India. He has published 24 papers in International journals and presented 51 Conference papers in various International Conferences. He is also an active reviewer for many international journals such as IEEE Transactions on Systems, Man and Cybernetics, IEEE Transactions on Pattern Analysis and Machine Intelligence, Pattern Recognition and Pattern Recognition Letters. He received a sum of Rs. 63.15 Lakhs from various sponsoring agency to promote research. His area of interest is Image Annotation and Retrieval, Digital watermarking, Compression and Biometric.

Mrs. S. Sathiya Devi received her BE in computer Science Engineering from Madras University and M.Tech in Computer Science and Engineering from Pondicherry University. She has 13 years of experience in teaching and 5 years of research experience. She is doing research in Intelligent Image Retrieval. Currently working as an Assistant Professor in Anna University of Technology, Trichirappalli, Tamilnadu, India. Her research interest includes Image retrieval, Web mining and application of soft computing techniques in remote sensing and medical image processing.

View full text

A simple computational model for image retrieval with weighted multifeatures based on orthogonal polynomials and genetic algorithm

Abstract

Introduction

Section snippets

Multiresolution reordering with orthogonal polynomials model coefficients

Proposed rotation invariant feature extraction

Evolution of genetic algorithm

Proposed GA based weight generation

Performance measure

Experiments and results

Conclusion

J. Vis. Commun. Image Represent.

Appl. Math. Comput.

Neurocomputing

Pattern Recognition

Pattern Recognition

Pattern Recognition

Pattern Recognition Lett.

Image Vision Comput.

Signal Process.

A survey on the use of pattern recognition methods for abstraction, indexing and retrieval of images and video

Pattern Recognition

Chabot: retrieval from a relational database of images

IEEE Comput.

Photobook : content based manipulation of image databases

Int. J. Comput. Vision

Query by image and video content: the QBIC system

IEEE Comput.

Querying by Color Region using the VisualSEEK Content Based Visual Query System, Intelligent Multimedia Information Retrieval

Towards image content based retrieval for world wide web

J. Adv. Imaging

Content based image retrieval at the end of the early years

IEEE Trans. Pattern Anal. Mach. Intell.

Image retrieval from the world wide web: issues, techniques and systems

ACM Comput. Surv.

Content-based multimedia information retrieval: state of the art and challenges

ACM Trans. Multimedia Comput. Commun. Appl.

A survey on current content based image retrieval methods

IETE J. Res.

Image segmentation and feature extraction

IEEE Trans. Syst. Man Cybern.

Texture features corresponding to visual perception

IEEE Trans. Syst. Man Cybern.

Markov random field texture models

IEEE Trans. Pattern Anal. Mach. Intell.

Periodicity, directionality and randomness: wold features for image modeling and retrieval

IEEE Trans. Pattern Anal. Mach. Intell.

Unsupervised texture segmentation using Gabor filters

Pattern Recognition

Texture analysis and classification with tree structured wavelet transform

IEEE Trans. Image Process.

Texture classification by wavelet packet signatures

IEEE Trans. Pattern Anal. Mach. Intell.

On image analysis by the methods of moments

IEEE Trans. Pattern Anal. Mach. Intell.