A new SVM-based relevance feedback image retrieval using probabilistic feature and weighted kernel function

doi:10.1016/j.jvcir.2016.03.008

Journal of Visual Communication and Image Representation

Volume 38, July 2016, Pages 256-275

https://doi.org/10.1016/j.jvcir.2016.03.008 Get rights and content

Highlights

•
By using PCA and AGMM, probabilistic features are extracted and used for fast image retrieval.
•
By using improved Relief algorithm, all training sample’ weight values are computed and utilized for feedback.
•
SVM kernel function is optimized dynamically according to the feedback samples’ weight values.

Abstract

Relevance feedback (RF) is an effective approach to bridge the gap between low-level visual features and high-level semantic meanings in content-based image retrieval (CBIR). The support vector machine (SVM) based RF mechanisms have been used in different fields of image retrieval, but they often treat all positive and negative feedback samples equally, which will inevitably degrade the effectiveness of SVM-based RF approaches for CBIR. In fact, positive and negative feedback samples, different positive feedback samples, and different negative feedback samples all always have distinct properties. Moreover, each feedback interaction process is usually tedious and time-consuming because of complex visual features, so if too many times of iteration of feedback are asked, users may be impatient to interact with the CBIR system. To overcome the above limitations, we propose a new SVM-based RF approach using probabilistic feature and weighted kernel function in this paper. Firstly, the probabilistic features of each image are extracted by using principal components analysis (PCA) and the adapted Gaussian mixture models (AGMM) based dimension reduction, and the similarity is computed by employing Kullback–Leibler divergence. Secondly, the positive feedback samples and negative feedback samples are marked, and all feedback samples’ weight values are computed by utilizing the samples-based Relief feature weighting. Finally, the SVM kernel function is modified dynamically according to the feedback samples’ weight values. Extensive simulations on large databases show that the proposed algorithm is significantly more effective than the state-of-the-art approaches.

Introduction

Over the past years, content-based image retrieval (CBIR) applications have become increasingly popular. Two potential reasons for that are the vastly growing amount of digital image data and the free access to public image repositories. Both reasons are obviously connected to the dramatic price decline in the digital camera marked. Systems that allow searching, sorting, or retrieval of visual content have turned out to be essential in order to handle the vast amount of data [1]. Many CBIR systems have been developed, including QBIC, Photobook, MARS, NeTra, PicHunter, Blobworld, VisualSEEK, SIMPLIcity, and others [2]. In a typical CBIR system, low-level visual image features (e.g., color, texture, and shape) are automatically extracted for image descriptions and indexing purpose [3]. To search for desirable images, a user presents an image as an example of similarity, and the system returns a set of similar images based on the extracted features. Despite the power of the search strategies, it is very difficult to optimize the retrieval quality of CBIR within only one query process. The hidden problem is that the extracted visual features are too diverse to capture the concept of the user’s query. To solve such problems, the users can pick up some preferred images to refine the image explorations iteratively. The feedback procedure, called relevance feedback (RF), repeats until the user is satisfied with the retrieval results [4], [5].

RF, originally developed for text retrieval systems, refers to a set of approaches learning from an assortment of users’ browsing behaviors on image retrieval [6]. The main idea of RF is to let the user guide the system. The conventional process of RF is as follows: (1) from the retrieved images, the user labels a number of relevant samples as positive feedbacks, and a number of irrelevant samples as negative feedbacks and (2) the CBIR system then refines its retrieval procedure based on these labeled feedback samples to improve retrieval performance. These two steps can be carried out iteratively. As a result, the performance of the system can be enhanced by gradually learning the user’s preferences.

In recent years, there is an unprecedented development in the RF CBIR field, and many RF methods have been introduced. These RF schemes can be roughly divided into Query reweighting, Query point movement, Query expansion, and support vector machine (SVM) based methods [6], [7], [8], as shown in Fig. 1.

In the query reweighting approach, each image is modeled as a set of low-level visual features, each of which is associated with a set of representations. Relevance feedback is achieved by dynamically updating the weights of visual features, as well as the weights of feature representations, in order to accommodate the information needs of the user. Query Reweighting (QR) approaches were first proposed by Rui et al. [9], which convert image feature vectors to weighted-term vectors in early version of MARS. Chang and Chen [10] used genetic algorithms to reweight a user’s query vector, based on the user’s relevance feedback, to improve the performance of information retrieval systems. Based on region representations, Kim and Yu [11] proposed a new region filtering and region weighting method, which filters out unnecessary regions from images and learns region importance from the region size and the spatial location of regions in an image. They weighted the regions optimally and improved the performance of the region-based retrieval system based on relevance feedback. For the query reweighting approaches, no matter how the weighted or generalized distance function is adapted, the diverse visual features extremely limit the effort of image retrieval.

Another solution for enhancing the accuracy of image retrieval is moving the query point toward the contour of the user’s preference in feature space. Query point movement regards multiple positive examples as a new query point at each feedback. After several forceful changes of location and contour, the query point should be close to a convex region of the user’s interest. Liu et al. [12] proposed an efficient query point movement methods, which is able to reach any given target image with fewer iterations in the worst and average cases. They also considered the strategies to minimize the effects of users’ inaccurate relevance feedback. Nguyen et al [13] presented a cluster-based relevance feedback method, which combines two popular techniques of relevance feedback: query point movement and query expansion. From a single point initial query, query expansion provides a multiple point query, which is then enhanced using query point movement. Su et al. [14] proposed an efficient relevance feedback for content-based image retrieval, which made use of three kinds of query refinement strategies, Query Point Movement (QPM), Query Reweighting (QR), and Query Expansion (QEX), to converge the search space toward the user’s intention effectively. Wang et al. [15] proposed a new relevance feedback mechanism, which combines query point movement method and rough set theory. However, the modified query point of each feedback probably moves toward the local optimal centroid. Thus, global optimal results are not easily touched in query point movement like work.

Query expansion usually refers to the technique that uses blind relevance feedback to expand a query with new query terms, and reweigh the query terms, by taking into account a pseudo relevance set. Usually, the pseudo relevance set consists of the top ranked images returned by the first-pass retrieval. These top-ranked images are assumed to be relevant to the topic. Query expansion has proved to be an effective technique for image retrieval [16]. Chum et al. [17] bring query expansion into the visual domain via two novel contributions. Firstly, strong spatial constraints between the query image and each result allow us to accurately verify each return, suppressing the false positives which typically ruin text-based query expansion. Secondly, the verified images can be used to learn a latent feature model to enable the controlled construction of expanded queries. Okabe and Yamada [18] made two refinements to a well-known query expansion method. One uses transductive learning to obtain pseudo relevant images, thereby increasing the total number of source images from which expansion terms can be extracted. The other is a modified parameter estimation method that aggregates the predictions of multiple learning trials to sort candidate terms for expansion by importance. Singh et al. [19] propose a query expansion framework which explores user’s real time implicit feedback provided at the time of search to determine user’s search context and identify relevant query expansion terms. Kaczmarek [20] concerns clustering-by-directions algorithm. The algorithm introduces a novel approach to interactive query expansion. It is designed to support users of search engines in forming Web search queries. When a user executes a query, the algorithm shows potential directions in which the search can be continued. Rahman et al. [21] presented an image retrieval framework based on automatic query expansion in a concept feature space by generalizing the vector space model of information retrieval. Weerkamp et al. [22] proposed a general generative query expansion model that uses external document collections for query expansion: the external expansion model (EEM). The main rationale behind the model is the hypothesis that each query requires its own mixture of external collections for expansion and that an expansion model should account for this. The effectiveness of query expansion is usually better than those of query reweighting and query point movement. Nevertheless, there are still some problems unsolved for query expansion. For example, the relevant query points are too many to be efficient. Adjusting the disjunctive queries causes the expensive search cost and the results cannot escape from the restricted range that the users are able to specify. On the whole, query expansion brings out higher computation cost and more feedbacks in RF.

More recent work on image retrieval treats the relevance feedback problem as a classification problem in which machine learning techniques, for example, support vector machines (SVM), are employed. In such a paradigm, the goal is to discover a decision boundary based on relevant and irrelevant image information collected from the user. Rahman et al. [23] presented a classification-driven biomedical image retrieval framework based on image filtering and similarity fusion by employing supervised learning techniques. In this framework, the probabilistic outputs of a multiclass SVM classifier as category prediction of query and database images are exploited at first to filter out irrelevant images, thereby reducing the search space for similarity matching. Huang et al. [24] proposed a novel paired feature AdaBoost learning system for relevance feedback-based image retrieval. To facilitate density estimation in the feature learning, the author proposed an ID3-like balance tree quantization method to preserve most discriminative information. By using paired feature combination, they mapped all training samples obtained in the relevance feedback process onto paired feature spaces and employed the AdaBoost algorithm to select a few feature pairs with best discrimination capabilities in the corresponding paired feature spaces. Li et al. [25] proposed a method to alleviate the small sample problem in SVM based RF by using semi-supervised active learning algorithm which uses a large amount of unlabeled data together with labeled data to build better models. Wang et al. [26] presented a SVM relevance feedback CBIR algorithm based on feature reconstruction, in which the covariance matrix based kernel empirical orthogonal complement component analysis is utilized. Zagoris et al. [27] proposed an MPEG-like descriptor that contains conventional contour and region shape features with a wide applicability from any arbitrary shape to document retrieval through word spotting. Its size and storage requirements are kept to minimum without limiting its discriminating ability. In addition to that, a RF technique based on SVMs is provided that employs the proposed descriptor with the purpose to measure how well it performs with it. Zhang et al. [28] proposed a novel method of relevance feedback based on support vector machine learning in the content-based image retrieval system. A SVM classifier can be learned from training data of relevance images and irrelevance images marked by users. Using the classifier, the system can retrieve more images relevant to the query in the database efficiently. Liu et al. [29] proposed a SVM-based active feedback in image retrieval using clustering and unlabeled data, in which a new active selection criterion to select images for user’s feedback is designed, and unlabeled images are incorporated within the co-training framework. Lee and Lee [30] proposed an enhanced SVM which not only better selects positive/negative examples considering the reliability of the spoken segments, but emphasizes more on more reliable training examples by modifying the SVM formulation. Besides, Jun et al. [31] proposed a novel ranking model based on the learning to rank framework, in which visual features and click features are simultaneously utilized. Specifically, the proposed approach is based on large margin structured output learning and the visual consistency is integrated with the click features through a hypergraph regularizer term. Jun et al. [32] proposed a multimodal hypergraph learning-based sparse coding method for image click prediction, and applied the obtained click data to the reranking of images. In order to further improve the relevance feedback performance, Tao et al. [33] built an asymmetric bagging and random subspace SVM (ABRS-SVM) by integrating asymmetric bagging-based SVM (AB-SVM) and random subspace SVM (RS-SVM). Yu et al. [34] proposed a novel multi-view hypergraph-based learning (MHL) method that adaptively integrates click data with varied visual features. In particular, MHL considers pairwise discriminative constraints from click data to maximally distinguish images with high click counts from images with no click counts.

Owing to strong generalization capability, SVM-based relevance feedback (RF) image retrieval has become an active area of research, and many SVM-based RF approaches have been proposed in recent years. But, the existing SVM-based RF schemes have two main drawbacks: First, each feedback interaction is usually tedious and time-consuming because original visual features always have poor adaptability, flexibility and robustness. Second, the RF mechanisms treat all positive and negative feedback samples equally, which will inevitably degrade the effectiveness of SVM-based RF approaches for CBIR. In fact, positive and negative training samples, different positive training samples, and different negative training samples all always have distinct properties [35].

In this paper, we present a new SVM-based RF approach using probabilistic feature and weighted kernel function. The novelty of the proposed approach includes: (1) Probabilistic features, which have a lot of merits, such as adaptability to the data, modeling flexibility and robustness, are utilized instead of the original features. (2) All training sample’ weight values are adaptively computed using fast Relief algorithm, and SVM kernel function is optimized dynamically, which can improve significantly the effectiveness of SVM-based RF approaches.

The rest of this paper is organized as follows. Section 2 discusses the probabilistic features extraction and similarity computation. In Section 3, the feedback samples based SVM Kernel function weighting is introduced. Section 4 contains the description of our SVM-based relevance feedback system. Simulation results in Section 5 will show the performance of our scheme. Finally, Section 6 concludes this presentation.

Section snippets

Adapted Gaussian mixture models (AGMM)

Mixture models are a well-established methodology to model probability density functions (PDFs), which has been proven to have a lot of merits, such as adaptability to the data, modeling flexibility, and robustness. Gaussian mixture models (GMM) are very popular and promising PDF models, and GMM have been shown to provide powerful tools in several image processing and related applications, such as data classification, information retrieval, and image segmentation [36]. The EM algorithm is the

A improved Relief feature weighting

Feature selection is one of the fundamental problems in machine learning. Not only can its proper design reduce system complexity and processing time, but it can also enhance system performance in many cases. It becomes even more critical to the success of a machine learning algorithm in problems involving a large amount of irrelevant features. The research on feature selection is very active in the past decade. The existing feature selection algorithms can be generally categorized as filter

Image retrieval system

Due to the limitations in low-level feature representations and motivated by advances in machine learning, we present a SVM-based relevance feedback image retrieval framework that uses probabilistic feature and weighted kernel function. Fig. 2 describes our CBIR system framework with relevance feedback. From Fig. 2, we can see that our CBIR system has four main components: query, retrieval, labeling, and learning. When a query is input, its low level visual features are extracted. Then, all

Experimental results

To evaluate the performance of the proposed algorithm [46], [47], we conduct an extensive set of CBIR experiments by comparing the proposed algorithm to several state-of-the-art feedback methods [24], [26], [28], [29] that have been used in image retrieval. Here,

•
RF Methods proposed by Huang et al. [24]: a novel paired feature AdaBoost learning system for relevance feedback-based image retrieval;
•
RF Methods proposed by Wang et al. [26]: a SVM relevance feedback CBIR algorithm based on feature

Conclusion

In this paper, we have presented a new SVM-based relevance feedback image retrieval using probabilistic feature and weighted kernel function. The novelty of our work lies in the use of an improved SVM classifier, and in which: (1) the probabilistic features are extracted by using PCA and the AGMM based dimension reduction, and the similarity is computed by employing Kullback–Leibler divergence, (2) the samples’ weight values are computed by utilizing the samples-based Relief feature weighting,

Acknowledgements

This work was supported by the National Natural Science Foundation of China under Grant No. 61472171 & 61272416, and Liaoning Research Project for Institutions of Higher Education of China under Grant No. L2013407.

References (47)

D.H. Kim et al.
A new region filtering and region weighting approach to relevance feedback in content-based image retrieval
J. Syst. Softw.
(2008)
Y. Wang et al.
Interactive relevance feedback mechanism for image retrieval using rough set
Knowl.-Based Syst.
(2006)
M.M. Rahman et al.
A query expansion framework in image retrieval domain based on local and global analysis
Inf. Process. Manage.
(2011)
Xiang-Yang Wang et al.
An image retrieval scheme with relevance feedback using feature reconstruction and SVM reclassification
Neurocomputing
(2014)
K. Zagoris et al.
Image retrieval systems based on compact shape descriptor and relevance feedback information
J. Vis. Commun. Image Represent.
(2011)
Rujie Liu et al.
SVM-based active feedback in image retrieval using clustering and unlabeled data
Pattern Recogn.
(2008)
M.H. Zangooei et al.
PSSP with dynamic weighted kernel fusion based on SVM-PHGS
Knowl.-Based Syst.
(2012)
S. Ozer et al.
A set of new Chebyshev kernel functions for support vector machine pattern classification
Pattern Recogn.
(2011)
R. Datta et al.
Image retrieval: ideas, influences, and trends of the new age
ACM Comput. Surv.
(2008)
W. Hu et al.
A survey on visual content-based video indexing and retrieval
IEEE Trans. Syst. Man Cybern. Part C: Appl. Rev.
(2011)

L. Zhang et al.

Geometric optimum experimental design for collaborative image retrieval

IEEE Trans. Circ. Syst. Video Technol.

(2014)

M. Wang et al.

Active Learning in multimedia annotation and retrieval: a survey

ACM Trans. Intell. Syst. Technol.

(2011)

T. Mei et al.

Multimedia search reranking: a literature survey

ACM Comput. Surv.

(2014)

J. Li et al.

Relevance feedback in content-based image retrieval: a survey

Wu Jun et al.

Learning a hybrid similarity measure for image retrieval

Pattern Recogn.

(2013)

G.T. Papadopoulos et al.

Gaze-based relevance feedback for realizing region-based image retrieval

IEEE Trans. Multimedia

(2014)

Y. Rui, T.S. Huang, S. Mehrotra, Content-based image retrieval with relevance feedback in MARS, in: Proceedings of IEEE...

Y.C. Chang et al.

A new query reweighting method for document retrieval based on genetic algorithms

IEEE Trans. Evol. Comput.

(2006)

D. Liu et al.

Fast query point movement techniques for large CBIR systems

IEEE Trans. Knowl. Data Eng.

(2009)

N.V. Nguyen et al.

Cluster-based relevance feedback for CBIR: a combination of query point movement and query expansion

J. Ambient Intell. Humanized Comput.

(2012)

J.H. Su et al.

Efficient relevance feedback for content-based image retrieval by mining user navigation patterns

IEEE Trans. Knowl. Data Eng.

(2011)

C. Carpineto et al.

A survey of automatic query expansion in information retrieval

ACM Comput. Surv. (CSUR)

(2012)

O. Chum, J. Philbin, J. Sivic. Total recall: Automatic query expansion with a generative feature model for object...

Cited by (33)

Interactive object retrieval via cost-minimizing queries
2018, Applied Soft Computing Journal
Citation Excerpt :
Then, it needs extra labeling costs to obtain enough positive examples for an effective training set. In hopes of making full use of limited human resources, interactive retrieval systems are being investigated in many laboratories around the world, and a vast body of literature has been accrued in this regard using current techniques of active learning [11–14]. In a typical interactive retrieval system, a small number of the target and backgrounds are labeled as the training set.
Studies using machine learning algorithms have documented advantage in applications of visual object retrieval. While interactive techniques have also been reported, the relevance feedback by minimizing the costs of time and annotation has not been investigated due to the requirement of data storage and online toolbox. In this work, we present an interactive scheme of object retrieval coupled with cost-minimizing queries in order to provide accurate results in a short time. We investigate a fast mechanism of relevance feedback to reduce time expense and explore the most reliable annotator in crowds to control annotation costs. Our experimental results demonstrate the effectiveness and efficiency of proposed framework.
2D-LPCCA and 2D-SPCCA: Two new canonical correlation methods for feature extraction, fusion and recognition
2018, Neurocomputing
Citation Excerpt :
Data classification and dimensionality reduction are two important procedures in the fields of machine learning and pattern recognition. K-nearest neighbors (KNN) [1], support vector machines (SVM) [2], twin SVM (TSVM) [3], naive bayes [4], and so on, are common classifiers, and have been used in many applications, such as pattern recognition [5,6], image retrieval [7], remotely sensed images [8], text analysis and retrieval [9], information fusion [10] and bioinformatics. But the data that these classifiers used always have very high dimensionality, processing this data needs a lot of memory and time.
Two-dimensional canonical correlation analysis (2D-CCA) is an effective and efficient method for two-view feature extraction and fusion. Since it is a global linear method, it fails to find the nonlinear correlation between different features. In contrast, in this paper we propose a novel two-view method named as two-dimensional locality preserving canonical correlation analysis (2D-LPCCA), which uses the neighborhood information to discover the intrinsic structure of data. In other words, it uses many local linear problems to approximate the global nonlinear case. In addition, inspired by sparsity preserving projections (SPP), the two-dimensional sparsity preserving canonical correlation analysis (2D-SPCCA) framework is also developed, which consists of three models. Experimental results on real world databases demonstrate the viability of the formulation, they also show that the classification results of our methods are higher than the other’s.
A new kernel function of support vector regression combined with probability distribution and its application in chemometrics and the QSAR modeling
2017, Chemometrics and Intelligent Laboratory Systems
Citation Excerpt :
Joachims [15] and Vapnik formulated methods based on cross-validation. Wang [16] modified the SVR kernel function by adding the feedback weight values of the samples, which constantly change, to the inner product matrix. Tian [17] proposed a novel SVR based on cluster assumption and equipped with different kernel methods.
Quantitative structure–activity relationship (QSAR) models are extensively used to identify new chemicals affecting human health and speed up the drug discovery process. The development of accurate QSAR models can lead to a reduced number of experiments conducted on rats and mice to analyze new compounds. In a typical QSAR model, only the relationship among variables is considered, and the probability distribution of the samples is disregarded. Thus, a new kernel function of support vector regression (SVR) that integrates probability distribution is proposed. The proposed kernel function, called SVR-pk, satisfies kernel function theory, and the mean and variance of the sample are used to reflect the main distribution information. To verify the performance of the new kernel function, simulation example, two sets of data from UCI (University of California, Irvine) and two experiments about the compounds toxicity in rodents data obtained from the Carcinogenic Potency Database are employed. Results show that compared with other SVR models utilizing kernel functions, SVR-pk exhibits better performance and is more suitable for QSAR model.
Study on Screening Parameter Optimization of Wet Sand and Gravel Particles Using the GWO-SVR Algorithm
2023, Processes
ExpertosLF: dynamic late fusion of CBIR systems using online learning with relevance feedback
2023, Multimedia Tools and Applications
A Relief-PGS algorithm for feature selection and data classification
2023, Intelligent Data Analysis

View all citing articles on Scopus

^☆: This paper has been recommended for acceptance by Dacheng Tao.

View full text

A new SVM-based relevance feedback image retrieval using probabilistic feature and weighted kernel function☆

Highlights

Abstract

Introduction

Section snippets

Adapted Gaussian mixture models (AGMM)

A improved Relief feature weighting

Image retrieval system

Experimental results

Conclusion

Acknowledgements

J. Syst. Softw.

Knowl.-Based Syst.

Inf. Process. Manage.

Neurocomputing

J. Vis. Commun. Image Represent.

Pattern Recogn.

Knowl.-Based Syst.

Pattern Recogn.

Image retrieval: ideas, influences, and trends of the new age

ACM Comput. Surv.

A survey on visual content-based video indexing and retrieval

IEEE Trans. Syst. Man Cybern. Part C: Appl. Rev.

Geometric optimum experimental design for collaborative image retrieval

IEEE Trans. Circ. Syst. Video Technol.

Active Learning in multimedia annotation and retrieval: a survey

ACM Trans. Intell. Syst. Technol.

Multimedia search reranking: a literature survey

ACM Comput. Surv.

Relevance feedback in content-based image retrieval: a survey

Learning a hybrid similarity measure for image retrieval

Pattern Recogn.

Gaze-based relevance feedback for realizing region-based image retrieval

IEEE Trans. Multimedia

A new query reweighting method for document retrieval based on genetic algorithms

IEEE Trans. Evol. Comput.

Fast query point movement techniques for large CBIR systems

IEEE Trans. Knowl. Data Eng.

Cluster-based relevance feedback for CBIR: a combination of query point movement and query expansion

J. Ambient Intell. Humanized Comput.

Efficient relevance feedback for content-based image retrieval by mining user navigation patterns

IEEE Trans. Knowl. Data Eng.

A survey of automatic query expansion in information retrieval

ACM Comput. Surv. (CSUR)