Skip to main content
Log in

Context-sensitive queries for image retrieval in digital libraries

  • Published:
Journal of Intelligent Information Systems Aims and scope Submit manuscript

Abstract

In this paper we show how to achieve a more effective Query By Example processing, by using active mechanisms of biological vision, such as saccadic eye movements and fixations. In particular, we discuss the way to generate two fixation sequences from a query image I q and a test image I t of the data set, respectively, and how to compare the two sequences in order to compute a similarity measure between the two images. Meanwhile, we show how the approach can be used to discover and represent the hidden semantic associations among images, in terms of categories, which in turn drive the query process.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Baeza-Yates, R., Cunto, W., Manber, U, & Wu, S. (1994). Proximity matching using fixed-queries trees. In Proceedings of the Fifth Combinatorial Pattern Matching(CPM94), Lecture Notes in Computer Science, vol. 807 (pp. 198–212).

  • Ballard, D. (1991). Animate vision. Artificial Intelligence, 48, 57–86. (London, UK: Springer)

    Article  Google Scholar 

  • Burkhard, W., & Keller, R. (1973). Some approaches to best-match file searching. Communications of the ACM, 16(4), 230–236.

    Article  MATH  Google Scholar 

  • Banerjee, A., Dhillon, I. S., Ghosh, J., & Sra, S. (2003). Clustering on hyperspheres using expectation maximization. Technical report TR-03-07, Department of Computer Sciences, University of Texas, (February).

  • Boccignone, G. Chianese, A., Moscato, V., & Picariello, A. (2005). Foveated Shot Detection for Video Segmentation. IEEE Transactions on Cicuits and Systems for Video Technology, 15(3), 365–377 (Marzo).

    Article  Google Scholar 

  • Carson, C., Belongie, S., Greenspan, H., & Malik, J. (2002). Blobworld: Image segmentation using expectation-maximization and its application to image querying. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(8), 1026–1038.

    Article  Google Scholar 

  • Celeux, G., & Govaert, G. (1992). A classification EM algorithm for clustering and two stochastic versions. Computational Statistics & Data Analysis, 14, 315–332.

    Article  MATH  MathSciNet  Google Scholar 

  • Chavez, E., Navarro, G., Baeza-Yates, R., & Marroquin, J. M. (2001). Searching in metric space. ACM Computing Surveys, 33, 273–321.

    Article  Google Scholar 

  • Ciaccia, P., Patella, M., & Zezula, P. (1997). M-tree: An efficient access method for similarity search in metric spaces. In Proc. of 23rd International Conference on VLDB, pp. 426–435.

  • Colombo, C., Del Bimbo, A., & Pala, P. (1999). Semantics in visual information retrieval. IEEE MultiMedia, 6(3), 38–53.

    Article  Google Scholar 

  • Corridoni, J. M., Del Bimbo, A., & Pala, P. (1999). Image retrieval by color semantics, Multimedia Systems, 7(3), 175–183.

    Article  Google Scholar 

  • Del Bimbo, A., Mugnaini, M., Pala, P., & Turco, F. (1998). Visual querying by color perceptive regions. Pattern Recognition, 31(9), 1241–1253.

    Article  Google Scholar 

  • Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977) Maximum likelihood from incomplete data. Journal of the Royal Statistical Society, 39, 1–38.

    MATH  MathSciNet  Google Scholar 

  • Duygulu, P., Barnard, K., de Freitas, N., & Forsyth, D. (2002). Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. In Seventh European Conference on Computer Vision, pp. 97–112.

  • Djeraba, C. (2003). Association and content-based retrieval. IEEE Transactions on Knowledge and Data Engineering, 15(1), 118–135.

    Article  Google Scholar 

  • Edelman, S. (2002). Constraining the neural representation of the visual world. Trends in Cognitive Science, 6(3), 125–131.

    Article  Google Scholar 

  • Fan, W., Davidson, I., Zadrozny, B., & Yu, P. S. (2005). An improved categorization of classifier’s sensitivity on sample selection bias. In Proocedings of International Conference on Data Mining (ICDM05), pp. 605–608.

  • Fryer, R. G., & Jackson, M. O. (2003). Categorical cognition: A psychological model of categories and identification in decision making. NBER Working Paper no. W9579, March.

  • Hare, J. S., & Lewis, P. H. (2004). Salient regions for query by image content. Image and Video Retrieval (CIVR 2004), Dublin, Ireland, pp. 317–325, Springer ed.

  • Hare, J. S. & Lewis, P. H. (2005). On image retrieval using salient regions with vector-spaces and latent semantics. Image and Video Retrieval (CIVR 2005), Singapore, Springer Ed.

  • Itti, L., Koch, C., & Niebur, E. (1998). A model of saliency based visual attention for rapid scene analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 20, 1254–1259.

    Article  Google Scholar 

  • MacKay, D. J. C. (2003). Information theory, inference, and learning algorithms. UK: Cambridge University Press.

    MATH  Google Scholar 

  • Mallat, S. (1998). A wavelet tour of signal processing. San Diego, CA: Academic Press.

    MATH  Google Scholar 

  • MPEG-7 (1999). Visual part of eXperimentation Model (XM) version 2.0. MPEG-7 Output Document ISO/MPEG.

  • Neal, R. M., & Hinton, G. E. (1998). A view of the EM algorithm that justifies incremental, sparse, and other variants. M. J. Jordan (Ed.), Learning in graphical models (pp. 355–368). Cambridge, MA: MIT.

    Google Scholar 

  • Newsam, S., Sumengen, B., & Manjunath, B. S. (2001). Category-based image retrieval. In International Conference on Image Processing (ICIP), pp. 596–599.

  • Noton, D., & Stark, L. (1990). Scanpaths in the saccdice eye movements during pattern perception. Visual Research, 11, pp. 929–942.

    Google Scholar 

  • Santini, S. (2000). Evaluation vademecum for visual information systems. In Proc. of SPIE, vol. 3972. San Jose, USA.

  • Santini, S., Gupta, A., & Jain, R. (2001). Emergent Semantics through Interactions in image databases. IEEE Transactions on Knowledge and Data Engineering, 13, 337–351.

    Article  Google Scholar 

  • Sebe, N., Tian, Q., Loupias, E., Lew, M., & Huang, T. (2003). Evaluation of salient point techniques. Image and Vision Computing, 21, 1087–1095.

    Article  Google Scholar 

  • Smeulders, A. W. M., Worring, M., Santini, S., Gupta, A., & Jain, R. (2000). Content-based image retrieval at the end of the early years. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22, 1349–1379.

    Article  Google Scholar 

  • Uhlmann, J. (1991). Satisfying general proximity/similarity queries with metric trees. Information Processing Letters, 40, 175–179.

    Article  MATH  Google Scholar 

  • Walker-Smith, G. J., Gale, A. G., & Findlay, J. M. (1997). Eye movement strategies involved in face perception. Perception, 6, 313–326.

    Article  Google Scholar 

  • Wang, J. Z., Li, J., & Wiederhold, G. (2001). SIMPLIcity: Semantics-sensitive integrated matching for pictures libraries. IEEE Transactions on Pattern Analysis and Machine Intelligence, 23, 1–16, (Sept.)

    Google Scholar 

  • Yamanishi, K., Takeuchi, J.-I., Williams, G., & Melne, P. (2004). On-line unsupervised outlier detection using finite mixtures with discounting learning algorithms. Data Mining and Knowledge Discovery, 8, 275–300.

    Article  MathSciNet  Google Scholar 

  • Yu, D., & Zhang, A. (2003). ClusterTree: Integration of cluster representation and nearest-neighbor search for large data sets with high dimensions. IEEE Transactions on Knowledge and Data Engineering, 15(5), 1316–1337.

    Article  MathSciNet  Google Scholar 

  • Zhong, S. & Ghosh, J. (2003). A unified framework for model-based clustering. Journal of Machine Learning Research, 4, 1001–1037.

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to V. Moscato.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Boccignone, G., Chianese, A., Moscato, V. et al. Context-sensitive queries for image retrieval in digital libraries. J Intell Inf Syst 31, 53–84 (2008). https://doi.org/10.1007/s10844-007-0040-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10844-007-0040-5

Keywords

Navigation