Abstract
The problem of content based image retrieval (CBIR) has traditionally been investigated within a framework that emphasises the explicit formulation of a query: users initiate an automated search for relevant images by submitting an image or draw a sketch that exemplifies their information need. Often, relevance feedback is incorporated as a post-retrieval step for optimising the way evidence from different visual features is combined. While this sustained methodological focus has helped CBIR to mature, it has also brought out its limitations more clearly: There is often little support for exploratory search and scaling to very large collections is problematic. Moreover, the assumption that users are always able to formulate an appropriate query is questionable. An effective, albeit much less studied, method of accessing image collections based on visual content is that of browsing. The aim of this survey paper is to provide a structured overview of the different models that have been explored over the last one to two decades, to highlight the particular challenges of the browsing approach and to focus attention on a few interesting issues that warrant more intense research.





Similar content being viewed by others
References
Anderson J (1983) A spreading activation theory of memory. J Verbal Learn Verbal Behav 22:261–295
Barnard K, Forsyth D (2001) Learning the semantics of words and pictures. In: Proc IEEE int’l conf computer vision, vol 2. IEEE, Piscataway, pp 408–415
Beckmann N, Kriegel H-P, Schneider R, Seeger B (1990) The R*-tree: an efficient and robust access method for points and rectangles. In: Proc int’l conf management of data, Atlantic City, 23–26 May 1990, pp 322–331
Bentley J (1975) Multidimensional binary search trees used for associative searching. Commun ACM 18(9):509–517
Berkhin P (2002) Survey of clustering data mining techniques. Technical report, Accrue Software, San Jose, CA
Beyer K, Goldstein J, Ramakrishnan R, Shaft U (1999) When is ‘nearest neighbour’ meaningful? In: Proc int’l conf data theory, Jerusalem, 10–12 January 1999, pp 217–235
Boucheron L, Creusere C (2005) Lossless wavelet-based compression of digital elevation maps for fast and efficient search and retrieval. IEEE Trans Geosci Remote Sens 43(5):1210–1214
Browne P, Smeaton A (2004) Video information retrieval using objects and ostensive relevance feedback. In: ACM symp applied computing. ACM, New York, pp 1084–1090
Campbell I (2000) The ostensive model of developing information-needs. PhD thesis, University of Glasgow
Carmel E, Crawford S, Chen H (1992) Browsing in hypertext: a cognitive study. IEEE Trans Syst Man Cybern 22:865–884
Chen C, Kuljis J (2003) The rising landscape: a visual exploration of superstring revolutions in physics. J Am Soc Inf Sci Technol 54(5):435–446
Chen C, Morris S (2003) Visualizing evolving networks: minimum spanning trees versus pathfinder networks. In: IEEE symp information visualization. IEEE, Piscataway, pp 67–74
Chen C, Gagaudakis G, Rosin P (2000) Similarity-based image browsing. In: Proc int’l conf intelligent information processing, Beijing, 22 August 2000, pp 206–213
Chen J, Bouman C, Dalton J (1998) Similarity pyramids for browsing and organization of large image databases. In: Proc SPIE conf human vision and electronic imaging III, vol 3299. SPIE, Bellingham, pp 563–575
Chen J, Bouman C, Dalton J (2000) Hierarchical browsing and search of large image databases. IEEE Trans Image Process 9(3):442–455
Cheung S, Zakhor A (2005) Fast similarity search and clustering of video sequences on the World-Wide-Web. IEEE Trans Multimedia 7(3):524–537
Clough P, Joho H, Sanderson M (2005) Automatically organising images using concept hierarchies. In: Proc ACM multimedia workshop (SIGIR), Singapore, 6–11 November 2005
Cox K (1992) Information retrieval by browsing. In: Proc int’l conf new information technology, Hong Kong, 30 November–2 December 1992
Cox K (1995) Searching through browsing. PhD thesis, University of Canberra
Croft B, Parenty T (1985) Comparison of a network structure and a database system used for document retrieval. Inf Syst 10:377–390
Crucianu M, Ferecatu M, Boujemaa N (2004) Relevance feedback for image retrieval: a short review. In: State of the art in audiovisual content-based retrieval, information universal access and interaction including datamodels and languages (DELOS2 report)
Datta R, Joshi D, Li J, Wang J (2008) Image retrieval: ideas, influences and trends of the new age. ACM Trans Comput Surv (in press)
Descampe A, Vleeschouwer C, Iregui M, Macq N, Marqués F (2007) Prefetching and caching strategies for remote and interactive browsing of JPEG2000 images. IEEE Trans Image Process 16(5):1339–1354
Duda R, Hart P, Stork D (2001) Pattern recognition. Wiley, New York
Fauqueur J, Boujemaa N (2006) Mental image search by boolean composition of region categories. Multimed Tools Appl 31(1):95–117
Feng S, Manmatha R, Lavrenko V (2004) Multiple Bernoulli relevance models for image and video annotation. In: Proc int’l conf computer vision and pattern recognition. IEEE, Piscataway, pp 1002–1009
Forsyth D (2001) Benchmarks for storage and retrieval in multimedia databases. In: Proc SPIE conf storage and retrieval for media databases, vol 4676. SPIE, Bellingham, pp 240–247
Fowler R, Wilson B, Fowler W (1992) Information navigator: an information system using associative networks for display and retrieval. Technical report, Department of Computer Science, University of Texas, No. 92-1
Fukunaga K, Narendra P (1975) A branch and bound algorithm for computing k-nearest neighbors. IEEE Trans Comput 24(7):750–753
Furnas G (1986) Generalized fisheye views. In: Proc SIGCHI conf human factors in computing systems, Boston, 13–17 April 1986, pp 16–23
Gevers T, Smeulders A (2004) Content-based image retrieval: an overview. In: Medioni G, Kang S (eds) Emerging topics in computer vision. Prentice Hall, Englewood Cliffs
Goldberger J, Gordon S, Greenspan H (2006) Unsupervised image set clustering using an information theoretic framework. IEEE Trans Image Process 15(2):449–458
Gupta A, Jain R (1997) Visual information retrieval. Commun ACM 40(5):71–79
Guttmann A (1984) R-trees: a dynamic index structure for spatial searching. In: Proc ACM int’l conf management of data (SIGMOD), ACM, New York, pp 47–57
Heesch D (2005) The NNk technique for image searching and browsing. PhD thesis, Imperial College London
Heesch D, Rüger S (2004) NNk networks for content-based image retrieval. In: Proc European conf information retrieval, LNCS 2997. Springer, Berlin Heidelberg New York, pp 253–266
Heesch D, Rüger S (2006) Interaction models and relevance feedback in content-based image retrieval. In: Zhang Y-J (ed) Semantic-based visual information retrieval. Idea-Group, Harrisburg, pp 160–186
Heesch D, Pickering M, Yavlinsky A, Rüger S (2004) Video retrieval within a browsing framework using keyframes. In: Proc TREC video. NIST, Gaithersburg
Heesch D, Yavlinsky A, Rüger S (2006) NNk networks and automated annotation for browsing large image collections from the World Wide Web. In: Proc ACM int’l conf multimedia (SIGMM). ACM, New York, pp 220–224
Hinton G, Roweis S (2002) Stochastic neighbour embedding. In: Advances in neural information processing systems, vol 15. MIT, Cambridge, pp 833–840
Hiroike T, Musha Y, Sugimoto A, Mori Y (1999) Visualization of information spaces to retrieve and browse image data. In: Visual information systems. Morgan Kaufmann, San Francisco, pp 155–162
Jacobs C, Finkelstein A, Salesin D (1995) Fast multiresolution image querying. Technical report, University of Washington, US
Katayama N, Satoh S (1997) SR-tree: an index structure for high-dimensional nearest neighbour queries. In: Proc ACM int’l conf management of data (SIGMOD), ACM, New York, pp 369–380
Keller I, Meiers T, Ellerbrock T, Sikora T (2001) Image browsing with PCA-assisted user-interaction. In: IEEE workshop content-based access of image and video libraries. IEEE, Piscataway, pp 102–108
Kohonen T (2001) Self-organizing maps, volume 30 of Springer series in information sciences. Springer, Berlin Heidelberg New York
Krishnamachari S, Abdel-Mottaleb M (1999) Image browsing using hierarchical clustering. In: IEEE symp computers and communications. IEEE, Piscataway, pp 301–307
Kurniawati R, Jin J, Shepherd J (1997) Techniques for supporting efficient content-based retrieval in multimedia databases. Aust Comput J 29(4):122–130
Laaksonen J, Oja E, Koskela M, Brandt S (2000) Analyzing low-level visual features using content-based image retrieval. In: Proc int’l conf neural information processing, Taejon, 14–18 November 2000
Lavrenko V, Manmatha R, Jeon J (2003) A model for learning the semantics of pictures. In: Advances in neural information processing systems, vol 16. MIT, Cambridge
Lim S, Chen L, Lu G, Smith R (2005) Browsing texture image databases. In: Proc int’l conf multimedia modelling. IEEE, Piscataway, pp 328–333
Liu T, Joung Y (2004) Multi-dimension browse. In: Proc IEEE int’l conf computer software and applications. IEEE, Piscataway, pp 480–485
MacCuish J, McPherson A, Barros J, Kelly P (1996) Interactive layout mechanisms for image database retrieval. In: Proc SPIE conf visual data exploration and analysis III, vol 2656. SPIE, Bellingham, pp 104–115
Milanese R, Squire D, Pun T (1996) Correspondence analysis and hierarchical indexing for content-based image retrieval. In: Proc IEEE int’l conf image processing. IEEE, Piscataway, pp 859–862
Minka T, Picard R (1996) Interactive learning using a society of models. In: Proc IEEE conf computer vision and pattern recognition. IEEE, Piscataway, pp 447–452
Moghaddam B, Tian Q, Lesh N, Shen C, Huang T (2004) Visualization and user-modeling for browsing personal photo libraries. Int J Comput Vis 56(1–2):109–130
Mukhopadhyay R, Ma A, Sethi I (2004) Pathfinder networks for content based image retrieval based on automated shape feature discovery. In: Proc IEEE int’l symp multimedia software engineering. IEEE, Piscataway, pp 522–528
Musha Y, Hiroike A, Mori Y, Sugimoto A (1998) An interface for visualizing feature space in image retrieval. In: Proc IAPR workshop machine vision applications, Chiba, 17–19 November 1998, pp 447–450
Newell A (1990) Unified theories of cognition. Harvard University Press, Cambridge
Nguyen G, Worring M (2008) Interactive access to large image collections using similarity-based visualization. J Vis Lang Comput (in press)
Obdržálek S, Matas J (2005) Sub-linear indexing for large scale object recognition. In: Proc conf British machine vision, Versailles, 5–8 September 2005, pp 1–10
Pečenović Z, Do M, Vetterli M, Pu P (2000) Integrated browsing and searching of large image collections. In: Proc int’l conf advances in visual information systems, LNCS 1929. Springer, Berlin Heidelberg New York, pp 279–289
Platt J, Czerwinski M, Field B (2002) PhotoTOC: automatic clustering for browsing personal photographs. Technical report, Microsoft Research
Rodden K, Basalaj W, Sinclair D, Wood K (2001) Does organization by similarity assist image browsing? In: Proc int’l conf computer human interaction, New Orleans, 5–10 August 2001, pp 190–197
Rogers T, McClelland J (2006) Semantic cognition: a parallel distributed processing approach. MIT, Cambridge
Roussinov D, Chen H (1998) A scalable self-organizing map algorithm for textual classification: a neural network approach to thesaurus generation. Commun Cogn 15(1–2):81–112
Roussopoulos N, Kelley S, Vincent F (1995) Nearest neighbor queries. In: Proc ACM int’l conf management of data (SIGMOD). ACM, New York
Roweis S, Saul L (2000) Nonlinear dimensionality reduction by locally linear embedding. Science 290(5500):2323–2326
Rubner Y, Guibas L, Tomasi C (1997) The earth mover’s distance, multi-dimensional scaling, and color-based image retrieval. In: Proc ARPA image understanding workshop, New Orleans, May 1997, pp 661–668
Rubner Y, Tomasi C, Guibas L (1998) A metric for distributions with applications to image databases. In: Proc IEEE int’l conf computer vision. IEEE, Piscataway, pp 59–66
Salton G, Buckley C (1988) On the use of spreading activation methods in automatic information. In: Proc ACM int’l conf information retrieval (SIGIR). ACM, New York, pp 147–160
Sammon J (1969) A nonlinear mapping for data structure analysis. IEEE Trans Comput C-18(5):401–409
Santini S, Jain R (2000) Integrated browsing and querying for image databases. IEEE Multimed Mag 7(3):26–39
Santini S, Gupta A, Jain R (2001) Emergent semantics through interaction in image databases. IEEE Trans Knowl Data Eng 13(3):337–351
Schvanefeldt R (1990) Pathfinder associative networks: studies in knowledge organization. In: Ablex series in computational sciences. Ablex, Norwood
Schvaneveldt R, Durso F, Dearholt D (1989) Network structures in proximity data. In: Bower GH (ed) The psychology of learning and motivation. Academic, London, pp 249–284
Sclaroff S, Taycher L, La Cascia M (1997) ImageRover: a content-based image browser for the World Wide Web. Technical report, Boston University
Sloutsky V (2003) The role of similarity in the development of categorization. Trends Cogn Sci 7(6):246–252
Smeulders A, Worring M, Santini S, Gupta A, Jain R (2000) Content based image retrieval at the end of the early years. IEEE Trans Pattern Anal Mach Intell 22(12):1349–1380
Smith J, Chang S-F (1996) VisualSEEk: a fully automated content-based image query system. In: Proc ACM int’l conf multimedia (SIGMM). ACM, New York
Spence R (1999) A framework for navigation. Int J Hum Comput Stud 51:919–945
Tenenbaum J (2006) Theory-based bayesian models of inductive learning and reasoning. Trends Cogn Sci 10(7):309–317
Tenenbaum J, de Silva V, Langford J (2000) A global geometric framework for nonlinear dimensionality reduction. Science 290(5500):2319–2323
Tishby N, Pereira F, Bialek W (1999) The information bottleneck method. In: Proc allerton conf communication, control and computing, Monticello, September 1999, pp 368–377
Urban J, Jose J, van Rijsbergen C (2003) An adaptive approach towards content-based image retrieval. In: Proc int’l workshop content-based multimedia indexing, Rennes, 22–24 September, pp 119–126
Vendrig J, Worring M, Smeulders A (1999) Filter image browsing: exploiting interaction in image retrieval. In: Visual information and information systems, Amsterdam, 2–4 June 1999, pp 147–154
Wang Q, You S (2006) Fast similarity search for high-dimensional datasets. In: Proc IEEE int’l symp multimedia. IEEE, Piscataway
Weber R, Blott S (1997) An approximation based data structure for similarity search. Technical Report 24, ETH Zurich, Switzerland
Weber R, Schek J-J, Blott S (1998) A quantitative analysis and performance study for similarity-search methods in high-dimensional space. In: Proc int’l conf very large databases, New York, 24–27 August 1998, pp 194–205
White D, Jain R (1996) Similarity indexing with the SS-tree. In: Proc IEEE int’l conf data engineering. IEEE, Piscataway, pp 516–523
Wolfram S (2004) A new kind of science. Wolfram, Champaign
Yang C (2004) Content-based image retrieval: a comparison between query by example and image browsing map approaches. J Inf Sci 30(3):254–267
Yang J, Fan J, Hubball D, Gao Y, Luo H, Ribarsky W, Ward M (2006) Semantic image browser: bridging information visualization with automated intelligent image analysis. In: IEEE symp visual analytics science and technology. IEEE, Piscataway, pp 191–198
Yavlinsky A, Heesch D (2007) An online system for gathering image similarity judgements. In: Proc ACM int’l conf multimedia (SIGMM). ACM, New York, pp 565–568
Yavlinsky A, Schofield E, Rüger S (2005) Automated image annotation using global features and robust nonparametric density estimation. In: Proc int’l conf video and image retrieval, LNCS 3568. Springer, Berlin Heidelberg New York, pp 507–517
Yeung M, Liu B (1995) Efficient matching and clustering of video shots. In: Proc IEEE int’l conf image processing. IEEE, Piscataway, pp 338–341
Yeung M, Yeo B-L (1997) Video visualization for compact presentation and fast browsing of pictorial content. IEEE Trans Circuits Syst Video Technol 7(5):771–785
Zass R, Shashua A (2005) A unified treatment of hard and probabilistic clustering methods. In: Proc int’l conf computer vision. IEEE, Piscataway
Zhang H, Zhong D (1995) A scheme for visual feature based image indexing. In: Proc SPIE/IS&T conf storage and retrieval for image and video databases III, vol 2420. SPIE, Bellingham, pp 36–46
Zhang R, Zhang Z, Li M, Ma W-Y, Zhang H-J (2005) A probabilistic semantic model for image annotation and multi-modal image retrieval. In: Proc int’l conf computer vision. IEEE, Piscataway, pp 846–851
Zhou X, Huang T (2003) Relevance feedback in image retrieval: a comprehensive review. ACM Multimed Syst 8(6):536–544
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Heesch, D. A survey of browsing models for content based image retrieval. Multimed Tools Appl 40, 261–284 (2008). https://doi.org/10.1007/s11042-008-0207-2
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-008-0207-2