Skip to main content

Exploit camera metadata for enhancing interesting region detection and photo retrieval

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Photographs taken by human beings differ from the images that taken by a lifeless device, such as a surveillance camera or a visual sensor on a robot, in that human being intentionally shoot photographs to express his/her feeling or photo-realistically record a scene. This creation process is accomplished by adjusting two factors: the setting of parameters on a camera and the position between the camera and the object which he or she is interested in. In this paper, this procedure is learned using the machine learning technique so that what the interest of the photographer is and what the core content of a photo wants to display can be reversely calculated. A photo retrieval system was built upon the category of interesting regions and the metadata is used for help. The research also explore the argument how local or global feature affects the performance of image retrieval. A novel stochastic segmentation algorithm called region restricted EM algorithm was applied in order to construct the interesting regions. Experimental evaluation on over 7000+ photos taken by 200+ different models of cameras with variety of interests has shown the robustness of our technique.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18

Similar content being viewed by others

References

  1. Achanta R, Estrada F, Wils P, Ssstrunk S (2008) Salient region detection and segmentation. Lect Notes Comput Sci 5008/2008:66–75

    Article  Google Scholar 

  2. Bouveyron C, Girard S, Schmid C (2007) High-dimensional data clustering. Comput Stat Data Anal 52:502–519

    Article  MATH  MathSciNet  Google Scholar 

  3. Boutell M, Luo J (2004) Bayesian fusion of camera metadata cues in semantic scene classification. In: IEEE computer society conference on computer vision and pattern recognition, vol 2, pp 623–630

  4. Brox T, Weickert J (2004) Level set based image segmentation with multiple regions. Lect Notes Comput Sci 3175/2004:415–423

    Google Scholar 

  5. Brox T, Rousson M, Deriche R, Weickert J (2003) Unsupervised segmentation incorporating colour, texture, and motion. 2756/2003:353–360

  6. Comaniciu D, Meer P (2002) Mean shift: a robust approach toward feature space analysis. IEEE Trans Pattern Anal Mach Intell 24:603–619

    Article  Google Scholar 

  7. Datta R, Joshi D, Li J, Wang JZ (2008) Image retrieval: ideas, influences, and trends of the new age. ACM Comput Surv 40(5):1–60

    Article  Google Scholar 

  8. Dempster AP, Laird NM, Rubin DB (1997) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc Ser B 39:1–38

    MathSciNet  Google Scholar 

  9. Deng Y, Manjunath BS (2001) Unsupervised segmentation of color-texture regions in image and video. IEEE Trans Pattern Anal Mach Intell 8:800–810

    Article  Google Scholar 

  10. Desimone R, Duncan J (1995) Neural mechanisms of selective visual attention. Annu Rev Neurosci 18:193–222

    Article  Google Scholar 

  11. Fred ALN, Leitao JMN (2003) A new cluster isolation criterion based on dissimilarity increments. IEEE Trans Pattern Anal Mach Intell 25:944–958

    Article  Google Scholar 

  12. Fussenegger M, Deriche R, Pinz A (2006) A Multiphase level set based segmentation framework with pose invariant shape priors, ACCV (2). Springer, Hyderabad, pp 395–404

    Google Scholar 

  13. Kato Z (1999) Bayesian color image segmentation using reversible jump Markov chain Monte Carlo. University of Szeged, Institute of Informatics

  14. Lee TCM (2000) A minimum description length based image segmentation procedure, and its comparison with a cross-validation based segmentation procedure. J Am Stat Assoc 95:259–270

    Article  MATH  Google Scholar 

  15. LIBSVM: a library for support vector machines (2001). Chih-Chung Chang and Chih-Jen Lin. http://www.csie.ntu.edu.tw/~cjlin/libsvm

  16. Luo J, Savakis AE, Singhal A (2005) A Bayesian network-based framework for semantic image understanding. Pattern Recogn 38:919–934

    Article  Google Scholar 

  17. Ma Y, Derksen H, Hong W, Wright J (2007) Segmentation of multivariate mixed data via lossy data coding and compression. IEEE Trans Pattern Anal Mach Intell 29:1546–1562

    Article  Google Scholar 

  18. Mansouri A-R, Mitiche A, Vazquez C (2006) Multiregion competition: a level set extension of region competition to multiple region image partitioning. Comput Vis Image Underst 101:137–150

    Article  Google Scholar 

  19. Privitera CM, Stark LW (2000) Algorithms for defining visual region-of-interest: comparison with eye fixations. IEEE Trans Pattern Anal Mach Intell 22:970–982

    Article  Google Scholar 

  20. Rissanen J (1989) Stochastic complexity in statistical inquiry. World Scientific, Singapore

    MATH  Google Scholar 

  21. Schwarz G (1978) Estimating the dimension of a model. Ann Stat 6:461–464

    Article  MATH  Google Scholar 

  22. Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans Pattern Anal Mach Intell 22:888–905

    Article  Google Scholar 

  23. Shih FY, Cheng S (2005) Automatic seeded region growing for color image segmentation. Image Vis Comput 23:877–886

    Article  Google Scholar 

  24. Shyu C, Brodley C, Kak A, Kosaka A, Aisen A, Broderick L (1998) In: IEEE workshop on content-based access of image and video libraries. IEEE Computer Society, Washington, DC, pp 30–34

    Chapter  Google Scholar 

  25. Siagian C, Itti L (2007) Rapid biologically-inspired scene classification using features shared with visual attention. IEEE Trans Pattern Anal Mach Intell 29:300–312

    Article  Google Scholar 

  26. Smeulders A, Worring M, Santini S, Gupta A, Jain R (2000) Content-based image retrieval at the end of the early years. IEEE Trans Pattern Anal Mach Intell 22:1349–1380

    Article  Google Scholar 

  27. Torralba A, Oliva A, Castelhano MS, Henderson JM (2006) Contextual guidance of eye movements and attention in real-world scenes: the role of global features on object search. Psychol Rev 113:766–786

    Article  Google Scholar 

  28. Ueda N, Nakano R, Ghahramani YZ, Hiton GE (2000) SMEM algorithm for mixture models. Neural Comput 12:2109–2128

    Article  Google Scholar 

  29. Walther D, Koch C (2006) Modeling attention to salient proto-objects. Neural Netw 19:1395–1407

    Article  MATH  Google Scholar 

  30. Yang S, Kim S-K, Ro YM (2007) Semantic home photo categorization. IEEE Trans Circuits Syst Video Technol 17:324–335

    Article  Google Scholar 

  31. Zhang Z, Chen C, Sun J, Chan KL (2003) EM algorithm for Gaussian mixtures with split-and-merge operation. Pattern Recogn 36:1973–1983

    Article  MATH  Google Scholar 

  32. Zhu SC, Yuille A (1996) Region competition: unifying snakes, region growing, and Bayes/MDL for multiband image segmentation. IEEE Trans Pattern Anal Mach Intell 18:884–900

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jianping Fan.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, Z., Fan, J. Exploit camera metadata for enhancing interesting region detection and photo retrieval. Multimed Tools Appl 46, 207–233 (2010). https://doi.org/10.1007/s11042-009-0346-0

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-009-0346-0

Keywords