Exploit camera metadata for enhancing interesting region detection and photo retrieval

Li, Zhong; Fan, Jianping

doi:10.1007/s11042-009-0346-0

Exploit camera metadata for enhancing interesting region detection and photo retrieval

Published: 26 August 2009

Volume 46, pages 207–233, (2010)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Zhong Li¹ &
Jianping Fan¹

134 Accesses
2 Citations
Explore all metrics

Abstract

Photographs taken by human beings differ from the images that taken by a lifeless device, such as a surveillance camera or a visual sensor on a robot, in that human being intentionally shoot photographs to express his/her feeling or photo-realistically record a scene. This creation process is accomplished by adjusting two factors: the setting of parameters on a camera and the position between the camera and the object which he or she is interested in. In this paper, this procedure is learned using the machine learning technique so that what the interest of the photographer is and what the core content of a photo wants to display can be reversely calculated. A photo retrieval system was built upon the category of interesting regions and the metadata is used for help. The research also explore the argument how local or global feature affects the performance of image retrieval. A novel stochastic segmentation algorithm called region restricted EM algorithm was applied in order to construct the interesting regions. Experimental evaluation on over 7000+ photos taken by 200+ different models of cameras with variety of interests has shown the robustness of our technique.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

GEO matching regions: multiple regions of interests using content based image retrieval based on relative locations

Article 05 September 2016

Multi-modal and multi-scale photo collection summarization

Article 24 May 2015

Comparative Study of a Computer Vision Technique for Locating Instances of Objects in Images Using YOLO Versions: A Review

References

Achanta R, Estrada F, Wils P, Ssstrunk S (2008) Salient region detection and segmentation. Lect Notes Comput Sci 5008/2008:66–75
Article Google Scholar
Bouveyron C, Girard S, Schmid C (2007) High-dimensional data clustering. Comput Stat Data Anal 52:502–519
Article MATH MathSciNet Google Scholar
Boutell M, Luo J (2004) Bayesian fusion of camera metadata cues in semantic scene classification. In: IEEE computer society conference on computer vision and pattern recognition, vol 2, pp 623–630
Brox T, Weickert J (2004) Level set based image segmentation with multiple regions. Lect Notes Comput Sci 3175/2004:415–423
Google Scholar
Brox T, Rousson M, Deriche R, Weickert J (2003) Unsupervised segmentation incorporating colour, texture, and motion. 2756/2003:353–360
Comaniciu D, Meer P (2002) Mean shift: a robust approach toward feature space analysis. IEEE Trans Pattern Anal Mach Intell 24:603–619
Article Google Scholar
Datta R, Joshi D, Li J, Wang JZ (2008) Image retrieval: ideas, influences, and trends of the new age. ACM Comput Surv 40(5):1–60
Article Google Scholar
Dempster AP, Laird NM, Rubin DB (1997) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc Ser B 39:1–38
MathSciNet Google Scholar
Deng Y, Manjunath BS (2001) Unsupervised segmentation of color-texture regions in image and video. IEEE Trans Pattern Anal Mach Intell 8:800–810
Article Google Scholar
Desimone R, Duncan J (1995) Neural mechanisms of selective visual attention. Annu Rev Neurosci 18:193–222
Article Google Scholar
Fred ALN, Leitao JMN (2003) A new cluster isolation criterion based on dissimilarity increments. IEEE Trans Pattern Anal Mach Intell 25:944–958
Article Google Scholar
Fussenegger M, Deriche R, Pinz A (2006) A Multiphase level set based segmentation framework with pose invariant shape priors, ACCV (2). Springer, Hyderabad, pp 395–404
Google Scholar
Kato Z (1999) Bayesian color image segmentation using reversible jump Markov chain Monte Carlo. University of Szeged, Institute of Informatics
Lee TCM (2000) A minimum description length based image segmentation procedure, and its comparison with a cross-validation based segmentation procedure. J Am Stat Assoc 95:259–270
Article MATH Google Scholar
LIBSVM: a library for support vector machines (2001). Chih-Chung Chang and Chih-Jen Lin. http://www.csie.ntu.edu.tw/~cjlin/libsvm
Luo J, Savakis AE, Singhal A (2005) A Bayesian network-based framework for semantic image understanding. Pattern Recogn 38:919–934
Article Google Scholar
Ma Y, Derksen H, Hong W, Wright J (2007) Segmentation of multivariate mixed data via lossy data coding and compression. IEEE Trans Pattern Anal Mach Intell 29:1546–1562
Article Google Scholar
Mansouri A-R, Mitiche A, Vazquez C (2006) Multiregion competition: a level set extension of region competition to multiple region image partitioning. Comput Vis Image Underst 101:137–150
Article Google Scholar
Privitera CM, Stark LW (2000) Algorithms for defining visual region-of-interest: comparison with eye fixations. IEEE Trans Pattern Anal Mach Intell 22:970–982
Article Google Scholar
Rissanen J (1989) Stochastic complexity in statistical inquiry. World Scientific, Singapore
MATH Google Scholar
Schwarz G (1978) Estimating the dimension of a model. Ann Stat 6:461–464
Article MATH Google Scholar
Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans Pattern Anal Mach Intell 22:888–905
Article Google Scholar
Shih FY, Cheng S (2005) Automatic seeded region growing for color image segmentation. Image Vis Comput 23:877–886
Article Google Scholar
Shyu C, Brodley C, Kak A, Kosaka A, Aisen A, Broderick L (1998) In: IEEE workshop on content-based access of image and video libraries. IEEE Computer Society, Washington, DC, pp 30–34
Chapter Google Scholar
Siagian C, Itti L (2007) Rapid biologically-inspired scene classification using features shared with visual attention. IEEE Trans Pattern Anal Mach Intell 29:300–312
Article Google Scholar
Smeulders A, Worring M, Santini S, Gupta A, Jain R (2000) Content-based image retrieval at the end of the early years. IEEE Trans Pattern Anal Mach Intell 22:1349–1380
Article Google Scholar
Torralba A, Oliva A, Castelhano MS, Henderson JM (2006) Contextual guidance of eye movements and attention in real-world scenes: the role of global features on object search. Psychol Rev 113:766–786
Article Google Scholar
Ueda N, Nakano R, Ghahramani YZ, Hiton GE (2000) SMEM algorithm for mixture models. Neural Comput 12:2109–2128
Article Google Scholar
Walther D, Koch C (2006) Modeling attention to salient proto-objects. Neural Netw 19:1395–1407
Article MATH Google Scholar
Yang S, Kim S-K, Ro YM (2007) Semantic home photo categorization. IEEE Trans Circuits Syst Video Technol 17:324–335
Article Google Scholar
Zhang Z, Chen C, Sun J, Chan KL (2003) EM algorithm for Gaussian mixtures with split-and-merge operation. Pattern Recogn 36:1973–1983
Article MATH Google Scholar
Zhu SC, Yuille A (1996) Region competition: unifying snakes, region growing, and Bayes/MDL for multiband image segmentation. IEEE Trans Pattern Anal Mach Intell 18:884–900
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, The University of North Carolina at Charlotte, Charlotte, NC, 28223-0001, USA
Zhong Li & Jianping Fan

Authors

Zhong Li
View author publications
You can also search for this author in PubMed Google Scholar
Jianping Fan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jianping Fan.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, Z., Fan, J. Exploit camera metadata for enhancing interesting region detection and photo retrieval. Multimed Tools Appl 46, 207–233 (2010). https://doi.org/10.1007/s11042-009-0346-0

Download citation

Published: 26 August 2009
Issue Date: January 2010
DOI: https://doi.org/10.1007/s11042-009-0346-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Exploit camera metadata for enhancing interesting region detection and photo retrieval

Abstract

Access this article

Similar content being viewed by others

GEO matching regions: multiple regions of interests using content based image retrieval based on relative locations

Multi-modal and multi-scale photo collection summarization

Comparative Study of a Computer Vision Technique for Locating Instances of Objects in Images Using YOLO Versions: A Review

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Exploit camera metadata for enhancing interesting region detection and photo retrieval

Abstract

Access this article

Similar content being viewed by others

GEO matching regions: multiple regions of interests using content based image retrieval based on relative locations

Multi-modal and multi-scale photo collection summarization

Comparative Study of a Computer Vision Technique for Locating Instances of Objects in Images Using YOLO Versions: A Review

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation