short-paper

Incorporating camera metadata for attended region detection and consumer photo classification

Authors:

Zhong Li,

Hangzai Luo,

Jianping FanAuthors Info & Claims

MM '09: Proceedings of the 17th ACM international conference on Multimedia

Pages 517 - 520

https://doi.org/10.1145/1631272.1631345

Published: 19 October 2009 Publication History

Get Access

Abstract

Photos taken by human beings significantly differ from the pictures that are taken by a surveillance camera or a vision sensor on a robot, e.g., human beings may intentionally capture photos to express his/her feeling or record a memorial scene. Such a creative photo capture process is accomplished by adjusting two factors: (1) the parameters setting of a camera; and (2) the position between the camera and the interesting objects or scenes. To enable automatic understanding and interpretation of the semantics of photos, it is very important to take all these factors into account. Unfortunately, most existing algorithms for image understanding focus on only the content of the images while completely ignoring these two important factors. In this paper, we have developed a new algorithm to calculate what the interestingness of the photographer is and what the core content of a photo is. The gained information (i.e., attended regions and attention of the photographer) is further used to support more effective photo classification and retrieval. Our experiments on 70,000+ photos taken by 200+ different models of cameras have obtained very positive results.

References

[1]

A. Smeulders, M. Worring, S. Santini, A. Gupta and R. Jain, Content-based image retrieval at the end of the early years, PAMI, vol.22, pp.1349--1380, 2000.

Digital Library

Google Scholar

[2]

R. Desimone, J. Duncan,"Neural mechanisms of selective visual attention, annual reviews", Neuroscience, vol.18, pp.193--222, 1995.

Google Scholar

[3]

C. M. Privitera, L. W. Stark, "Algorithms for defining visual region-of-interest: Comparison with eye fixations", PAMI, vol.22, pp.970--982, 2000.

Digital Library

Google Scholar

[4]

C. Siagian, L. Itti, "Rapid biologically-inspired scene classification using features shared with visual attention", PAMI, vol.29, pp.300--312, 2007.

Digital Library

Google Scholar

[5]

Y. Ma, H. Zhang, "Contrast-based image attention analysis by using fuzzy growing", ACM Multimedia, pp.374--381, 2003.

Digital Library

Google Scholar

[6]

J. Luo, A.E. Savakis, A. Singhal, "A Bayesian network-based framework for semantic image understanding", Pattern Recognition, vol.38, pp.919--934, 2005.

Digital Library

Google Scholar

[7]

M. Boutell, J. Luo, "Bayesian fusion of camera metadata cues in semantic scene classification", IEEE CVPR, pp.623--630, 2004.

Digital Library

Google Scholar

[8]

J. Fan, Y. Gao, H. Luo, "Integrating concept ontology and multi-task learning to achieve more effective classifier training for multi-level image annotation", IEEE Trans. on Image Processing, vol. 17, no.3, pp.407--426, 2008.

Digital Library

Google Scholar

Cited By

View all

Ding XChen Z(2019)Improving Saliency Detection Based on Modeling Photographer's IntentionIEEE Transactions on Multimedia10.1109/TMM.2018.285138921:1(124-134)Online publication date: Jan-2019
https://doi.org/10.1109/TMM.2018.2851389
Rabbath MBoll S(2014)Personal Media ReunionProceedings of the 20th Anniversary International Conference on MultiMedia Modeling - Volume 832510.1007/978-3-319-04114-8_16(183-194)Online publication date: 6-Jan-2014
https://dl.acm.org/doi/10.1007/978-3-319-04114-8_16
Cavalcanti CGomes HDe Queiroz J(2013)A survey on automatic techniques for enhancement and analysis of digital photographyJournal of the Brazilian Computer Society10.1007/s13173-013-0102-119:3(341-359)Online publication date: 26-Mar-2013
https://doi.org/10.1007/s13173-013-0102-1
Show More Cited By

Index Terms

Incorporating camera metadata for attended region detection and consumer photo classification
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision representations
  2. Machine learning

Recommendations

Exploit camera metadata for enhancing interesting region detection and photo retrieval

Photographs taken by human beings differ from the images that taken by a lifeless device, such as a surveillance camera or a visual sensor on a robot, in that human being intentionally shoot photographs to express his/her feeling or photo-realistically ...
Semantic Home Photo Categorization

A semantic categorization method for generic home photo is proposed. The main contribution of this paper is to exploit a two-layered classification model incorporating camera metadata with low-level features for multilabel detection. The two-layered ...
Unified Multi-Camera Detection and Tracking Using Region-Matching
WOMOT '01: Proceedings of the IEEE Workshop on Multi-Object Tracking (WOMOT'01)

Abstract: We describe an algorithm for detecting and tracking multiple people in a cluttered scene using multiple synchronized cameras located far away from each other. This camera arrangement results in multiple wide-baseline camera systems. We segment ...

Comments

Information & Contributors

Information

Published In

MM '09: Proceedings of the 17th ACM international conference on Multimedia

October 2009

1202 pages

ISBN:9781605586083

DOI:10.1145/1631272

General Chairs:
Wen Gao
Peking University, China
,
Yong Rui
Microsoft, China
,
Alan Hanjalic
Delft University of Technology, The Netherlands
,
Program Chairs:
Changsheng Xu
Institute of Automation, Chinese Academy of Sciences, China
,
Eckehard Steinbach
Technical University of Munich, Germany
,
Abdulmotaleb El Saddik
University of Ottawa, Canada
,
Michelle Zhou
IBM T. J. Watson Research Center, USA

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 October 2009

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper

Conference

MM09

Sponsor:

SIGMM

MM09: ACM Multimedia Conference

October 19 - 24, 2009

Beijing, China

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

5
Total Citations
View Citations
234
Total Downloads

Downloads (Last 12 months)1
Downloads (Last 6 weeks)0

Reflects downloads up to 24 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Ding XChen Z(2019)Improving Saliency Detection Based on Modeling Photographer's IntentionIEEE Transactions on Multimedia10.1109/TMM.2018.285138921:1(124-134)Online publication date: Jan-2019
https://doi.org/10.1109/TMM.2018.2851389
Rabbath MBoll S(2014)Personal Media ReunionProceedings of the 20th Anniversary International Conference on MultiMedia Modeling - Volume 832510.1007/978-3-319-04114-8_16(183-194)Online publication date: 6-Jan-2014
https://dl.acm.org/doi/10.1007/978-3-319-04114-8_16
Cavalcanti CGomes HDe Queiroz J(2013)A survey on automatic techniques for enhancement and analysis of digital photographyJournal of the Brazilian Computer Society10.1007/s13173-013-0102-119:3(341-359)Online publication date: 26-Mar-2013
https://doi.org/10.1007/s13173-013-0102-1
Li HYi LTang JWang XCandan KPanchanathan SPrabhakaran BSundaram HFeng WSebe N(2011)Capturing a great photo via learning from community-contributed photo collectionsProceedings of the 19th ACM international conference on Multimedia10.1145/2072298.2072470(809-810)Online publication date: 28-Nov-2011
https://dl.acm.org/doi/10.1145/2072298.2072470
Zhong SLiu YLiu YCandan KPanchanathan SPrabhakaran BSundaram HFeng WSebe N(2011)Bilinear deep learning for image classificationProceedings of the 19th ACM international conference on Multimedia10.1145/2072298.2072344(343-352)Online publication date: 28-Nov-2011
https://dl.acm.org/doi/10.1145/2072298.2072344

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Abstract

References

Cited By

Index Terms

Recommendations

Exploit camera metadata for enhancing interesting region detection and photo retrieval

Semantic Home Photo Categorization

Unified Multi-Camera Detection and Tracking Using Region-Matching

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Share

Share this Publication link

Share on social media

Affiliations