skip to main content
10.1145/1816041.1816103acmconferencesArticle/Chapter ViewAbstractPublication PagescivrConference Proceedingsconference-collections
research-article

Affine Stable Characteristic based sample expansion for object detection

Published: 05 July 2010 Publication History

Abstract

Generating better object model from automatic expanded samples is an effective approach to improve the performance of object detection. However, most existing methods either don't work well with limited relevance images in corpus, or result in redundant features and the decrease of detection speed. In this paper, we propose a novel method called Affine Stable Characteristic to generate an object feature model using only one object sample. By integrating affine simulation with stable characteristic mining, a compact and informative object model is generated with high robustness to viewpoint and scale transformations. For characteristic mining, two new notions, Global Stability and Local Stability, are introduced to calculate the robustness of each object feature from complementary hierarchies. And they are combined to generate the final object feature model. Experiments show that our novel method is capable of detecting objects in various geometric and photometric transformations, while only acquiring one sample image. In a compiled dataset composed of many famous test sets, the detection accuracy can be improved 35.8% compared with traditional methods at rapid on-line speed. The proposed approach can also be well generalized to other content analysis tasks.

References

[1]
O. Chum, J. Philbin, J. Sivic, M. Isard, A. Zisserman. Total recall: automatic query expansion with a generative feature model for object retrieval, International Conference on Computer Vision, 2007.
[2]
O. Chum and A. Zisserman. An exemplar model for learning object classes, Computer Vision and Pattern Recognition, 2007.
[3]
V. Ferrari, L. Fevrier, F. Jurie, and C. Schmid. Groups of adjacent contour segments for object detection, IEEE Transactions on Pattern Analysis and Machine Intelligence, 30:36--51, 2008.
[4]
J. Philbin, O. Chum, M. Isard, J. Sivic and A. Zisserman. Object retrieval with large vocabularies and fast spatial matching, Computer Vision and Pattern Recognition, 2007.
[5]
Y. G. Jiang, C. W. Ngo and J. Yang. Towards optimal bag-of-features for object categorization and semantic video retrieval. International Conference on Computer Vision, 2007.
[6]
Y. H. Kuo, K. T. Chen, C. H. Chiang, and W. H. Hsu. Query expansion for hash-based image object retrieval, International Conference on Multimedia, 2009.
[7]
A. Holub, P. Perona, M. C. Burl. Entropy-based active learning for object recognition, Computer Vision and Pattern Recognition Workshops, 2008.
[8]
J. H. Hsiao, C. S. Chen, L. F. Chien, and M. S. Chen. A new approach to image copy detection based on extended feature sets, IEEE Transactions on Image Processing, 16(8):2069--2079, 2007.
[9]
W. Wu and J. Yang. Object fingerprints for content analysis with applications to street landmark localization. ACM Multimedia, 2008.
[10]
D. Pritchard and W. Heidrich. Cloth motion capture. Computer Graphics Forum, 22(3):263--271, 2003.
[11]
J. M. Morel and G. Yu, ASIFT: A new framework for fully affine invariant image comparison, SIAM Journal on Imaging Sciences, 2(2), 2009.
[12]
K. Mikolajczyk, T. Tuytelaars, C. Schmid, etc. A comparison of affine region detectors, In International Journal on Computer Vision, 65(1/2):43--72, 2005.
[13]
K. Mikolajczyk, and C. Schmid. A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(10):1615--1630, 2005.
[14]
J. Matas, O. Chum, M. Urban, and T. Pajdla. Robust wide-baseline stereo from maximally stable extremal regions, British Machine Vision Conference, pp. 384--393, 2002.
[15]
K. Mikolajczyk, and C. Schmid. Scale & affine invariant interest point detectors, International Journal on Computer Vision, 60(1):63--86, 2004.
[16]
D. Lowe. Distinctive image features from scale invariant keypoints, In International Journal on Computer Vision, 60(2): 91--110, 2004.
[17]
H. Jegou, M. Douze and C. Schmid. Hamming embedding and weak geometry consistency for large scale image search, European conference on Computer vision, 2008.
[18]
J. Sivic, A. Zisserman. Video google: a text retrieval approach to object matching in vdeos, International Conference on Computer Vision, 2003.
[19]
H. Bay, T. Tuytelaars, L. V. Gool. SURF: speeded up robust features, European Conference on Computer Vision, 2006.
[20]
S. Arya, T. Malamatos, and D. M. Mount. Space-time tradeoffs for approximate nearest neighbor searching, Journal of the ACM, 57: 1--54, 2009.
[21]
K. Gao, S. L, Y. Z, S. T and D. Z. Logo detection based on spatial-spectral saliency and partial spatial context, IEEE International Conference on Multimedia and Expo, 2009.
[22]
Http://www-nlpir.nist.gov/projects/trecvid/
[23]
H. Xie, K. Gao, Y. Zhang, J. Li, Y. Liu. GPU-basd fast scale invariant interest point detector, IEEE International Conference on Acoustics, Speech, and Signal Processing, 2010.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CIVR '10: Proceedings of the ACM International Conference on Image and Video Retrieval
July 2010
492 pages
ISBN:9781450301176
DOI:10.1145/1816041
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 July 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. affine stable characteristic
  2. object detection
  3. sample expansion

Qualifiers

  • Research-article

Funding Sources

Conference

CIVR' 10
Sponsor:

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 30 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2022)High-Throughput Content-Based Video Analysis TechnologiesJournal of Engineering Studies10.3724/SP.J.1224.2014.0029406:03(294-306)Online publication date: 13-Oct-2022
  • (2020)An OSM Data-Driven Method for Road-Positive Sample CreationRemote Sensing10.3390/rs1221361212:21(3612)Online publication date: 3-Nov-2020
  • (2017)Age classification with deep learning face representationMultimedia Tools and Applications10.1007/s11042-017-4646-576:19(20231-20247)Online publication date: 1-Oct-2017
  • (2016)Visual homographNeurocomputing10.1016/j.neucom.2016.04.057208:C(342-349)Online publication date: 5-Oct-2016
  • (2014)A mathematical model to detect hand object from the scene2014 IEEE International Advance Computing Conference (IACC)10.1109/IAdCC.2014.6779485(1133-1136)Online publication date: Feb-2014
  • (2013)Accurate off-line query expansion for large-scale mobile visual searchSignal Processing10.1016/j.sigpro.2012.10.01193:8(2305-2315)Online publication date: 1-Aug-2013
  • (2012)Visual stem mapping and Geometric Tense coding for Augmented Visual Vocabulary2012 IEEE Conference on Computer Vision and Pattern Recognition10.1109/CVPR.2012.6248059(3234-3241)Online publication date: Jun-2012

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media