Skip to main content

Performance Analysis of Multiple Classifier Fusion for Semantic Video Content Indexing and Retrieval

  • Conference paper
Advances in Multimedia Modeling (MMM 2007)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4351))

Included in the following conference series:

Abstract

In this paper we compare a number of classifier fusion approaches within a complete and efficient framework for video shot indexing and retrieval. The aim of the fusion stage of our sytem is to detect the semantic content of video shots based on classifiers output obtained from low level features. An overview of current research in classifier fusion is provided along with a comparative study of four combination methods. A novel training technique called Weighted Ten Folding based on Ten Folding principle is proposed for combining classifier. The experimental results conducted in the framework of the TrecVid’05 features extraction task report the efficiency of different combination methods and show the improvement provided by our proposed scheme.

This work is funded by France Télécom R&D under CRE 46134752.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Naphade, M., Kristjansson, T., Frey, B., Huang, T.: Probabilistic multimedia objects (multijets): a novel approach to video indexing and retrieval. IEEE Trans. Image Process. 3, 536–540 (1998)

    Google Scholar 

  2. TRECVID, Digital video retrieval at NIST, http://www-nlpir.nist.gov/projects/trecvid/

  3. Felzenszwalb, P., Huttenlocher, D.: Efficiently computing a good segmentation. In: Proceedings of IEEE CVPR, pp. 98–104 (1998)

    Google Scholar 

  4. Souvannavong, F.: Indexation et recherche de plans video par contenu semantique. Ph.D. dissertation, Phd thesis of Eurecom Institute, France (2005)

    Google Scholar 

  5. Ma, W., Zhang, H.: Benchmarking of image features for content-based image retrieval. In: Thirtysecond Asilomar Conference on Signals, System and Computers, pp. 253–257 (1998)

    Google Scholar 

  6. Carson, C., Thomas, M., Belongie, S.: Blobworld: A system for region-based image indexing and retrieval. In: Third international conference on visual information systems (1999)

    Google Scholar 

  7. Souvannavong, D., Merialdo, B., Huet, B.: Multi modal classifier fusion for video shot content retrieval. In: Proceedings of WIAMIS (2005)

    Google Scholar 

  8. Allwein, E., Schapire, R., Singer, Y.: Reducing multiclass to binary: A unifying approach for margin classifiers. Journal of Machine Learning Research 1, 113–141 (2000)

    Article  MathSciNet  Google Scholar 

  9. Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines. In: Kernel-Induced Feature Spaces, Cambridge University Press, Cambridge (2000)

    Google Scholar 

  10. Kuncheva, L., Bezdek, J.C., Duin, R.: Decision templates for multiple classifier fusion: an experiemental comparaison. Pattern Recognition 34, 299–314 (2001)

    Article  MATH  Google Scholar 

  11. Duin, R., Tax, D.: Experiements with classifier combining rules. In: Kittler, J., Roli, F. (eds.) MCS 2000. LNCS, vol. 1857, pp. 16–29. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  12. Rastrigin, L., Erenstein, R.: Method of collective recognition. Energoizdat (1982)

    Google Scholar 

  13. Jacobs, R., Jordan, M., Nowlan, S., Hinton, G.: Adaptive mixtures of local experts. Neural Computation 3, 1409–1431 (1991)

    Article  Google Scholar 

  14. Xu, L., Krzyzak, A., Suen, C.: Methods of combining multiple classifiers and their application to hardwriting recognition. IEEE Trans. Sys. Man. Cyb. 22, 418–435 (1992)

    Article  Google Scholar 

  15. Chou, K., Tu, L., Shyu, I.: Performances analysis of a multiple classifiers system for recognition of totally unconstrained handwritten numerals. In: 4th International Workshop on Frontiers of Handwritten Recognition, pp. 480–487 (1994)

    Google Scholar 

  16. Achermann, B., Bunke, H.: Combination of classifiers on the decision level for face recognition. Technical report of Bern University (1996)

    Google Scholar 

  17. Ho, T.: A theory of multiple classifier systems and its application to visual and word recognition. Ph.D. dissertation, Phd thesis of New York University (1992)

    Google Scholar 

  18. Cybenko, G.: Approximations by superposition of a sigmoidal function. Mathematics of Control, Signal and Systems 2, 303–314 (1989)

    Article  MATH  MathSciNet  Google Scholar 

  19. Freud, Y., Schapire, R.: Experiments with a new boosting algorithms. In: Machine Learning: Proceedings of the 13th International Conference (1996)

    Google Scholar 

  20. Skurichina, M., Duin, R.: Bagging for linear classifiers. Pattern Recognition 31(7), 909–930 (1998)

    Article  Google Scholar 

  21. Cooper, M., Adcock, J., Chen, R., Zhou, H.: Fxpal at trecvid 2005. In: Proceedings of Trecvid (2005)

    Google Scholar 

  22. Chang, S.-F., Hsu, W., Kennedy, L., Xie, L., Yanagawa, A., Zavesky, E., Zhang, D.: Video seach and high level feature extraction. In: Proceedings of Trecvid (2005)

    Google Scholar 

  23. Amir, A., Argillander, J., Campbell, M., Haubold, A., Iyengar, G., Ebadollahi, S., Kang, F., Naphade, M., Natsev, A., Smith, J., Tesic, J., Volkmer, T.: Ibm research trecvid 2005 video retrieval system. In: Proceedings of Trecvid (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Benmokhtar, R., Huet, B. (2006). Performance Analysis of Multiple Classifier Fusion for Semantic Video Content Indexing and Retrieval. In: Cham, TJ., Cai, J., Dorai, C., Rajan, D., Chua, TS., Chia, LT. (eds) Advances in Multimedia Modeling. MMM 2007. Lecture Notes in Computer Science, vol 4351. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69423-6_50

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-69423-6_50

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-69421-2

  • Online ISBN: 978-3-540-69423-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics