Performance Analysis of Multiple Classifier Fusion for Semantic Video Content Indexing and Retrieval

Benmokhtar, Rachid; Huet, Benoit

doi:10.1007/978-3-540-69423-6_50

Rachid Benmokhtar²¹ &
Benoit Huet²¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4351))

Included in the following conference series:

International Conference on Multimedia Modeling

859 Accesses
1 Citations
1 Altmetric

Abstract

In this paper we compare a number of classifier fusion approaches within a complete and efficient framework for video shot indexing and retrieval. The aim of the fusion stage of our sytem is to detect the semantic content of video shots based on classifiers output obtained from low level features. An overview of current research in classifier fusion is provided along with a comparative study of four combination methods. A novel training technique called Weighted Ten Folding based on Ten Folding principle is proposed for combining classifier. The experimental results conducted in the framework of the TrecVid’05 features extraction task report the efficiency of different combination methods and show the improvement provided by our proposed scheme.

This work is funded by France Télécom R&D under CRE 46134752.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Naphade, M., Kristjansson, T., Frey, B., Huang, T.: Probabilistic multimedia objects (multijets): a novel approach to video indexing and retrieval. IEEE Trans. Image Process. 3, 536–540 (1998)
Google Scholar
TRECVID, Digital video retrieval at NIST, http://www-nlpir.nist.gov/projects/trecvid/
Felzenszwalb, P., Huttenlocher, D.: Efficiently computing a good segmentation. In: Proceedings of IEEE CVPR, pp. 98–104 (1998)
Google Scholar
Souvannavong, F.: Indexation et recherche de plans video par contenu semantique. Ph.D. dissertation, Phd thesis of Eurecom Institute, France (2005)
Google Scholar
Ma, W., Zhang, H.: Benchmarking of image features for content-based image retrieval. In: Thirtysecond Asilomar Conference on Signals, System and Computers, pp. 253–257 (1998)
Google Scholar
Carson, C., Thomas, M., Belongie, S.: Blobworld: A system for region-based image indexing and retrieval. In: Third international conference on visual information systems (1999)
Google Scholar
Souvannavong, D., Merialdo, B., Huet, B.: Multi modal classifier fusion for video shot content retrieval. In: Proceedings of WIAMIS (2005)
Google Scholar
Allwein, E., Schapire, R., Singer, Y.: Reducing multiclass to binary: A unifying approach for margin classifiers. Journal of Machine Learning Research 1, 113–141 (2000)
Article MathSciNet Google Scholar
Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines. In: Kernel-Induced Feature Spaces, Cambridge University Press, Cambridge (2000)
Google Scholar
Kuncheva, L., Bezdek, J.C., Duin, R.: Decision templates for multiple classifier fusion: an experiemental comparaison. Pattern Recognition 34, 299–314 (2001)
Article MATH Google Scholar
Duin, R., Tax, D.: Experiements with classifier combining rules. In: Kittler, J., Roli, F. (eds.) MCS 2000. LNCS, vol. 1857, pp. 16–29. Springer, Heidelberg (2000)
Chapter Google Scholar
Rastrigin, L., Erenstein, R.: Method of collective recognition. Energoizdat (1982)
Google Scholar
Jacobs, R., Jordan, M., Nowlan, S., Hinton, G.: Adaptive mixtures of local experts. Neural Computation 3, 1409–1431 (1991)
Article Google Scholar
Xu, L., Krzyzak, A., Suen, C.: Methods of combining multiple classifiers and their application to hardwriting recognition. IEEE Trans. Sys. Man. Cyb. 22, 418–435 (1992)
Article Google Scholar
Chou, K., Tu, L., Shyu, I.: Performances analysis of a multiple classifiers system for recognition of totally unconstrained handwritten numerals. In: 4th International Workshop on Frontiers of Handwritten Recognition, pp. 480–487 (1994)
Google Scholar
Achermann, B., Bunke, H.: Combination of classifiers on the decision level for face recognition. Technical report of Bern University (1996)
Google Scholar
Ho, T.: A theory of multiple classifier systems and its application to visual and word recognition. Ph.D. dissertation, Phd thesis of New York University (1992)
Google Scholar
Cybenko, G.: Approximations by superposition of a sigmoidal function. Mathematics of Control, Signal and Systems 2, 303–314 (1989)
Article MATH MathSciNet Google Scholar
Freud, Y., Schapire, R.: Experiments with a new boosting algorithms. In: Machine Learning: Proceedings of the 13th International Conference (1996)
Google Scholar
Skurichina, M., Duin, R.: Bagging for linear classifiers. Pattern Recognition 31(7), 909–930 (1998)
Article Google Scholar
Cooper, M., Adcock, J., Chen, R., Zhou, H.: Fxpal at trecvid 2005. In: Proceedings of Trecvid (2005)
Google Scholar
Chang, S.-F., Hsu, W., Kennedy, L., Xie, L., Yanagawa, A., Zavesky, E., Zhang, D.: Video seach and high level feature extraction. In: Proceedings of Trecvid (2005)
Google Scholar
Amir, A., Argillander, J., Campbell, M., Haubold, A., Iyengar, G., Ebadollahi, S., Kang, F., Naphade, M., Natsev, A., Smith, J., Tesic, J., Volkmer, T.: Ibm research trecvid 2005 video retrieval system. In: Proceedings of Trecvid (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Département Communications Multimédias, Institut Eurécom, 2229, route des crêtes, 06904, Sophia-Antipolis, France
Rachid Benmokhtar & Benoit Huet

Authors

Rachid Benmokhtar
View author publications
You can also search for this author in PubMed Google Scholar
Benoit Huet
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computer Engineering, Nanyang Technological University, Block N4, Nanyang Avenue, 639798, Singapore
Tat-Jen Cham & Deepu Rajan &
School of Computer Engineering, Nanyang Technological University, 639798, Singapore
Jianfei Cai
IBM T.J. Watson Research Center, Yorktown Heights, P.O. Box 704, 10598, New York, USA
Chitra Dorai
National University of Singapore, 3 Science Dr, 117543, Singapore
Tat-Seng Chua
Center for Multimedia and Network Technology, School of Computer Enginnering, Nanyang Technological University, 639798, Singapore
Liang-Tien Chia

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Benmokhtar, R., Huet, B. (2006). Performance Analysis of Multiple Classifier Fusion for Semantic Video Content Indexing and Retrieval. In: Cham, TJ., Cai, J., Dorai, C., Rajan, D., Chua, TS., Chia, LT. (eds) Advances in Multimedia Modeling. MMM 2007. Lecture Notes in Computer Science, vol 4351. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69423-6_50

Download citation

DOI: https://doi.org/10.1007/978-3-540-69423-6_50
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69421-2
Online ISBN: 978-3-540-69423-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics