Abstract
In this paper we compare a number of classifier fusion approaches within a complete and efficient framework for video shot indexing and retrieval. The aim of the fusion stage of our sytem is to detect the semantic content of video shots based on classifiers output obtained from low level features. An overview of current research in classifier fusion is provided along with a comparative study of four combination methods. A novel training technique called Weighted Ten Folding based on Ten Folding principle is proposed for combining classifier. The experimental results conducted in the framework of the TrecVid’05 features extraction task report the efficiency of different combination methods and show the improvement provided by our proposed scheme.
This work is funded by France Télécom R&D under CRE 46134752.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Naphade, M., Kristjansson, T., Frey, B., Huang, T.: Probabilistic multimedia objects (multijets): a novel approach to video indexing and retrieval. IEEE Trans. Image Process. 3, 536–540 (1998)
TRECVID, Digital video retrieval at NIST, http://www-nlpir.nist.gov/projects/trecvid/
Felzenszwalb, P., Huttenlocher, D.: Efficiently computing a good segmentation. In: Proceedings of IEEE CVPR, pp. 98–104 (1998)
Souvannavong, F.: Indexation et recherche de plans video par contenu semantique. Ph.D. dissertation, Phd thesis of Eurecom Institute, France (2005)
Ma, W., Zhang, H.: Benchmarking of image features for content-based image retrieval. In: Thirtysecond Asilomar Conference on Signals, System and Computers, pp. 253–257 (1998)
Carson, C., Thomas, M., Belongie, S.: Blobworld: A system for region-based image indexing and retrieval. In: Third international conference on visual information systems (1999)
Souvannavong, D., Merialdo, B., Huet, B.: Multi modal classifier fusion for video shot content retrieval. In: Proceedings of WIAMIS (2005)
Allwein, E., Schapire, R., Singer, Y.: Reducing multiclass to binary: A unifying approach for margin classifiers. Journal of Machine Learning Research 1, 113–141 (2000)
Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines. In: Kernel-Induced Feature Spaces, Cambridge University Press, Cambridge (2000)
Kuncheva, L., Bezdek, J.C., Duin, R.: Decision templates for multiple classifier fusion: an experiemental comparaison. Pattern Recognition 34, 299–314 (2001)
Duin, R., Tax, D.: Experiements with classifier combining rules. In: Kittler, J., Roli, F. (eds.) MCS 2000. LNCS, vol. 1857, pp. 16–29. Springer, Heidelberg (2000)
Rastrigin, L., Erenstein, R.: Method of collective recognition. Energoizdat (1982)
Jacobs, R., Jordan, M., Nowlan, S., Hinton, G.: Adaptive mixtures of local experts. Neural Computation 3, 1409–1431 (1991)
Xu, L., Krzyzak, A., Suen, C.: Methods of combining multiple classifiers and their application to hardwriting recognition. IEEE Trans. Sys. Man. Cyb. 22, 418–435 (1992)
Chou, K., Tu, L., Shyu, I.: Performances analysis of a multiple classifiers system for recognition of totally unconstrained handwritten numerals. In: 4th International Workshop on Frontiers of Handwritten Recognition, pp. 480–487 (1994)
Achermann, B., Bunke, H.: Combination of classifiers on the decision level for face recognition. Technical report of Bern University (1996)
Ho, T.: A theory of multiple classifier systems and its application to visual and word recognition. Ph.D. dissertation, Phd thesis of New York University (1992)
Cybenko, G.: Approximations by superposition of a sigmoidal function. Mathematics of Control, Signal and Systems 2, 303–314 (1989)
Freud, Y., Schapire, R.: Experiments with a new boosting algorithms. In: Machine Learning: Proceedings of the 13th International Conference (1996)
Skurichina, M., Duin, R.: Bagging for linear classifiers. Pattern Recognition 31(7), 909–930 (1998)
Cooper, M., Adcock, J., Chen, R., Zhou, H.: Fxpal at trecvid 2005. In: Proceedings of Trecvid (2005)
Chang, S.-F., Hsu, W., Kennedy, L., Xie, L., Yanagawa, A., Zavesky, E., Zhang, D.: Video seach and high level feature extraction. In: Proceedings of Trecvid (2005)
Amir, A., Argillander, J., Campbell, M., Haubold, A., Iyengar, G., Ebadollahi, S., Kang, F., Naphade, M., Natsev, A., Smith, J., Tesic, J., Volkmer, T.: Ibm research trecvid 2005 video retrieval system. In: Proceedings of Trecvid (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Benmokhtar, R., Huet, B. (2006). Performance Analysis of Multiple Classifier Fusion for Semantic Video Content Indexing and Retrieval. In: Cham, TJ., Cai, J., Dorai, C., Rajan, D., Chua, TS., Chia, LT. (eds) Advances in Multimedia Modeling. MMM 2007. Lecture Notes in Computer Science, vol 4351. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69423-6_50
Download citation
DOI: https://doi.org/10.1007/978-3-540-69423-6_50
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69421-2
Online ISBN: 978-3-540-69423-6
eBook Packages: Computer ScienceComputer Science (R0)