Decision-Level Fusion Method Based on Deep Learning

Wang, Kejun; Liu, Meichen; Hao, XueSen; Xing, Xianglei

doi:10.1007/978-3-319-69923-3_72

Kejun Wang²³,
Meichen Liu²³,
XueSen Hao²³ &
…
Xianglei Xing²³

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 10568))

Included in the following conference series:

Chinese Conference on Biometric Recognition

3710 Accesses
1 Citations

Abstract

We present a highly accurate and very efficient approach for personality traits prediction based on video. Unlike the traditional method, we proposed a decision-level information fusion method based on deep learning. We have separated the video modal into two parts, visual modal and audio model. The two models were processed by improved VGG-16 and LSTM network, respectively, and combined with an Extreme Learning Machine (ELM) to architecture decision-level information fusion. Experiments on challenging Youtube-8M dataset show that our proposed approach significantly outperforms traditional decision-level fusion method in terms of both efficiency and accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Sun, S.L., Deng, Z.L.: Multisensor information fusion with applications. Automatica 40(6), 1017–1023 (2004)
Article MathSciNet Google Scholar
Kapoor, A., Picard, R.W.: Multimodal affect recognition in learning environments. In: ACM International Conference on Multimedia, pp. 677–682. ACM (2005)
Google Scholar
Xing, X., Wang, K., Yan, T., Lv, Z.: Complete canonical correlation analysis with application to multi-view gait recognition. Pattern Recogn. 50, 107–117 (2016)
Article Google Scholar
Xing, X., Wang, K.: Couple manifold discriminant analysis with bipartite graph embedding for face recognition at a distance. Sig. Process. 125, 329–335 (2016)
Article Google Scholar
Piras, L., Giacinto, G.: Information fusion in content based image retrieval: a comprehensive overview. Inf. Fusion 37, 50–60 (2017)
Article Google Scholar
Xing, X., Wang, K., Lv, Z.: Fusion of gait and facial features using coupled projections for people identification at a distance. IEEE Sig. Process. Lett. 22(12), 2349–2353 (2015)
Article Google Scholar
Eid, M., Diener, E.: Intraindividual variability in affect: reliability, validity, and personality correlates. J. Pers. Soc. Psychol. 76(4), 662–676 (1999)
Article Google Scholar
Reeves, B., Nass, C.: The Media Equation: How People Treat Computers, Television, and New Media Like Real People and Places. Cambridge University Press, New York (1996)
Google Scholar
Lee, M.J., Choi, S.Y., Jeong, H.J., et al.: A precise image crawling system with image classification based on deep learning. Adv. Sci. Lett. 23(3), 1623–1626 (2017)
Article Google Scholar
Hu, G., Yang, Y., Yi, D., et al.: When face recognition meets with deep learning: an evaluation of convolutional neural networks for face recognition, pp. 384–392 (2015)
Google Scholar
Noh, H., Hong, S., Han, B.: Learning deconvolution network for semantic segmentation. In: Proceedings of IEEE International Conference of Computer Vision, pp. 1520–1528 (2015)
Google Scholar
Lin, X.F., Yacou, B., Sherif, B.J., et al.: Performance analysis of pattern classifier combination by plurality voting. Pattern Recogn. Lett. 24(12), 1959–1969 (2003)
Article Google Scholar
Fauvel, M., Chanussot, J., Benediktsson, J.A.: Decision fusion for the classification of urban remote sensing images. IEEE Trans. Geosci. Remote Sens. 44(10), 2828–2838 (2006)
Article Google Scholar
Pan, A., Zhou, J., Zhang, P., et al.: Research on power quality evaluation based on radar chart method and fuzzy membership degree. Energy Power Eng. 09(4), 725–734 (2017)
Article Google Scholar
Liang, L.K., Jing-Bo, L.I.: Discussion on the extraction and standardization of TCM symptom based on maximum probability method. China J. Tradit. Chin. Med. Pharm. 5, 060 (2017)
Google Scholar
Huang, G.B., Zhu, Q.Y., Siew, C.K.: Extreme learning machine: a new learning scheme of feed-ward neural networks. In: Proceedings of IEEE International Joint Conference on Neural Networks, vol. 2, pp. 985–990. IEEE (2005)
Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: International Conference on Learning Representations (2015)
Google Scholar
Graves, A.: Long short-term memory. In: Graves, A. (ed.) Supervised Sequence Labelling with Recurrent Neural Networks. Springer, Heidelberg (2012). doi:10.1007/978-3-642-24797-2_4
Chapter Google Scholar
Greff, K., Srivastava, R.K., Koutnik, J., et al.: LSTM: a search space odyssey. IEEE Trans. Neural Netw. Learn. Syst. PP(99), 1–11 (2016)
Google Scholar
Ma, X., Hovy, E.: End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF (2016)
Google Scholar
Abu-El-Haija, S., Kothari, N., Lee, J., et al.: YouTube-8M: A Large-Scale Video Classification Benchmark (2016)
Google Scholar
He, K., Zhang, X., Ren, S., et al.: Deep residual learning for image recognition, pp. 770–778 (2015)
Google Scholar
Naseem, I., Togneri, R., Bennamoun, M.: Linear regression for face recognition. IEEE Trans. Pattern Anal. Mach. Intell. 32(11), 2106–2112 (2010)
Article Google Scholar
Tabib Mahmoudi, F., Samadzadegan, F., Reinartz, P.: A decision level fusion method for object recognition using multi-angular imagery. Proc. SPIE XL1/W3(1), 62795F-62795F-6 (2013)
Google Scholar
Vedaldi, A., Lenc, K.: MatConvNet: Convolutional Neural Networks for MATLAB. Eprint Arxiv, pp. 689–692 (2014)
Google Scholar

Download references

Acknowledgments

This work was supported by the Fundamental Research Funds for the Central Universities of China, Natural Science Foundation of China, and Natural Science Fund of Heilongjiang Province of China under Grand Nos. HEUCFJ170404, 61573114, 61703119, F2015033 and QC2017070.

Author information

Authors and Affiliations

Harbin Engineering University, Harbin, China
Kejun Wang, Meichen Liu, XueSen Hao & Xianglei Xing

Authors

Kejun Wang
View author publications
You can also search for this author in PubMed Google Scholar
Meichen Liu
View author publications
You can also search for this author in PubMed Google Scholar
XueSen Hao
View author publications
You can also search for this author in PubMed Google Scholar
Xianglei Xing
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xianglei Xing .

Editor information

Editors and Affiliations

Tsinghua University, Beijing, China
Jie Zhou
Beihang University, Beijing, China
Yunhong Wang
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Zhenan Sun
Computing and Technology, Chinese Academy of Sciences, Beijing, China
Yong Xu
Shenzhen University, Shenzhen, China
Linlin Shen
Tsinghua University, Beijing, China
Jianjiang Feng
Chinese Academy of Sciences, Beijing, China
Shiguang Shan
Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
Yu Qiao
Graduate School at Shenzhen, Tsinghua University, Shenzhen, China
Zhenhua Guo
Shenzhen University, Shenzhen, China
Shiqi Yu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, K., Liu, M., Hao, X., Xing, X. (2017). Decision-Level Fusion Method Based on Deep Learning. In: Zhou, J., et al. Biometric Recognition. CCBR 2017. Lecture Notes in Computer Science(), vol 10568. Springer, Cham. https://doi.org/10.1007/978-3-319-69923-3_72

Download citation

DOI: https://doi.org/10.1007/978-3-319-69923-3_72
Published: 20 October 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-69922-6
Online ISBN: 978-3-319-69923-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics