Abstract
Matrix-variate Restricted Boltzmann Machine (MVRBM), a variant of Restricted Boltzmann Machine, has demonstrated excellent capacity of modelling matrix variable. However, MVRBM is still an unsupervised generative model, and is usually used to feature extraction or initialization of deep neural network. When MVRBM is used to classify, additional classifiers must be added. In order to make the MVRBM itself be supervised, in this paper, we propose improved MVRBMs for classification, which can be used to classify 2D data directly and accurately. To this end, on one hand, classification constraint is added to MVRBM to get Matrix-variate Restricted Boltzmann Machine Classification Model (ClassMVRBM). On the other hand, fisher discriminant analysis criterion for matrix-style variable is proposed and applied to the hidden variable, therefore, the extracted feature is more discriminative so as to enhance the classification performance of ClassMVRBM. We call the novel model Matrix-variate Restricted Boltzmann Machine Classification Model with Fisher discriminant analysis (ClassMVRBM-MVFDA). Experimental results on some publicly available databases demonstrate the superiority of the proposed models. Of which, the image classification accuracy of ClassMVRBM is higher than conventional unsupervised RBM, its variants and supervised Restricted Boltzmann Machine Classification Model (ClassRBM) for vector variable. Especially, the image classification accuracy of the proposed ClassMVRBM-MVFDA performs better than supervised ClassMVRBM and vectorial RBM-FDA.




Similar content being viewed by others
References
Chen, L., Jiang, D., Song, H., et al. (2018). A lightweight end-side user experience data collection system for quality evaluation of multimedia communications. IEEE Access, 2018(6), 15408–15419.
Sun, M., Jiang, D., Song, H., et al. (2017). Statistical resolution limit analysis of two closely-spaced signal sources using Rao test. IEEE Access, 5, 22013–22022.
Wang, H., & Wang, J. (2013). 2DPCA with L1-norm for simultaneously robust and sparse modelling. Neural Network, 46(10), 190–198.
Yang, J., Zhang, D., Frangi, A., & Yang, J. (2004). Two-dimensional PCA: A new approach to appearance-based face representation and recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(1), 131–137.
Ju, F., Sun, Y., Gao, J., Hu, Y., & Yin, B. (2015). Image outlier detection and feature extraction via L1-norm-based 2D probabilistic PCA. IEEE Transactions on Image Processing, 24(12), 4834–4846.
Li, M., & Yuan, B. (2005). 2D-LDA: A statistical linear discriminant analysis for image matrix. Pattern Recognition Letters, 26(5), 527–532.
Fischer, A., & Igel, C. (2012). An introduction to restricted Boltzmann machines. In Progress in pattern recognition, image analysis, computer vision, and applications (pp. 14–36). Springer.
Hinton, G. E., & Salakhutdinov, R. (2006). Reducing the dimensionality of data with neural networks. Science, 313(5786), 504–507.
Wang, J., Wang, W., Wang, R., & Gao, W. (2015). Image classification using RBM to encode local descriptors with group sparse learning. In Proceedings of international conference on image processing (pp. 912–916). Canada: IEEE
Vinod Nair, G. E. H. (2010). Rectified linear units improve restricted Boltzmann machines. In Proceedings of the 27th international conference on machine learning (pp. 807–814). Israel.
Cho, K., Ilin, A., & Raiko, T. (2011) Improved learning of Gaussian-Bernoulli restricted Boltzmann machines. In Proceedings of the twenty-first international conference on artificial neural networks (pp. 10–17). Springer.
Nguyen, T., Tran, T., Phung, D., & Venkatesh, S. (2015). Tensor-variate restricted Boltzmann machines. In Proceedings of the twenty-ninth national conference on artificial intelligence (pp. 2887–2893). USA: AAAI.
Salakhutdinov, R., Mnih, A., & Hinton, G. (2007). Restricted Boltzmann machines for collaborative filtering. In Proceedings of the twenty-fourth international conference on machine learning (pp. 791–798). USA: ACM.
Dahl, G. E., Dong, Y., Li, D., & Acero, A. (2011). Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition. IEEE Transactions on Audio, Speech and Language Processing, 20(1), 30–42.
Larochelle, H., Mandel, M., Pascanu, R., et al. (2012). Learning algorithms for the classification restricted Boltzmann machine. Journal of Machine Learning Research, 13(1), 643–669.
Peng, X., Gao, X., & Li, X. (2017). An infinite classification RBM model for radar HRRP recognition. In International joint conference on neural networks (pp. 1442–1448). USA: IEEE.
Qi, G., Sun, Y., Gao, J., Hu, Y., & Li, J. (2016). Matrix variate restricted boltzmann machine. In The proceeding of 2016 international joint conference on neural networks (pp. 389–395). Canada: IEEE.
Liu, S., Sun, Y., Hu, Y., Gao, J., Ju, F., & Yin, B. (2017). Matrix variate RBM model with gaussian distributions. In The proceeding of 2017 international joint conference on neural networks (pp. 808–815). USA: IEEE.
Jiang, D., Wang, W., Shi, L., et al. (2018). A compressive sensing-based approach to end-to-end network traffic reconstruction. IEEE Transactions on Network Science and Engineering, 5(3), 1–12.
Wang, F., Jiang, D., Wen, H., et al. (2019). Adaboost-based security level classification of mobile intelligent terminals. The Journal of Supercomputing 1–19.
Xie, G. S., Zhang, X. Y., Zhang, Y. M., et al. (2014). Integrating supervised subspace criteria with restricted Boltzmann machine for feature extraction. In International joint conference on neural networks (pp. 1622–1629). IEEE.
Jiang, D., Wang, Y., Lv, Z., et al. (2019). Big data analysis-based network behavior insight of cellular networks for industry 4.0 applications. IEEE Transactions on Industrial Informatics., 10, 15–20. https://doi.org/10.1109/tii.2019.2930226.
Jiang, D., Huo, L., Lv, Z., et al. (2018). A joint multi-criteria utility-based network selection approach for vehicle-to-infrastructure networking. IEEE Transactions on Intelligent Transportation Systems, 19(10), 3305–3319.
Jiang, D., Huo, L., & Song, H. (2018). Rethinking behaviors and activities of base stations in mobile cellular networks based on big data analysis. IEEE Transactions on Network Science and Engineering, 1(1), 1–12.
Jiang, D., Zhang, P., Lv, Z., et al. (2016). Energy-efficient multi-constraint routing algorithm with load balancing for smart city applications. IEEE Internet of Things Journal, 3(6), 1437–1447.
Zhu, J., Song, Y., Jiang, D., et al. (2018). A new deep-Q-learning-based transmission scheduling mechanism for the cognitive Internet of things. IEEE Internet of Things Journal, 5(4), 2375–2385.
Gao, J., Guo, Y., & Wang, Z. (2017). Matrix neural networks. In International symposium on neural networks (pp. 313–320). Springer.
Wong, W. K., & Sun, M. (2011). Deep learning regularized fisher mappings. IEEE Transactions on Neural Networks, 22(10), 1668–1675.
Yang, J., & Yang, J. Y. (2002). From Image vector to matrix: A straightforward image projection technique—IMPCA vs PCA. Pattern Recognition, 35(9), 1997–1999.
LeCun, Y., Bottou, L., Bengio, Y., & Haffner, P. (1998). Gradient-based learning applied to document recognition. In Proceedings of the IEEE (Vol. 86, No. (11), pp. 2278–2324). IEEE.
Wang, Y., & Mori, G. (2009). Human action recognition by semilatent topic models. IEEE Transactions on Pattern Analysis & Machine Intelligence, 31(10), 1762.
Leibe, B., & Schiele, B. (2003). Analyzing appearance and contour based methods for object categorization. In Proceedings of IEEE computer society conference on computer vision and pattern recognition (pp. 1–7). USA: IEEE.
Nene, S., Nayar, S., & Murase, H. (1996). Columbia object image library (COIL-20). Technical report CUCS-005-96, USA.
Hinton, G. E. (2012). A practical guide to training restricted Boltzmann machine. LNCS, 7700, 599–619.
Tenenbaum, J., Silva, V., & Langford, J. (2000). A global geometric framework for nonlinear dimensionality reduction. Science, 290(5500), 2319–2323.
Acknowledgements
This research is supported by the Natural Science Foundation of China (Nos. 61402024, 61772049, 61632006, 61876012), and Scientific Research Common Program of Beijing Municipal Commission of Education (Nos. KM201710005022, KM201510005024) and Beijing Key Laboratory of Computational Intelligence and Intelligent System.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Li, J., Tian, P., Kong, D. et al. Discriminative matrix-variate restricted Boltzmann machine classification model. Wireless Netw 27, 3621–3633 (2021). https://doi.org/10.1007/s11276-019-02234-w
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11276-019-02234-w