
A Novel Deep Density Model for Unsupervised Learning


Abstract

Density models are fundamental in machine learning and are widely applied in practical cognitive modeling tasks and learning problems. In this work, we introduce a novel deep density model, referred to as deep mixtures of factor analyzers with common loadings (DMCFA), together with an efficient greedy layer-wise unsupervised learning algorithm. The model employs a mixture of factor analyzers sharing common component loadings in each layer. The common loadings can be viewed as a feature selection or reduction matrix, which makes the new model more physically meaningful. Importantly, sharing common loadings remarkably reduces both the number of free parameters and the computational complexity. Consequently, inference and learning in DMCFA rely on a dramatically more succinct model, while flexibility in estimating the data density is retained by using Gaussian distributions as the priors. Our model is evaluated on five real datasets and compared with three competitive models, namely mixtures of factor analyzers (MFA), MFA with common loadings (MCFA), and deep mixtures of factor analyzers (DMFA), as well as their collapsed counterparts. The results demonstrate the superiority of the proposed model in density estimation, clustering, and generation tasks.
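For concreteness, a single layer of such a model is a mixture of factor analyzers with a common loading matrix, whose marginal density can be sketched as

\[
p(\mathbf{y}) = \sum_{c=1}^{M} \pi_c\, \mathcal{N}\!\left(\mathbf{y};\; \mathbf{A}\boldsymbol{\xi}_c,\; \mathbf{A}\boldsymbol{\Omega}_c\mathbf{A}^{\top} + \mathbf{\Psi}\right),
\]

where the \(p \times q\) loading matrix \(\mathbf{A}\) and the diagonal noise covariance \(\mathbf{\Psi}\) are shared by all \(M\) components, while each component keeps its own low-dimensional factor mean \(\boldsymbol{\xi}_c\) and factor covariance \(\boldsymbol{\Omega}_c\). The symbols follow the standard MCFA formulation and are introduced here for illustration; they may differ from the notation used in the full paper. Compared with an MFA layer, which stores a separate \(p \times q\) loading per component, sharing \(\mathbf{A}\) reduces the loading parameters from \(Mpq\) to \(pq\).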


Notes

  1. The greedy layer-wise algorithm trains a generative model with many layers of hidden variables one layer at a time.

  2. Each component of the first layer can be divided into \(M_c\) sub-components. The number of sub-components need not be the same across first-layer components.

  3. The superscript indicates the layer to which a variable belongs. Because the second-layer sub-components corresponding to a first-layer component share a common loading and a common variance of the independent noise, \(\mathbf {A}_{c}^{(2)}\) and \(\mathbf {{\Psi }}_{c}^{(2)}\) carry the subscript c. Here \(d\) denotes the subspace dimensionality of the second layer, with \(d < q\) (see the sampling sketch after these notes).

  4. http://yann.lecun.com/exdb/mnist/
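To make the two-layer structure described in these notes concrete, the following minimal sketch draws ancestral samples from a two-layer generative model with shared loadings: one common loading and noise matrix in the first layer, and one loading and noise matrix per first-layer component in the second layer. The dimensions, variable names, and random parameter values are illustrative assumptions, not the parameters produced by the paper's learning algorithm.

    import numpy as np

    rng = np.random.default_rng(0)

    # Illustrative dimensions: observed p, first-layer factors q, second-layer factors d (d < q < p).
    p, q, d = 784, 20, 5
    M, M_c = 4, 3          # first-layer components and sub-components per component (assumed equal here)

    # First layer: a single loading A1 (p x q) and diagonal noise Psi1 shared by all M components.
    A1 = rng.normal(scale=0.1, size=(p, q))
    Psi1 = 0.01 * np.ones(p)
    pi1 = np.full(M, 1.0 / M)

    # Second layer: each first-layer component c owns one loading A2[c] (q x d) and noise Psi2[c],
    # shared by its M_c sub-components; every sub-component keeps its own factor mean and covariance.
    A2 = rng.normal(scale=0.1, size=(M, q, d))
    Psi2 = 0.01 * np.ones((M, q))
    pi2 = np.full((M, M_c), 1.0 / M_c)
    xi = rng.normal(size=(M, M_c, d))              # second-layer factor means
    Omega = np.tile(np.eye(d), (M, M_c, 1, 1))     # second-layer factor covariances

    def sample_one():
        """Ancestral sampling: sub-component -> second-layer factor -> first-layer factor -> observation."""
        c = rng.choice(M, p=pi1)                                # pick a first-layer component
        s = rng.choice(M_c, p=pi2[c])                           # pick one of its sub-components
        z2 = rng.multivariate_normal(xi[c, s], Omega[c, s])     # d-dimensional second-layer factor
        z1 = A2[c] @ z2 + rng.normal(scale=np.sqrt(Psi2[c]))    # q-dimensional first-layer factor
        return A1 @ z1 + rng.normal(scale=np.sqrt(Psi1))        # p-dimensional observation

    samples = np.stack([sample_one() for _ in range(10)])
    print(samples.shape)                                        # (10, 784)

Fitting the parameters is the job of the greedy layer-wise algorithm described in the paper; the sketch only shows how samples flow through the shared loadings of the two layers.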


Funding

The work reported in this paper was partially supported by the following: National Natural Science Foundation of China (NSFC) under grant no. 61473236, Natural Science Fund for Colleges and Universities in Jiangsu Province under grant no. 17KJD520010, Suzhou Science and Technology Program under grant nos. SYG201712 and SZS201613, Jiangsu University Natural Science Research Programme under grant no. 17KJB520041, Key Program Special Fund in XJTLU (KSFA-01).

Author information


Corresponding author

Correspondence to Kaizhu Huang.

Ethics declarations

Conflict of Interest

The authors declare that they have no conflict of interest.

Ethical Approval

This article does not contain any studies with human participants performed by any of the authors.


About this article


Cite this article

Yang, X., Huang, K., Zhang, R. et al. A Novel Deep Density Model for Unsupervised Learning. Cogn Comput 11, 778–788 (2019). https://doi.org/10.1007/s12559-018-9566-9

