Abstract
It has been demonstrated that modified stacked denoising autoencoders (MSDAEs) can implement high-performance missing-value imputation schemes. In turn, complete MSDAE (CMSDAE) classifiers, which extend their inputs with target estimates produced by an auxiliary classifier and are trained layer by layer to recover both the observation and the target estimates, offer better classification results than MSDAEs. Investigating whether CMSDAEs can also improve MSDAE-based imputation is therefore of clear practical importance. In this correspondence, two imputation mechanisms based on CMSDAEs are considered. The first is a direct procedure in which the CMSDAE output is just the target. The second, suggested by the presence of the targets in the vectors to be autoencoded, applies well-known multitask learning (MTL) ideas, including the observations as a secondary task. Experimental results show that these CMSDAE structures increase the quality of missing-value imputation, the MTL versions in particular, which give the best result in 5 out of 6 missing-value problems.
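The direct imputation idea can be sketched with a single denoising-autoencoder layer in NumPy. This is a minimal illustration under stated assumptions: the class name `CompleteDAE`, the toy data, the stand-in auxiliary classifier, and the single-hidden-layer sigmoid architecture are all assumptions for exposition, not the paper's actual (deeper, layer-wise trained) CMSDAE. The key CMSDAE ingredient shown is that the vector being autoencoded appends a class estimate to the observation, and the network is trained to recover the clean complete vector from a corrupted input.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

class CompleteDAE:
    """One CMSDAE-style layer: a denoising autoencoder whose input and
    reconstruction target append a class estimate y_hat to the observation x."""
    def __init__(self, d_in, d_hid, lr=0.5):
        self.W1 = rng.normal(0.0, 0.1, (d_in, d_hid))
        self.b1 = np.zeros(d_hid)
        self.W2 = rng.normal(0.0, 0.1, (d_hid, d_in))
        self.b2 = np.zeros(d_in)
        self.lr = lr

    def forward(self, v):
        h = sigmoid(v @ self.W1 + self.b1)
        return h, sigmoid(h @ self.W2 + self.b2)

    def train_step(self, v_clean, noise=0.1):
        v = v_clean + rng.normal(0.0, noise, v_clean.shape)  # corrupt the input
        h, out = self.forward(v)
        m = v.shape[0]
        # Gradient of MSE toward the CLEAN complete vector (denoising criterion)
        g_out = (out - v_clean) * out * (1.0 - out) / m
        g_hid = (g_out @ self.W2.T) * h * (1.0 - h)
        self.W2 -= self.lr * (h.T @ g_out)
        self.b2 -= self.lr * g_out.sum(axis=0)
        self.W1 -= self.lr * (v.T @ g_hid)
        self.b1 -= self.lr * g_hid.sum(axis=0)

# Toy data: four observed features plus one auxiliary class estimate.
X = rng.uniform(0.2, 0.8, (200, 4))
y_hat = sigmoid(X.sum(axis=1, keepdims=True) - 2.0)  # stand-in auxiliary classifier
V = np.hstack([X, y_hat])  # "complete" vectors [x; y_hat]

dae = CompleteDAE(d_in=5, d_hid=8)
err_before = np.abs(dae.forward(V)[1] - V).mean()
for _ in range(2000):
    dae.train_step(V)
err_after = np.abs(dae.forward(V)[1] - V).mean()

# Direct imputation: blank a missing feature and read off the reconstruction.
v = V[0].copy()
v[2] = 0.0
imputed = dae.forward(v[None, :])[1][0, 2]
print(f"recon error {err_before:.3f} -> {err_after:.3f}, imputed value {imputed:.3f}")
```

The MTL variant described in the abstract differs only in the training objective: the observations become a secondary task alongside the target, which in a sketch like this would amount to weighting the observation and target components of the reconstruction error differently.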
Acknowledgements
This work has been partially supported by the Network of Excellence MAPAS (TIN2017-90567-REDT, Ministerio de Ciencia, Innovación y Universidades) and Grant 2-BARBAS (BBVA Foundation).
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Cite this article
Sánchez-Morales, A., Sancho-Gómez, JL. & Figueiras-Vidal, A.R. Complete autoencoders for classification with missing values. Neural Comput & Applic 33, 1951–1957 (2021). https://doi.org/10.1007/s00521-020-05066-4