Abstract
Dimensionality reduction refers to reducing the number of attributes that are being considered, by producing a set of principal variables. It can be divided into feature selection and feature extraction. Dimensionality reduction serves as one of the preliminary challenges in storage management and is useful for effective transmission over the Internet. In this paper, we propose a deep learning approach using encoder–decoder networks for effective (almost-lossless) compression and encryption. The neural network essentially encrypts data into an encoded format which can only be decrypted using the corresponding decoders. Clustering is essential to reduce the variation in the dataset to ensure overfit. Using clustering resulted in a net gain of 1% over the standard encoder architecture over three MNIST datasets. The compression ratio achieved was 24.6:1. The usage of image datasets is for visualization only and the proposed pipeline could be applied for textual and visual data as well.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Kambhatla, N., Leen, T.K.: Dimension reduction by local principal component analysis. Neural Comput 9(7), 1493–1516 (1997)
Sehgal, S., Singh, H., Agarwal, M., Bhasker, V. et al.: Data analysis using principal component analysis. In: 2014 International Conference on Medical Imaging, m-Health and Emerging Communication Systems (MedCom), p. 45 (2014)
Hu, C., Hou, X., Lu, Y.: Improving the architecture of an autoencoder for dimension reduction. In: 2014 IEEE 14th International Conference on Scalable Computing and Communications and its Associated Workshops, p. 855 (2014)
Zebang, S., Sei-ichiro, K.: Densely connected AutoEncoders for image compression. In: 2nd International Conference on Image and Graphics Processing, p. 78 (2019)
Vincent, P., Larochelle, H., Bengio, Y., Manzagol, P.-A: Extracting and composing robust features with denoising autoencoders. In: Proceedings of the 25th International Conference on Machine Learning, p. 1096 (2008)
Huang, S., Ward, M.O., Rundensteiner, E.A.: Exploration of dimensionality reduction for text visualization. In: Coordinated and Multiple Views in Exploratory Visualization (CMV’05), p. 63 (2005)
Moravec, P., Snasel, V.: Dimension reduction methods for image retrieval. In: Sixth International Conference on Intelligent Systems Design and Applications, vol. 2, p. 1055 (2006)
Kasun, L.L., Yang, Y., Huang, G.B., Zhang, Z.: Dimension reduction with extreme learning machine. IEEE Trans. Image Process. 25(8), 3906–3918 (2016)
Hartigan, J.A., Wong, M.A.: Algorithm AS 136: a K-Means clustering algorithm. J. R. Stat. Soc. Ser. C (Appl Stat) 28, 100 (1981). http://www.jstor.org/stable/2346830
Wang, W., Huang, Y., Wang, Y., Wang, L.: Generalized autoencoder: a neural network framework for dimensionality reduction. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, p. 490 (2014)
Tan, S., Mayrovouniotis, Michael L.: Reducing data dimensionality through optimizing neural network inputs. AIChE J. 41(6), 1471–1480 (1995)
Sakurada, M., Takehisa, Y.: Anomaly detection using autoencoders with nonlinear dimensionality reduction. In: Proceedings of the MLSDA 2014 2nd Workshop on Machine Learning for Sensory Data Analysis. ACM (2014)
Zhang, D., Zhou, Z.-H., Chen, S.: Semi-supervised dimensionality reduction. In: Proceedings of the 2007 SIAM International Conference on Data Mining. Society for Industrial and Applied Mathematics (2007)
Bi, J., et al.: Dimensionality reduction via sparse support vector machines. J. Mach. Learn. Res. 3, 1229–1243 (2003)
Dash, M., Liu, H., Yao, J.: Dimensionality reduction of unsupervised data. In: Proceedings Ninth IEEE International Conference on Tools with Artificial Intelligence. IEEE (1997)
Acknowledgements
We would like to thank the providers of the publicly available datasets, which facilitated the comparisons reported in this paper. We would also like to thank the faculty of the Department of Information Technology, National Institute of Technology, Surathkal for insightful technical discussions.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Mukesh, B.R., Madhumitha, N., Aditya, N.P., Vivek, S., Anand Kumar M. (2021). Clustering Enhanced Encoder–Decoder Approach to Dimensionality Reduction and Encryption. In: Bhateja, V., Peng, SL., Satapathy, S.C., Zhang, YD. (eds) Evolution in Computational Intelligence. Advances in Intelligent Systems and Computing, vol 1176. Springer, Singapore. https://doi.org/10.1007/978-981-15-5788-0_73
Download citation
DOI: https://doi.org/10.1007/978-981-15-5788-0_73
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-5787-3
Online ISBN: 978-981-15-5788-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)