Abstract
Hearing loss, a partial or total inability to hear, is known as hearing impairment. Untreated hearing loss can have a bad effect on normal social communication, and it can cause psychological problems in patients. Therefore, we design a three-category classification system to detect the specific category of hearing loss, which is beneficial to be treated in time for patients. Before the training and test stages, we use the technology of data augmentation to produce a balanced dataset. Then we use deep autoencoder neural network to classify the magnetic resonance brain images. In the stage of deep autoencoder, we use stacked sparse autoencoder to generate visual features, and softmax layer to classify the different brain images into three categories of hearing loss. Our method can obtain good experimental results. The overall accuracy of our method is 99.5%, and the time consuming is 0.078 s per brain image. Our proposed method based on stacked sparse autoencoder works well in classification of hearing loss images. The overall accuracy of our method is 4% higher than the best of state-of-the-art approaches.
Similar content being viewed by others
Change history
03 January 2018
The original version of this article unfortunately contained a mistake. The reference #27 in the reference list is incorrect in that the individual chapter should be cited instead of the whole book.
References
Patel, N.S., Hunter, J.B., O'Connell, B.P., et al., Risk of progressive hearing loss in untreated superior semicircular canal dehiscence. Laryngoscope. 127(5):1181–1186, 2017.
Kurabi, A., Keithley, E.M., Housley, G.D., et al., Cellular mechanisms of noise-induced hearing loss. Hear. Res. 349:129–137, 2017.
Neuhaus, C., Lang-Roth, R., Zimmermann, U., et al., Extension of the clinical and molecular phenotype of DIAPH1-associated autosomal dominant hearing loss (DFNA1). Clin. Genet. 91(6):892–901, 2017.
Etminan, M., Westerberg, B.D., Kozak, F.K., et al., Risk of sensorineural hearing loss with macrolide antibiotics: A nested case-control study. Laryngoscope. 127(1):229–232, 2017.
Brecht, E.J., Barsz, K., Gross, B., et al., Increasing GABA reverses age-related alterations in excitatory receptive fields and intensity coding of auditory midbrain neurons in aged mice. Neurobiol. Aging. 56:87–99, 2017.
Shao, H., Jiang, H., Zhao, H., et al., A novel deep autoencoder feature learning method for rotating machinery fault diagnosis. Mech. Syst.Signal Process. 95:187–204, 2017.
Li, J., Detection of left-sided and right-sided hearing loss via fractional fourier transform. Entropy 18(5):Article ID: 194, 2016.
Nayak, D.R., Detection of unilateral hearing loss by stationary wavelet entropy. CNS Neurol. Disord. - Drug Targets. 16(2):122–128, 2017.
Chen, Y., and Chen, X.-Q, Sensorineural hearing loss detection via discrete wavelet transform and principal component analysis combined with generalized eigenvalue proximal support vector machine and Tikhonov regularization. Multimed. Tools Appl. 2016 doi:https://doi.org/10.1007/s11042-016-4087-6
Li, J., Texture analysis method based on fractional Fourier entropy and fitness-scaling adaptive genetic algorithm for detecting left-sided and right-sided sensorineural hearing loss. Fundam. Inf. 151(1–4):505–521, 2017.
Lu, H. Hearing loss detection in medical multimedia data by discrete wavelet packet entropy and single-hidden layer neural network trained by adaptive learning-rate back propagation. In 14th International Symposium on Neural Networks (ISNN). Sapporo, Japan: Springer. pp 541–549, 2017.
Gao, X., Sun, Q., and Xu, H., Multiple-rank supervised canonical correlation analysis for feature extraction, fusion and recognition. Expert Syst. Appl. 84:171–185, 2017.
Kovrlija, R., and Rondeau-Mouro, C., Multi-scale NMR and MRI approaches to characterize starchy products. Food Chem. 236:2–14, 2017.
Cortés, J.C., Navarro-Quiles, A., Romero, J.V., et al., Randomizing the parameters of a Markov chain to model the stroke disease: A technical generalization of established computational methodologies towards improving real applications. J. Comput. Appl. Math. 324:225–240, 2017.
Fan, Z., Bi, D., He, L., et al., Low-level structure feature extraction for image processing via stacked sparse denoising autoencoder. Neurocomputing. 243:12–20, 2017.
Andrei, N., Accelerated adaptive Perry conjugate gradient algorithms based on the self-scaling memoryless BFGS update. J. Comput. Appl. Math. 325:149–164, 2017.
Le, M. H., Chen, J., Wang, L., et al., Automated diagnosis of prostate cancer in multi-parametric MRI based on multimodal convolutional neural networks. Phys. Med. Biol. 62(16):6497–6514, 2017.
March, W.B., and Biros, G., Far-field compression for fast kernel summation methods in high dimensions. Appl. Comput. Harmon. Anal. 43(1):39–75, 2017.
Choi, H., Cho, K., and Bengio, Y., Context-dependent word representation for neural machine translation. Comput. Speech Lang. 45:149–160, 2017.
Sankaran, A., Vatsa, M., Singh, R., et al., Group sparse autoencoder. Image Vis. Comput. 60:64–74, 2017.
Ghosh, A.K., and Chakraborty, A., Use of EM algorithm for data reduction under sparsity assumption. Comput. Stat. 32(2):387–407, 2016.
Liu, L., Cheng, D., Tian, F., et al., Active contour driven by multi-scale local binary fitting and Kullback-Leibler divergence for image segmentation. Multimed. Tools Appl. 76(7):10149–10168, 2016.
Lin, B., Wang, Q., Zhang, J., et al., Stable prediction in high-dimensional linear models. Stat. Comput. 27(5):1401–1412, 2016.
Wang, Y.B., You, Z.H., Li, X., et al., Predicting protein-protein interactions from protein sequences by a stacked sparse autoencoder deep neural network. Mol. Biosyst. 13(7):1336–1344, 2017.
Hong, C., Yu, J., Jane, Y., et al., Three-dimensional image-based human pose recovery with hypergraph regularized autoencoders. Multimed. Tools Appl. 76(8):10919–10937, 2016.
Zeng, S., Gou, J., and Deng, L., An antinoise sparse representation method for robust face recognition via joint l 1 and l 2 regularization. Expert Syst. Appl. 82:1–9, 2017.
Iliadis, L., and Maglogiannis I. (Eds.), Scaled conjugate gradient based adaptive ANN control for SVM-DTC inductionmotor drive. Artif. Intell. Appl. Innov. 384–395, 2016. https://doi.org/10.1007/978-3-319-44944-933.
Gholami, A., Honarvar, F., and Moghaddam, H.A., Modeling the ultrasonic testing echoes by a combination of particle swarm optimization and Levenberg–Marquardt algorithms. Meas. Sci. Technol. 28(6):065001, 2017.
Zhang, N., Ding, S., and Zhang, J., Multi layer ELM-RBF for multi-label learning. Appl. Soft Comput. 43:535–545, 2016.
Wu, L.N., Improved image filter based on SPCNN. Sci. China Ser. F-Inf. Sci. 51(12):2115–2125, 2008.
Wu, L., Optimal multi-level thresholding based on maximum Tsallis entropy via an artificial bee Colony approach. Entropy. 13(4):841–859, 2011.
Acknowledgements
This paper is supported by Natural Science Foundation of China (61602250), Natural Science Foundation of Jiangsu Province (BK20150983, BK20150982).
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Ethical Approve
All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional review board and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.
Conflict of Interest
There is no conflict of interest with regards to the submission of this paper.
Additional information
This article is part of the Topical Collection on Image & Signal Processing.
A correction to this article is available online at https://doi.org/10.1007/s10916-017-0884-3.
Rights and permissions
About this article
Cite this article
Jia, W., Yang, M. & Wang, SH. Three-Category Classification of Magnetic Resonance Hearing Loss Images Based on Deep Autoencoder. J Med Syst 41, 165 (2017). https://doi.org/10.1007/s10916-017-0814-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10916-017-0814-4