Abstract
The major issue in pattern classification is in the extraction of features in the training phase. The focus of this work is on combining the ability of wavelet networks and the deep learning techniques to propose a new supervised feature extraction method to pattern classification. This new approach allows the classification of all classes of the dataset by the reconstruction of a Deep Stacked wavelet Auto-Encoder. This Network is obtained after a series of wavelet Auto-Encoders followed by a Softmax classifier at the last layer. Finally, a fine-tuning is applied for the improvement of our result using a back propagation algorithm. Our approach is tested with different image datasets which are the COIL-100, the APTI and the ImageNet datasets and is also tested with two other audio corpuses that contain Arabic words and French words. The experimental test demonstrates the efficiency of our network for image and audio classification compared to other methods.
Similar content being viewed by others
References
Amin A, Al-Sadoun H, Fischer S (1996) Hand-printed arabic character recognition system using an artificial network. Pattern Recogn 29(4):663–675. doi:10.1016/0031-3203(95)00110-7. URL http://www.sciencedirect.com/science/article/pii/0031320395001107
Ben Amar C, Zaied M, Alim A (2005) Beta wavelets. Synthesis and application tolossy image compression In: Adv Eng Softw 36:459–474. doi:10.1016/j.advengsoft.2005.01.013
Bengio Y (2009), "Learning Deep Architectures for AI", Foundations and Trends® in Machine Learning: 2(1), 1–127. doi:10.1561/2200000006
Chen Z, Wang J, He H, Huang X (2014) A Fast Deep Learning System Using GPU (1), 1552–1555
Dammak M, Mejdoub M, Zaied M, C. Ben Amar (2012) Feature vector approximation based on wavelet network, in: Proceedings of the 4th International Conference on Agents and Artificial Intelligence (ICAART), 394–399.
Daubechies I (1992) Ten lectures on wavelets, number 61 in cbms-nsf series in applied mathematics
Deng L, Yu D (2014) DEEP LEARNING: methods and applications. Found Trends SignalProcess 7(3–4):1–134
Ejbali R, Benayed Y, Zaied M, Alimi MA (2009) Wavelet networks for phonemes recognition, International conference on systems and information processing
Ejbali R, Zaied M, Amar CB (2010) Intelligent approach to train wavelet networks forRecognition system of Arabic words. KDIR, In, pp 518–522
Ejbali R, Zaied M, C. Ben Amar (2012) Multi-input multi-output beta wavelet network: modeling of acoustic units for speech recognition Int. J. Adv. Comput. Sci. Appl. IJACSA12, Sci. Inf. Organ. (SAI), 3
El Adel A, Zaied M, Amar CB (2011) Learning wavelet networks based on multiresolution analysis: application to images copy detection. International Conference on Communications, Computing and Control Applications, pp 1–6
Eladel A, Ejbali R, Zaied M, C. Ben Amar (2014a) A New Semantic Approach for CBIRBased on BetaWavelet Network Modeling Shape Refined by Texture and Color Features pp. 378–385
Eladel A, Zaied M, C. Ben Amar (2014b) A new system for image retrieval using beta wavelet network for descriptors extraction and fuzzy decision. 232–236
Guedri B, Zaied M, Amar CB (2011) Indexing and images retrieval by content, International Conference on High Performance Computing & Simulation, pp: 369–375
Hassairi S, Ejbali R, Zaied M (2015a) A Deep Convolutional Neural Wavelet Network to supervised Arabic letter image classification. In: 15th Int. Conf. Intell. Syst. Des. Appl.
Hassairi S, Ejbali R, Zaied M (2015b) Supervised Image Classification Using DeepConvolutional Wavelets Network. In: 2015 I.E. 27th Int. Conf. Tools withArtif. Intell., 265–271. IEEE. doi:10.1109/ICTAI.2015.49
He K, Zhang X, Renand S, Sun J (2015) "Deep residual learning for image recognition." arXiv preprint arXiv:1512.03385
Hinton GE, Osindero S, Teh YW (2006) A fastlearning algorithm for deep belief nets. Neural Comput 18(7):1527–1554. doi:10.1162/neco.2006.18.7.1527
Jemai O, Zaied M, C. Ben Amar and Alimi MA (2010) FBWN: an architecture of fast betawavelet networks for image classification. In: Proc Int Jt Conf Neural Networks. doi:10.1109/IJCNN.2010.5596876
Jemai O, Zaied M, Ben Amar C, Alimi MA (2011a) Fast learning algorithm of WaveletNetwork based on fast wavelet transform. Int J Pattern Recognit Artif Intell 25(08):1297–1319. doi:10.1142/S0218001411009111
O. Jemai, M. Zaied, C. Ben Amar and A. Alimi (2011b): Pyramidal hybrid approach: wavelet network with OLS algorithm-based image classification, Int. J. Wavelets, Multiresolution Inf. Process 9 111–130.
Jemai O, Ejbali R, Zaied M, Amar CB (2015) A speech recognition system based on hybrid wavelet network including a fuzzy decision support system, International Conference on Machine Vision (ICMV 2014), 944503–944503-7
Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. Adv Neural Inf Process Syst pp:1–9
Le QV (2013) Building high-level features using large scale unsupervised learning. 2013 I.E. Int. Conf. Acoust. Speech Signal Process. pp. 8595–8598. doi:10.1109/ICASSP.2013.6639343. URL http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=6639343
LeCun Y (2012) Learning invariant feature hierarchies. In: Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics), vol. 7583 LNCS, 496–505
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444
Y. Lin, F. Lv, S. Zhu, M. Yang, T. Cour, K. Yu, L. Cao and T. Huang (2011) Large-scaleimage classification: Fast feature extraction and svm training. In: Computer Visionand Pattern Recognition (CVPR), 2011 I.E. Conference on, 1689–696. doi:10.1109/CVPR.2011.5995477
Liou CY, Cheng WC, Liou JW, Liou DR (2014) Autoencoder for words. Neurocomputing 139:84–96. doi:10.1016/j.neucom.2013.09.055
Mejdoub M, Ben Amar C (2013) Classification improvement of local feature vectors over the KNN algorithm Multimed. Tools Appl 64(1):197–218
Mejdoub M, Fonteles L, C. Ben Amar, Antonini M (2008) Fast indexing method for image retrieval using tree-structured lattices, in: International Workshop on Content-Based Multimedia Indexing (CBMI), 2008, Conference Proceedings, art. no. 4564970, pp. 365–372.
Nene S, Nayar S, Murase H (1996) Columbia Object Image Library (COIL-100). Tech.rep. (1996). URL http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.54.5914
X. Peng, R. Yan, B. Zhao, H. Tang and Z. Yi (2015) Fast low rank representation based spatialpyramid matching for image classification. Knowledge-Based Systems 90, 14–22
Picone J (1993) Signal modeling techniques in speech recognition. Proc.IEEE 81(9), 1215–1247. doi:10.1109/5.237532. URLhttp://ieeexplore.ieee.org/articleDetails.jsp?arnumber=237532
Rastegari M, Ordonez V, Redmon J, Farhadi A (2016) XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks. arXiv preprint arXiv:1603.05279
Said S, B. Ben Amor, Zaied M, C. Ben Amar, Daoudi M (2009) Fast and efficient 3D face recognition using wavelet networks, in: The 16th IEEE International Conference on Image Processing, Cairo, Egypt, November 4153–4156
Sermanet P, Chintala S, LeCun Y (2012) Convolutional neural networks applied to housenumbers digit classification. Proc. Int. Conf. Pattern Recognit. ICPR12 pp. 10–13
Shin HC, Orton MR, Collins DJ, Doran SJ, Leach MO (2013) Stacked autoencodersfor unsupervised feature learning and multiple organ detection in a pilot study using4D patient data. IEEE Trans. Pattern Anal. Mach. Intell 35(8):1930–1943. doi:10.1109/TPAMI.2012.277
Slimane F, Kanoun S, Hennebert J, Alimi AM, Ingold R (2013) A study on font-family and font-size recognition applied to Arabic word images at ultra-low resolution. Pattern Recognit. Lett 34(2):209–218. doi:10.1016/j.patrec.2012.09.012
Smagghe P, Buessler JL, Urban JP (2013) Novelty detection in image recognition usingirf neural networks properties. ESANN, In
Tsai Cw, Huang Wc, Chiang Mc (2014) Mobile, Ubiquitous, and Intelligent Computing. Lect Notes Electr Eng 274:629–636. doi:10.1007/978-3-642-40675-1
P. Vincent: Stacked DenoisingAutoencoders: learning useful representations in aDeep network with a local denoising criterion 11, 3371–3408 (2010)
Vincent P, Larochelle H, Lajoie I, Bengio Y, Manzagol PA (2010) Stacked DenoisingAutoencoders: learning useful representations in a deep network with a local denoising criterion. J Mach Learn Res 11:3371–3408
Wang Z, Bovik AC, Sheikh HR, Simoncelli EP (2004) Image quality assessment: Fromerror visibility to structural similarity. IEEE Trans Image Process 13(4):600–612. doi:10.1109/TIP.2003.819861
Witkowski B (2015) Autoencoders for Image Classification.Jagiellonian University Faculty of Mathematics and Computer Science INSTITUTE OF COMPUTER SCIENCEApr 28
Zaied M, C. Ben Amar, Alimi MA (2003) Award a new wavelet based beta function, International conference on signal, system and design, 185–191
M. Zaied, S. Said, O. Jemai and C. Ben Amar (2011) a Novel Approach for Face RecognitionBased on Fast Learning Algorithm and Wavelet Network Theory. Int. J. Wavelets,Multiresolution Inf. Process 09(06), 923–945. doi:10.1142/S0219691311004389
Zaied M, Mohamed R, C. ben Amar (2012) A power tool for content-based image retrieval using multiresolution wavelet network modeling and dynamic histograms. Int. REv.Comput. Softw. (IRECOS), 7
Zeiler MD, Fergus R (2013) Visualizing and Understanding Convolutional Networks. arXivPrepr. arXiv1311.2901 (2013). URL http://arxiv.org/abs/1311.2901
Zhang Q, Benveniste A (1992) Wavelet networks. IEEE Trans Neural Networks 3(6):889–898. doi:10.1109/72.165591
Acknowledgements
The authors would like to acknowledge the financial support of this work by grants from General Direction of Scientific Research (DGRST), Tunisia, under the ARUBprogram.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Hassairi, S., Ejbali, R. & Zaied, M. A deep stacked wavelet auto-encoders to supervised feature extraction to pattern classification. Multimed Tools Appl 77, 5443–5459 (2018). https://doi.org/10.1007/s11042-017-4461-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-017-4461-z