Discriminative Deep Belief Network for Indoor Environment Classification Using Global Visual Features

Abstract

Indoor environment classification, also known as indoor environment recognition, is a highly valuable perceptual ability for mobile robots. In this paper, we present a novel approach centered on biologically inspired methods for the recognition and representation of indoor environments. First, global visual features are extracted with the GIST descriptor; these features are then used to train a discriminative deep belief network (DDBN) classifier. The DDBN employs a deep architecture based on restricted Boltzmann machines (RBMs) and a joint density model, and back-propagation is applied across the entire classifier to fine-tune the weights for optimal classification. Experimental results validate our approach: it performs well on both real-world and synthetic datasets and outperforms Convolutional Neural Networks (ConvNets) in terms of computational efficiency.
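
To make the pipeline concrete, the following is a minimal sketch, not the authors' implementation, of the two stages the abstract describes: greedy layer-wise RBM pretraining on GIST feature vectors, followed by supervised fine-tuning. The 512-dimensional GIST input, the layer sizes, the learning rate, and the use of one-step contrastive divergence (CD-1) are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class RBM:
    """Bernoulli restricted Boltzmann machine trained with CD-1."""
    def __init__(self, n_vis, n_hid, lr=0.05):
        self.W = rng.normal(0.0, 0.01, (n_vis, n_hid))
        self.b_vis = np.zeros(n_vis)
        self.b_hid = np.zeros(n_hid)
        self.lr = lr

    def hidden_probs(self, v):
        return sigmoid(v @ self.W + self.b_hid)

    def cd1_update(self, v0):
        # Positive phase: hidden activations driven by the data.
        h0 = self.hidden_probs(v0)
        h_sample = (rng.random(h0.shape) < h0).astype(float)
        # Negative phase: one Gibbs step back to a visible reconstruction.
        v1 = sigmoid(h_sample @ self.W.T + self.b_vis)
        h1 = self.hidden_probs(v1)
        n = v0.shape[0]
        # Contrastive-divergence gradient estimates.
        self.W += self.lr * (v0.T @ h0 - v1.T @ h1) / n
        self.b_vis += self.lr * (v0 - v1).mean(axis=0)
        self.b_hid += self.lr * (h0 - h1).mean(axis=0)

def pretrain_stack(X, layer_sizes, epochs=10):
    """Greedy layer-wise pretraining: each RBM learns on the codes below it."""
    rbms, data = [], X
    for n_hid in layer_sizes:
        rbm = RBM(data.shape[1], n_hid)
        for _ in range(epochs):
            rbm.cd1_update(data)
        rbms.append(rbm)
        data = rbm.hidden_probs(data)
    return rbms

# Example: pretrain a 512-1024-1024 stack on stand-in "GIST" vectors.
X = rng.random((256, 512))           # placeholder for real GIST descriptors
rbms = pretrain_stack(X, [1024, 1024])
codes = X
for rbm in rbms:
    codes = rbm.hidden_probs(codes)  # deep representation for the classifier
```

After pretraining, the learned weights would initialize a feed-forward network topped with a softmax layer over the scene classes, and the whole stack would be fine-tuned end-to-end with back-propagation on the labeled GIST vectors, which is the fine-tuning step the abstract refers to.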


Notes

  1. http://categorizingplaces.com/dataset.html

  2. https://keras.io/


Acknowledgments

The authors would like to thank Mr. Mohammad Ali Keyvanrad of the Laboratory for Intelligent Multimedia Processing (LIMP), Amirkabir University of Technology, Tehran, Iran, for his discussions and Matlab toolbox, which helped improve the paper.

Author information

Corresponding author

Correspondence to Nabila Zrira.

Ethics declarations

Conflict of Interest

The authors declare that they have no conflict of interest.

Informed Consent

All procedures followed were in accordance with the ethical standards of the responsible committee on human experimentation (institutional and national) and with the Helsinki Declaration of 1975, as revised in 2008 (5). Additional informed consent was obtained from all participants for which identifying information is included in this article.

Human and Animal Rights

This article does not contain any studies with human or animal subjects performed by any of the authors.


About this article


Cite this article

Zrira, N., Khan, H.A. & Bouyakhf, E.H. Discriminative Deep Belief Network for Indoor Environment Classification Using Global Visual Features. Cogn Comput 10, 437–453 (2018). https://doi.org/10.1007/s12559-017-9534-9
