Vision-based entrance detection in outdoor scenes

Multimedia Tools and Applications

Abstract

Doors are important landmarks that visually impaired people and robots rely on to enter and exit buildings. Although door detection is reported to be highly accurate in indoor scenes, it remains a difficult computer vision problem in outdoor scenes. One likely reason is that the cues that characterize a simple, ordinary door, such as handles, corners, and the gap between the door and the ground, may not be visible, given the great variety of doors in outdoor environments. In this paper, we present a vision-based method for detecting building entrances in outdoor images. After line segments are extracted and redundant ones are removed, the regions between vertical lines are identified, and features are computed for each region: height, width, location, color, texture, and the number of lines inside it. Finally, prior knowledge is used to decide whether a region is a door: doors appear at the bottom of the image, have a plausible height and width, differ in color and texture from neighboring regions, and contain numerous lines. The method was tested on the eTRIMS dataset, door images from the ImageNet dataset, and our own dataset of house, apartment, and store doors, with acceptable results. The results show that our approach outperforms comparable state-of-the-art approaches.
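
The abstract does not state the thresholds, the line-pruning rules, or the color and texture features used, so the following Python sketch is only a rough illustration of the general idea, not the authors' implementation. It forms candidate regions between neighboring vertical line segments and applies three of the geometric cues named above (position near the image bottom, plausible height and width, several lines inside). The names `Region`, `candidate_regions`, `looks_like_door` and every numeric threshold are hypothetical; the color and texture comparison with neighboring regions is left out.

```python
from dataclasses import dataclass
from typing import List, Tuple

# A line segment as (x1, y1, x2, y2) in pixels; y grows downward.
Segment = Tuple[float, float, float, float]


@dataclass
class Region:
    left: float
    right: float
    top: float
    bottom: float
    lines_inside: int  # segments whose midpoint lies inside the region


def is_vertical(seg: Segment, max_tilt: float = 0.1) -> bool:
    """True if the segment deviates little from vertical (assumed tolerance)."""
    x1, y1, x2, y2 = seg
    dy = abs(y2 - y1)
    return dy > 0 and abs(x2 - x1) / dy <= max_tilt


def candidate_regions(segments: List[Segment]) -> List[Region]:
    """Build one region between each pair of neighboring vertical segments."""
    verticals = sorted(
        (s for s in segments if is_vertical(s)),
        key=lambda s: (s[0] + s[2]) / 2,
    )
    regions = []
    for a, b in zip(verticals, verticals[1:]):
        left, right = (a[0] + a[2]) / 2, (b[0] + b[2]) / 2
        # Keep only the vertical extent shared by both bounding lines.
        top = max(min(a[1], a[3]), min(b[1], b[3]))
        bottom = min(max(a[1], a[3]), max(b[1], b[3]))
        if right <= left or bottom <= top:
            continue
        inside = 0
        for x1, y1, x2, y2 in segments:
            mx, my = (x1 + x2) / 2, (y1 + y2) / 2
            if left < mx < right and top < my < bottom:
                inside += 1
        regions.append(Region(left, right, top, bottom, inside))
    return regions


def looks_like_door(r: Region, image_h: float, min_lines: int = 2) -> bool:
    """Apply illustrative geometric rules; all thresholds are assumptions."""
    height, width = r.bottom - r.top, r.right - r.left
    near_bottom = r.bottom >= 0.8 * image_h
    plausible_size = height >= 0.2 * image_h and 1.2 <= height / width <= 4.0
    return near_bottom and plausible_size and r.lines_inside >= min_lines
```

In practice the segments would come from a line segment detector, and the rule thresholds would need to be tuned or learned on labeled facade images; a fuller version would also compare each region's color and texture with its neighbors.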

Author information

Corresponding author

Correspondence to Abbas Vafaei.

About this article

Cite this article

Talebi, M., Vafaei, A. & Monadjemi, A. Vision-based entrance detection in outdoor scenes. Multimed Tools Appl 77, 26219–26238 (2018). https://doi.org/10.1007/s11042-018-5846-3

  • DOI: https://doi.org/10.1007/s11042-018-5846-3
