skip to main content
10.1145/3615884.3629428acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
short-paper

WYA: A Novel Spatial Scene Classification Framework Based on Surrounding Object Detection

Authors Info & Claims
Published:13 November 2023Publication History

ABSTRACT

Spatial scene classification has long been a prominent area of research in the field of geographic information science. In the past, traditional approaches heavily relied on retrieval methods based on image features. However, given the rapid advancements in deep learning and artificial intelligence, the efficient classification of complex spatial scenes has become increasingly crucial. This paper presents a novel framework named WYA (Where You At) that combines surrounding object detection with knowledge graph to automate the process of spatial scene classification. Initially, the input images undergo processing using object detection techniques to identify key entities within the scenes. Subsequently, a knowledge graph, which encompasses various spatial scenes, entities, and their relationships, is utilized to identity spatial scene catogories. To validate the effectiveness of the framework, experiments were conducted using eight spatial scene categories as an example. The results demonstrated a high level of consistency with actual spatial types, thus affirming the efficacy of the framework and highlighting its potential application value in the domain of spatial scene classification.

References

  1. Alexey Bochkovskiy, Chien-Yao Wang, and Hong-Yuan Mark Liao. 2020. YOLOv4: Optimal Speed and Accuracy of Object Detection. ArXiv abs/2004.10934 (2020). https://api.semanticscholar.org/CorpusID:216080778Google ScholarGoogle Scholar
  2. Marius Cordts, Mohamed Omran, Sebastian Ramos, Timo Rehfeld, Markus Enzweiler, Rodrigo Benenson, Uwe Franke, Stefan Roth, and Bernt Schiele. 2016. The Cityscapes Dataset for Semantic Urban Scene Understanding. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 3213--3223. https://doi.org/10.1109/CVPR.2016.350Google ScholarGoogle ScholarCross RefCross Ref
  3. N. Dalal and B. Triggs. 2005. Histograms of oriented gradients for human detection. In 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), Vol. 1. 886--893 vol. 1. https://doi.org/10.1109/CVPR.2005.177Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. ImageNet: A large-scale hierarchical image database. In 2009 IEEE Conference on Computer Vision and Pattern Recognition. 248--255. https://doi.org/10.1109/CVPR.2009.5206848Google ScholarGoogle ScholarCross RefCross Ref
  5. Pedro F. Felzenszwalb, Ross B. Girshick, and David McAllester. 2010. Cascade object detection with deformable part models. In 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2241--2248. https://doi.org/10.1109/CVPR.2010.5539906Google ScholarGoogle ScholarCross RefCross Ref
  6. Ross Girshick. 2015. Fast R-CNN. In 2015 IEEE International Conference on Computer Vision (ICCV). 1440--1448. https://doi.org/10.1109/ICCV.2015.169Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Kaiming He, Georgia Gkioxari, Piotr Dollár, and Ross Girshick. 2017. Mask R-CNN. In 2017 IEEE International Conference on Computer Vision (ICCV). 2980--2988. https://doi.org/10.1109/ICCV.2017.322Google ScholarGoogle ScholarCross RefCross Ref
  8. Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 770--778.Google ScholarGoogle Scholar
  9. Luis Herranz, Shuqiang Jiang, and Xiangyang Li. 2016. Scene Recognition with CNNs: Objects, Scales and Dataset Bias. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016), 571--579. https://api.semanticscholar.org/CorpusID:15429030Google ScholarGoogle ScholarCross RefCross Ref
  10. Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2012. ImageNet classification with deep convolutional neural networks. Commun. ACM 60 (2012), 84--90.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Wenbin Li, Sajad Saeedi, John McCormac, Ronald Clark, Dimos Tzoumanikas, Qing Ye, Yuzhong Huang, Rui Tang, and Stefan Leutenegger. 2018. InteriorNet: Mega-scale Multi-sensor Photo-realistic Indoor Scenes Dataset. In British Machine Vision Conference (BMVC).Google ScholarGoogle Scholar
  12. Tsung-Yi Lin, Piotr Dollár, Ross Girshick, Kaiming He, Bharath Hariharan, and Serge Belongie. 2017. Feature Pyramid Networks for Object Detection. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 936--944. https://doi.org/10.1109/CVPR.2017.106Google ScholarGoogle ScholarCross RefCross Ref
  13. Tsung Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. 2017. Focal Loss for Dense Object Detection. IEEE Transactions on Pattern Analysis Machine Intelligence PP, 99 (2017), 2999--3007.Google ScholarGoogle Scholar
  14. Tsung Yi Lin, Michael Maire, Serge Belongie, James Hays, and C. Lawrence Zitnick. 2014. Microsoft COCO: Common Objects in Context. In European Conference on Computer Vision.Google ScholarGoogle Scholar
  15. Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng Yang Fu, and Alexander C. Berg. 2016. SSD:Single Shot MultiBox Detector. In European Conference on Computer Vision, Bastian Leibe, Jiri Matas, Nicu Sebe, and Max Welling (Eds.). 21--37.Google ScholarGoogle Scholar
  16. David G. Lowe. 2004. Distinctive Image Features from Scale-Invariant Keypoints. International Journal of Computer Vision 60, 2 (2004), 91--110.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Ariadna Quattoni and Antonio Torralba. 2009. Recognizing indoor scenes. In 2009 IEEE Conference on Computer Vision and Pattern Recognition. 413--420. https://doi.org/10.1109/CVPR.2009.5206537Google ScholarGoogle ScholarCross RefCross Ref
  18. Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. 2016. You Only Look Once: Unified, Real-Time Object Detection. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 779--788. https://doi.org/10.1109/CVPR.2016.91Google ScholarGoogle ScholarCross RefCross Ref
  19. Joseph Redmon and Ali Farhadi. 2017. YOLO9000: Better, Faster, Stronger. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 6517--6525. https://doi.org/10.1109/CVPR.2017.690Google ScholarGoogle ScholarCross RefCross Ref
  20. Joseph Redmon and Ali Farhadi. 2018. YOLOv3: An Incremental Improvement. ArXiv abs/1804.02767 (2018). https://api.semanticscholar.org/CorpusID:4714433Google ScholarGoogle Scholar
  21. Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2017. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence 39, 6 (2017), 1137--1149. https://doi.org/10.1109/TPAMI.2016.2577031Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. K. Simonyan and A. Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. In International Conference on Learning Representations.Google ScholarGoogle Scholar
  23. Mingxing Tan, Ruoming Pang, and Quoc V. Le. 2020. EfficientDet: Scalable and Efficient Object Detection. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 10778--10787. https://doi.org/10.1109/CVPR42600.2020.01079Google ScholarGoogle ScholarCross RefCross Ref
  24. P. Viola and M. Jones. 2001. Rapid object detection using a boosted cascade of simple features. In Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001, Vol. 1. I--I. https://doi.org/10.1109/CVPR.2001.990517Google ScholarGoogle ScholarCross RefCross Ref
  25. Yajun Wang, Qizi Mu, Moyin Yu, Honghai Wang, and Liyuan Zhu. 2023. Out-door Scene Recognition Based on Convolutional Neural Network. Automation Application 64, 201--207 (2023).Google ScholarGoogle Scholar
  26. Degang Xu, Lu Wand, and Fan Li. 2021. Review of Typical Object Detection Algorithms for Deep Learning. Computer Engineering and Applications 10--25 (2021).Google ScholarGoogle Scholar
  27. Shun Zhang, Yihong Gong, and Jinjun Wang. 2019. The Development of Deep Convolution Neural Network and Its Applications on Computer Vision. Chinese Journal of Computers 42, 453--482 (2019).Google ScholarGoogle Scholar
  28. Bolei Zhou, Agata Lapedriza, Aditya Khosla, Aude Oliva, and Antonio Torralba. 2018. Places: A 10 Million Image Database for Scene Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 40, 6 (2018), 1452--1464. https://doi.org/10.1109/TPAMI.2017.2723009Google ScholarGoogle ScholarCross RefCross Ref
  29. Zhengxia Zou, Keyan Chen, Zhenwei Shi, Yuhong Guo, and Jieping Ye. 2023. Object Detection in 20 Years: A Survey. Proc. IEEE 111, 3 (2023), 257--276.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. WYA: A Novel Spatial Scene Classification Framework Based on Surrounding Object Detection

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      EM-GIS '23: Proceedings of the 8th ACM SIGSPATIAL International Workshop on Security Response using GIS
      November 2023
      59 pages
      ISBN:9798400703461
      DOI:10.1145/3615884
      • Editors:
      • Yan Huang,
      • Jean-Claude Thill,
      • Hui Zhang,
      • Danhuai Guo,
      • Yi Liu,
      • Wei Xu,
      • Bin Chen

      Copyright © 2023 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 13 November 2023

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • short-paper
      • Research
      • Refereed limited

      Acceptance Rates

      Overall Acceptance Rate30of54submissions,56%
    • Article Metrics

      • Downloads (Last 12 months)34
      • Downloads (Last 6 weeks)2

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader