Knowledge Guided Deep Learning for General-Purpose Computer Vision Applications

Djenouri, Youcef; Belbachir, Ahmed Nabil; Jhaveri, Rutvij H.; Djenouri, Djamel

doi:10.1007/978-3-031-44237-7_18

Youcef Djenouri^15,16,
Ahmed Nabil Belbachir¹⁶,
Rutvij H. Jhaveri¹⁷ &
…
Djamel Djenouri¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14184))

Included in the following conference series:

International Conference on Computer Analysis of Images and Patterns

412 Accesses

Abstract

This research targets general-purpose smart computer vision that eliminates reliance on domain-specific knowledge to reach adaptable generic models for flexible applications. It proposes a novel approach in which several deep learning models are trained for each image. Statistical information of each trained image is then calculated and stored with the loss values of each model used in the training phase. The stored information is finally used to select the appropriate model for each new image data in the testing phase. To efficiently select the appropriate model, a kNN (k Nearest Neighbors) strategy is used to select the best model in the testing phase. The developed framework called KGDL (Knowledge Guided Deep Learning) was evaluated and tested using two computer vision benchmarks, 1) ImageNet for image classification, and 2) COCO for object detection. The results reveal the effectiveness of KGDL in terms of accuracy and competitiveness of inference runtime. In particular, it achieved \(94\%\) of classification rate in ImageNet, and 92% of intersection over union in COCO dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 49.99; Price excludes VAT (USA)

Softcover Book: USD 64.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
https://www.image-net.org/.
2.
https://cocodataset.org/.

References

Belhadi, A., Djenouri, Y., Diaz, V.G., Houssein, E.H., Lin, J.C.W.: Hybrid intelligent framework for automated medical learning. Expert. Syst. 39(6), e12737 (2022)
Article Google Scholar
Bello, I., et al.: Revisiting ResNets: improved training and scaling strategies. Adv. Neural. Inf. Process. Syst. 34, 22614–22627 (2021)
Google Scholar
Chowdhury, A.A., Hossen, M.A., Azam, M.A., Rahman, M.H.: DeepQGHO: quantized greedy hyperparameter optimization in deep neural networks for on-the-fly learning. IEEE Access 10, 6407–6416 (2022)
Article Google Scholar
Dash, T., Chitlangia, S., Ahuja, A., Srinivasan, A.: A review of some techniques for inclusion of domain-knowledge into deep neural networks. Sci. Rep. 12(1), 1–15 (2022)
Article Google Scholar
Djenouri, Y., Belhadi, A., Lin, J.C.W., Cano, A.: Adapted k-nearest neighbors for detecting anomalies on spatio-temporal traffic flow. IEEE Access 7, 10015–10027 (2019)
Article Google Scholar
Dong, W., Zhou, C., Wu, F., Wu, J., Shi, G., Li, X.: Model-guided deep hyperspectral image super-resolution. IEEE Trans. Image Process. 30, 5754–5768 (2021)
Article Google Scholar
Hou, X., Zhang, X., Liang, H., Shen, L., Lai, Z., Wan, J.: GuidedStyle: attribute knowledge guided style manipulation for semantic face editing. Neural Netw. 145, 209–220 (2022)
Article Google Scholar
Li, M., Liu, R., Wang, F., Chang, X., Liang, X.: Auxiliary signal-guided knowledge encoder-decoder for medical report generation. In: World Wide Web, pp. 1–18 (2022)
Google Scholar
Li, Y., et al.: MViTv 2: improved multiscale vision transformers for classification and detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4804–4814 (2022)
Google Scholar
Li, Y., Ouyang, S., Zhang, Y.: Combining deep learning and ontology reasoning for remote sensing image semantic segmentation. Knowl.-Based Syst. 243, 108469 (2022)
Google Scholar
Li, Y., Zhou, Y., Zhang, Y., Zhong, L., Wang, J., Chen, J.: DKDFN: domain knowledge-guided deep collaborative fusion network for multimodal unitemporal remote sensing land cover classification. ISPRS J. Photogramm. Remote. Sens. 186, 170–189 (2022)
Article Google Scholar
Liu, P., Chen, L., Chen, Z.N.: Prior-knowledge-guided deep-learning-enabled synthesis for broadband and large phase shift range metacells in metalens antenna. IEEE Trans. Antennas Propag. 70(7), 5024–5034 (2022)
Article Google Scholar
Qu, Z., Gao, L.Y., Wang, S.Y., Yin, H.N., Yi, T.M.: An improved YOLOv5 method for large objects detection with multi-scale feature cross-layer fusion network. Image Vision Comput. 125, 104518 (2022)
Google Scholar
Yang, F., Wang, R., Chen, X.: SEGA: semantic guided attention on visual prototype for few-shot learning. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 1056–1066 (2022)
Google Scholar
Yin, C., Zhao, R., Qian, B., Lv, X., Zhang, P.: Domain knowledge guided deep learning with electronic health records. In: 2019 IEEE International Conference on Data Mining (ICDM), pp. 738–747. IEEE (2019)
Google Scholar

Download references

Acknowledgment

This work is funded in part by the Research Council of Norway’s ULEARN “Unsupervised Lifelong Learning” project, which is co-funded under grant number 316080.

Author information

Authors and Affiliations

University of South-Eastern Norway, Kongsberg, Norway
Youcef Djenouri
NORCE Norwegian Research Centre, Oslo, Norway
Youcef Djenouri & Ahmed Nabil Belbachir
Pandit Deendayal Energy University, Gandhinagar, India
Rutvij H. Jhaveri
University of the West of England, Bristol, UK
Djamel Djenouri

Authors

Youcef Djenouri
View author publications
You can also search for this author in PubMed Google Scholar
Ahmed Nabil Belbachir
View author publications
You can also search for this author in PubMed Google Scholar
Rutvij H. Jhaveri
View author publications
You can also search for this author in PubMed Google Scholar
Djamel Djenouri
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Youcef Djenouri .

Editor information

Editors and Affiliations

Cyprus University of Technology, Limassol, Cyprus
Nicolas Tsapatsoulis
Cyprus University of Technology/CYENS Center of Excellence, Limassol, Cyprus
Andreas Lanitis
The University of New Mexico, Albuquerque, NM, USA
Marios Pattichis
University of Cyprus/CYENS Center of Excellence, Nicosia, Cyprus
Constantinos Pattichis
University of Cyprus/KIOS Center of Excellence, Nicosia, Cyprus
Christos Kyrkou
Cyprus University of Technology, Limassol, Cyprus
Efthyvoulos Kyriacou
Cyprus University of Technology/CYENS Center of Excellence, Limassol, Cyprus
Zenonas Theodosiou
CYENS Center of Excellence, Nicosia, Cyprus
Andreas Panayides

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Djenouri, Y., Belbachir, A.N., Jhaveri, R.H., Djenouri, D. (2023). Knowledge Guided Deep Learning for General-Purpose Computer Vision Applications. In: Tsapatsoulis, N., et al. Computer Analysis of Images and Patterns. CAIP 2023. Lecture Notes in Computer Science, vol 14184. Springer, Cham. https://doi.org/10.1007/978-3-031-44237-7_18

Download citation

DOI: https://doi.org/10.1007/978-3-031-44237-7_18
Published: 20 September 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-44236-0
Online ISBN: 978-3-031-44237-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Knowledge Guided Deep Learning for General-Purpose Computer Vision Applications