Skip to main content

A Review on the Video Summarization and Glaucoma Detection

  • Conference paper
  • First Online:
Wireless Mobile Communication and Healthcare (MobiHealth 2022)

Abstract

Glaucoma is a severe disease that arises from low intraocular pressure, it is asymptomatic in the initial stages and can lead to blindness, due to its degenerative characteristic. There isn’t any available cure for it, and it is the second most common cause of blindness in the world. Regular visits to the ophthalmologist are the best way to prevent or contain it, with a precise diagnosis performed with professional equipment. From another perspective, for some individuals or populations, this task can be difficult to accomplish, due to several restrictions, such as low incoming resources, geographical adversities, and traveling restrictions (distance, lack of means of transportation, etc.). Also, logistically, due to its dimensions, relocating the professional equipment can be expensive, thus becoming inviable to bring them to remote areas. As an alternative, some low-cost products are available in the market that copes with this need, namely the D-Eye lens, which can be attached to a smartphone and enables the capture of fundus images, presenting as major drawback lower quality imaging when compared to professional equipment. Some techniques rely on video capture to perform summarization and build a full image with the desired features. In this context, the goal of this paper is to present a review of the methods that can perform video summarization and methods for glaucoma detection, combining both to indicate if individuals present glaucoma symptoms, as a pre-screening approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Cowan, C.S., et al.: Cell types of the human retina and its organoids at single-cell resolution. Cell 182(6), 1623–1640 (2020)

    Article  Google Scholar 

  2. Xu, L., Zhang, K., Yang, G., Chu, J.: Gesture recognition using dual-stream CNN based on fusion of sEMG energy kernel phase portrait and IMU amplitude image. Biomed. Sig. Process. Control 73, 103364 (2022). https://doi.org/10.1016/j.bspc.2021.103364

    Article  Google Scholar 

  3. Atila, O., Şengür, A.: Attention guided 3D CNN-LSTM model for accurate speech based emotion recognition. Appl. Acoust. 182, 108260 (2021)

    Article  Google Scholar 

  4. Lin, J., Zhong, S.H., Fares, A.: Deep hierarchical LSTM networks with attention for video summarization. Comput. Electr. Eng. 97, 107618 (2022). https://doi.org/10.1016/j.compeleceng.2021.107618

    Article  Google Scholar 

  5. Zhao, B., Gong, M., Li, X.: Hierarchical multimodal transformer to summarize videos. Neurocomputing 468, 360–369 (2022). https://doi.org/10.1016/j.neucom.2021.10.039

    Article  Google Scholar 

  6. Liang, G., Lv, Y., Li, S., Wang, X., Zhang, Y.: Video summarization with a dual-path attentive network. Neurocomputing 467, 1–9 (2022). https://doi.org/10.1016/j.neucom.2021.09.015

    Article  Google Scholar 

  7. Hussain, T., Muhammad, K., Ding, W., Lloret, J., Baik, S.W., de Albuquerque, V.H.C.: A comprehensive survey of multi-view video summarization. Pattern Recogn. 109, 107567 (2021). https://doi.org/10.1016/j.patcog.2020.107567

    Article  Google Scholar 

  8. Fu, H., Wang, H.: Self-attention binary neural tree for video summarization. Pattern Recogn. Lett. 143, 19–26 (2021). https://doi.org/10.1016/j.patrec.2020.12.016

    Article  Google Scholar 

  9. Harakannanavar, S.S., Sameer, S.R., Kumar, V., Behera, S.K., Amberkar, A.V., Puranikmath, V.I.: Robust video summarization algorithm using supervised machine learning. Global Transitions Proc. 3(1), 131–135 (2022). https://doi.org/10.1016/j.gltp.2022.04.009

    Article  Google Scholar 

  10. Li, P., Ye, Q., Zhang, L., Yuan, L., Xu, X., Shao, L.: Exploring global diverse attention via pairwise temporal relation for video summarization. Pattern Recogn. 111, 107677 (2021). https://doi.org/10.1016/j.patcog.2020.107677

    Article  Google Scholar 

  11. Feng, X., Zhu, Y., Yang, C.: Video summarization based on fusing features and shot segmentation. In: Proceedings of 2021 7th IEEE International Conference on Network Intelligence and Digital Content, IC-NIDC 2021, pp. 383–387 (2021)

    Google Scholar 

  12. Badre, S.R., Thepade, S.D.: Summarization with key frame extraction using thepade’s sorted n-ary block truncation coding applied on haar wavelet of video frame. In: 2016 Conference on Advances in Signal Processing, CASP, pp. 332–336 (2016)

    Google Scholar 

  13. Fei, M., Jiang, W., Mao, W.: Memorable and rich video summarization. J. Vis. Commun. Image Represent. 42, 207–217 (2017). https://doi.org/10.1016/j.jvcir.2016.12.001

    Article  Google Scholar 

  14. Mehmood, I., Sajjad, M., Rho, S., Baik, S.W.: Divide-and-conquer based summarization framework for extracting affective video content. Neurocomputing 174, 393–403 (2016). https://doi.org/10.1016/j.neucom.2015.05.126

    Article  Google Scholar 

  15. Huang, C., Wang, H.: A novel key-frames selection framework for comprehensive video summarization. IEEE Trans. Circ. Syst. Video Technol. 30(2), 577–589 (2020)

    Article  Google Scholar 

  16. Zhu, W., Lu, J., Han, Y., Zhou, J.: Learning multiscale hierarchical attention for video summarization. Pattern Recogn. 122, 108312 (2022). https://doi.org/10.1016/j.patcog.2021.108312

    Article  Google Scholar 

  17. Chai, C., et al.: Graph-based structural difference analysis for video summarization. Inf. Sci. 577, 483–509 (2021). https://doi.org/10.1016/j.ins.2021.07.012

    Article  MathSciNet  Google Scholar 

  18. De Avila, S.E.F., Lopes, A.P.B., Da Luz, A., De Albuquerque Araújo, A.: VSUMM: a mechanism designed to produce static video summaries and a novel evaluation method. Pattern Recogn. Lett. 32(1), 56–68 (2011). https://doi.org/10.1016/j.patrec.2010.08.004

  19. Huang, S., Li, X., Zhang, Z., Wu, F., Han, J.: User-ranking video summarization with multi-stage spatio-temporal representation. IEEE Trans. Image Process. 28(6), 2654–2664 (2019)

    Article  MathSciNet  MATH  Google Scholar 

  20. Agyeman, R., Muhammad, R., Choi, G.S.: Soccer video summarization using deep learning. In: Proceedings - 2nd International Conference on Multimedia Information Processing and Retrieval, MIPR 2019, pp. 270–273 (2019)

    Google Scholar 

  21. Riahi, A., Elharrouss, O., Al-Maadeed, S.: EMD-3DCNN-based method for COVID-19 detection. Comput. Biol. Med. 142, 105188 (2022). https://doi.org/10.1016/j.compbiomed.2021.105188

    Article  Google Scholar 

  22. Apostolidis, E., Adamantidou, E., Metsai, A.I., Mezaris, V., Patras, I.: Video summarization using deep neural networks: a survey. Proc. IEEE 109(11), 1838–1863 (2021)

    Article  Google Scholar 

  23. Lei, Z., Zhang, C., Zhang, Q., Qiu, G.: FrameRank: a text processing approach to video summarization. In: Proceedings - IEEE International Conference on Multimedia and Expo, vol. 2019, pp. 368–373 (2019)

    Google Scholar 

  24. Gygli, M., Grabner, H., Riemenschneider, H., Van Gool, L.: Creating summaries from user videos. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8695, pp. 505–520. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10584-0_33

    Chapter  Google Scholar 

  25. Song, Y., Vallmitjana, J., Stent, A., Jaimes, A.: TVSum: summarizing web videos using titles. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 07–12 June, pp. 5179–5187 (2015)

    Google Scholar 

  26. VTW Dataset. http://aliensunmin.github.io/project/%0Avideo-language/

  27. Mehta, P., et al.: Automated detection of glaucoma with interpretable machine learning using clinical data and multimodal retinal images. Am. J. Ophthalmol. 231, 154–169 (2021). https://doi.org/10.1016/j.ajo.2021.04.021

    Article  Google Scholar 

  28. Sudlow, C., et al.: UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 12(3), 1–10 (2015)

    Article  Google Scholar 

  29. Nayak, D.R., Das, D., Majhi, B., Bhandary, S.V., Acharya, U.R.: ECNet: an evolutionary convolutional network for automated glaucoma detection using fundus images. Biomed. Sig. Process. Control 67, 102559 (2021). https://doi.org/10.1016/j.bspc.2021.102559

    Article  Google Scholar 

  30. Li, L., Xu, M., Wang, X., Jiang, L., Liu, H.: Attention based glaucoma detection: a large-scale database and CNN model. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 10563–10572 (2019)

    Google Scholar 

  31. Li, L., et al.: A large-scale database and a CNN model for attention-based glaucoma detection. IEEE Trans. Med. Imaging 39(2), 413–424 (2020). https://ieeexplore.ieee.org/document/8756196/

  32. Venugopal, N., Mari, K., Manikandan, G., Sekar, K.R.: Phase quantized polar transformative with cellular automaton for early glaucoma detection. Ain Shams Eng. J. 12(4), 4145–4155 (2021). https://doi.org/10.1016/j.asej.2021.04.018

    Article  Google Scholar 

  33. Zulfira, F.Z., Suyanto, S., Septiarini, A.: Segmentation technique and dynamic ensemble selection to enhance glaucoma severity detection. Comput. Biol. Med. 139, 104951 (2021). https://doi.org/10.1016/j.compbiomed.2021.104951

    Article  Google Scholar 

  34. RIM-ONE (2020). https://www.ias-iss.org/ojs/IAS/article/view/2346

  35. García, G., Colomer, A., Naranjo, V.: Glaucoma detection from raw SD-OCT volumes: a novel approach focused on spatial dependencies. Comput. Methods Programs Biomed. 200, 105855 (2021)

    Article  Google Scholar 

  36. Gupta, N., Garg, H., Agarwal, R.: A robust framework for glaucoma detection using CLAHE and EfficientNet. Vis. Comput. 1–14 (2021). https://doi.org/10.1007/s00371-021-02114-5

  37. Pizer, S.M., et al.: Adaptive histogram equalization and its variations. Comput. Vis. Graph. Image Process. 39(3), 355–368 (1987). https://linkinghub.elsevier.com/retrieve/pii/S0734189X8780186X

Download references

Acknowledgements

This work is funded by FCT/MEC through national funds and, when applicable, co-funded by the FEDER-PT2020 partnership agreement under the project UIDB/00308/2020.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Paulo Coelho .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Correia, T., Cunha, A., Coelho, P. (2023). A Review on the Video Summarization and Glaucoma Detection. In: Cunha, A., M. Garcia, N., Marx Gómez, J., Pereira, S. (eds) Wireless Mobile Communication and Healthcare. MobiHealth 2022. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 484. Springer, Cham. https://doi.org/10.1007/978-3-031-32029-3_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-32029-3_14

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-32028-6

  • Online ISBN: 978-3-031-32029-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics