A Review on the Video Summarization and Glaucoma Detection

Correia, Tales; Cunha, António; Coelho, Paulo

doi:10.1007/978-3-031-32029-3_14

Part of the book series: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering ((LNICST,volume 484))

Included in the following conference series:

International Conference on Wireless Mobile Communication and Healthcare

281 Accesses

Abstract

Glaucoma is a severe disease that arises from low intraocular pressure, it is asymptomatic in the initial stages and can lead to blindness, due to its degenerative characteristic. There isn’t any available cure for it, and it is the second most common cause of blindness in the world. Regular visits to the ophthalmologist are the best way to prevent or contain it, with a precise diagnosis performed with professional equipment. From another perspective, for some individuals or populations, this task can be difficult to accomplish, due to several restrictions, such as low incoming resources, geographical adversities, and traveling restrictions (distance, lack of means of transportation, etc.). Also, logistically, due to its dimensions, relocating the professional equipment can be expensive, thus becoming inviable to bring them to remote areas. As an alternative, some low-cost products are available in the market that copes with this need, namely the D-Eye lens, which can be attached to a smartphone and enables the capture of fundus images, presenting as major drawback lower quality imaging when compared to professional equipment. Some techniques rely on video capture to perform summarization and build a full image with the desired features. In this context, the goal of this paper is to present a review of the methods that can perform video summarization and methods for glaucoma detection, combining both to indicate if individuals present glaucoma symptoms, as a pre-screening approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Cowan, C.S., et al.: Cell types of the human retina and its organoids at single-cell resolution. Cell 182(6), 1623–1640 (2020)
Article Google Scholar
Xu, L., Zhang, K., Yang, G., Chu, J.: Gesture recognition using dual-stream CNN based on fusion of sEMG energy kernel phase portrait and IMU amplitude image. Biomed. Sig. Process. Control 73, 103364 (2022). https://doi.org/10.1016/j.bspc.2021.103364
Article Google Scholar
Atila, O., Şengür, A.: Attention guided 3D CNN-LSTM model for accurate speech based emotion recognition. Appl. Acoust. 182, 108260 (2021)
Article Google Scholar
Lin, J., Zhong, S.H., Fares, A.: Deep hierarchical LSTM networks with attention for video summarization. Comput. Electr. Eng. 97, 107618 (2022). https://doi.org/10.1016/j.compeleceng.2021.107618
Article Google Scholar
Zhao, B., Gong, M., Li, X.: Hierarchical multimodal transformer to summarize videos. Neurocomputing 468, 360–369 (2022). https://doi.org/10.1016/j.neucom.2021.10.039
Article Google Scholar
Liang, G., Lv, Y., Li, S., Wang, X., Zhang, Y.: Video summarization with a dual-path attentive network. Neurocomputing 467, 1–9 (2022). https://doi.org/10.1016/j.neucom.2021.09.015
Article Google Scholar
Hussain, T., Muhammad, K., Ding, W., Lloret, J., Baik, S.W., de Albuquerque, V.H.C.: A comprehensive survey of multi-view video summarization. Pattern Recogn. 109, 107567 (2021). https://doi.org/10.1016/j.patcog.2020.107567
Article Google Scholar
Fu, H., Wang, H.: Self-attention binary neural tree for video summarization. Pattern Recogn. Lett. 143, 19–26 (2021). https://doi.org/10.1016/j.patrec.2020.12.016
Article Google Scholar
Harakannanavar, S.S., Sameer, S.R., Kumar, V., Behera, S.K., Amberkar, A.V., Puranikmath, V.I.: Robust video summarization algorithm using supervised machine learning. Global Transitions Proc. 3(1), 131–135 (2022). https://doi.org/10.1016/j.gltp.2022.04.009
Article Google Scholar
Li, P., Ye, Q., Zhang, L., Yuan, L., Xu, X., Shao, L.: Exploring global diverse attention via pairwise temporal relation for video summarization. Pattern Recogn. 111, 107677 (2021). https://doi.org/10.1016/j.patcog.2020.107677
Article Google Scholar
Feng, X., Zhu, Y., Yang, C.: Video summarization based on fusing features and shot segmentation. In: Proceedings of 2021 7th IEEE International Conference on Network Intelligence and Digital Content, IC-NIDC 2021, pp. 383–387 (2021)
Google Scholar
Badre, S.R., Thepade, S.D.: Summarization with key frame extraction using thepade’s sorted n-ary block truncation coding applied on haar wavelet of video frame. In: 2016 Conference on Advances in Signal Processing, CASP, pp. 332–336 (2016)
Google Scholar
Fei, M., Jiang, W., Mao, W.: Memorable and rich video summarization. J. Vis. Commun. Image Represent. 42, 207–217 (2017). https://doi.org/10.1016/j.jvcir.2016.12.001
Article Google Scholar
Mehmood, I., Sajjad, M., Rho, S., Baik, S.W.: Divide-and-conquer based summarization framework for extracting affective video content. Neurocomputing 174, 393–403 (2016). https://doi.org/10.1016/j.neucom.2015.05.126
Article Google Scholar
Huang, C., Wang, H.: A novel key-frames selection framework for comprehensive video summarization. IEEE Trans. Circ. Syst. Video Technol. 30(2), 577–589 (2020)
Article Google Scholar
Zhu, W., Lu, J., Han, Y., Zhou, J.: Learning multiscale hierarchical attention for video summarization. Pattern Recogn. 122, 108312 (2022). https://doi.org/10.1016/j.patcog.2021.108312
Article Google Scholar
Chai, C., et al.: Graph-based structural difference analysis for video summarization. Inf. Sci. 577, 483–509 (2021). https://doi.org/10.1016/j.ins.2021.07.012
Article MathSciNet Google Scholar
De Avila, S.E.F., Lopes, A.P.B., Da Luz, A., De Albuquerque Araújo, A.: VSUMM: a mechanism designed to produce static video summaries and a novel evaluation method. Pattern Recogn. Lett. 32(1), 56–68 (2011). https://doi.org/10.1016/j.patrec.2010.08.004
Huang, S., Li, X., Zhang, Z., Wu, F., Han, J.: User-ranking video summarization with multi-stage spatio-temporal representation. IEEE Trans. Image Process. 28(6), 2654–2664 (2019)
Article MathSciNet MATH Google Scholar
Agyeman, R., Muhammad, R., Choi, G.S.: Soccer video summarization using deep learning. In: Proceedings - 2nd International Conference on Multimedia Information Processing and Retrieval, MIPR 2019, pp. 270–273 (2019)
Google Scholar
Riahi, A., Elharrouss, O., Al-Maadeed, S.: EMD-3DCNN-based method for COVID-19 detection. Comput. Biol. Med. 142, 105188 (2022). https://doi.org/10.1016/j.compbiomed.2021.105188
Article Google Scholar
Apostolidis, E., Adamantidou, E., Metsai, A.I., Mezaris, V., Patras, I.: Video summarization using deep neural networks: a survey. Proc. IEEE 109(11), 1838–1863 (2021)
Article Google Scholar
Lei, Z., Zhang, C., Zhang, Q., Qiu, G.: FrameRank: a text processing approach to video summarization. In: Proceedings - IEEE International Conference on Multimedia and Expo, vol. 2019, pp. 368–373 (2019)
Google Scholar
Gygli, M., Grabner, H., Riemenschneider, H., Van Gool, L.: Creating summaries from user videos. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8695, pp. 505–520. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10584-0_33
Chapter Google Scholar
Song, Y., Vallmitjana, J., Stent, A., Jaimes, A.: TVSum: summarizing web videos using titles. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 07–12 June, pp. 5179–5187 (2015)
Google Scholar
VTW Dataset. http://aliensunmin.github.io/project/%0Avideo-language/
Mehta, P., et al.: Automated detection of glaucoma with interpretable machine learning using clinical data and multimodal retinal images. Am. J. Ophthalmol. 231, 154–169 (2021). https://doi.org/10.1016/j.ajo.2021.04.021
Article Google Scholar
Sudlow, C., et al.: UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 12(3), 1–10 (2015)
Article Google Scholar
Nayak, D.R., Das, D., Majhi, B., Bhandary, S.V., Acharya, U.R.: ECNet: an evolutionary convolutional network for automated glaucoma detection using fundus images. Biomed. Sig. Process. Control 67, 102559 (2021). https://doi.org/10.1016/j.bspc.2021.102559
Article Google Scholar
Li, L., Xu, M., Wang, X., Jiang, L., Liu, H.: Attention based glaucoma detection: a large-scale database and CNN model. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 10563–10572 (2019)
Google Scholar
Li, L., et al.: A large-scale database and a CNN model for attention-based glaucoma detection. IEEE Trans. Med. Imaging 39(2), 413–424 (2020). https://ieeexplore.ieee.org/document/8756196/
Venugopal, N., Mari, K., Manikandan, G., Sekar, K.R.: Phase quantized polar transformative with cellular automaton for early glaucoma detection. Ain Shams Eng. J. 12(4), 4145–4155 (2021). https://doi.org/10.1016/j.asej.2021.04.018
Article Google Scholar
Zulfira, F.Z., Suyanto, S., Septiarini, A.: Segmentation technique and dynamic ensemble selection to enhance glaucoma severity detection. Comput. Biol. Med. 139, 104951 (2021). https://doi.org/10.1016/j.compbiomed.2021.104951
Article Google Scholar
RIM-ONE (2020). https://www.ias-iss.org/ojs/IAS/article/view/2346
García, G., Colomer, A., Naranjo, V.: Glaucoma detection from raw SD-OCT volumes: a novel approach focused on spatial dependencies. Comput. Methods Programs Biomed. 200, 105855 (2021)
Article Google Scholar
Gupta, N., Garg, H., Agarwal, R.: A robust framework for glaucoma detection using CLAHE and EfficientNet. Vis. Comput. 1–14 (2021). https://doi.org/10.1007/s00371-021-02114-5
Pizer, S.M., et al.: Adaptive histogram equalization and its variations. Comput. Vis. Graph. Image Process. 39(3), 355–368 (1987). https://linkinghub.elsevier.com/retrieve/pii/S0734189X8780186X

Download references

Acknowledgements

This work is funded by FCT/MEC through national funds and, when applicable, co-funded by the FEDER-PT2020 partnership agreement under the project UIDB/00308/2020.

Author information

Authors and Affiliations

School of Technology and Management, Polytechnic of Leiria, 2411-901, Leiria, Portugal
Tales Correia & Paulo Coelho
Escola de Ciências e Tecnologias, University of Trás-os-Montes e Alto Douro, Quinta de Prados, 5001-801, Vila Real, Portugal
António Cunha
Institute for Systems and Computer Engineering, Technology and Science (INESC TEC), 4200-465, Porto, Portugal
António Cunha
Institute for Systems Engineering and Computers at Coimbra (INESC Coimbra), DEEC, Pólo II, 3030-290, Coimbra, Portugal
Paulo Coelho

Authors

Tales Correia
View author publications
You can also search for this author in PubMed Google Scholar
António Cunha
View author publications
You can also search for this author in PubMed Google Scholar
Paulo Coelho
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Paulo Coelho .

Editor information

Editors and Affiliations

University of Trás-os-Montes and Alto Douro, Vila Real, Portugal
António Cunha
University of Beira Interior, Covilha, Portugal
Nuno M. Garcia
Ossietzky Universität Oldenburg, Oldenburg, Niedersachsen, Germany
Jorge Marx Gómez
University of Trás-os-Montes and Alto Douro, Vila Real, Portugal
Sandra Pereira

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Correia, T., Cunha, A., Coelho, P. (2023). A Review on the Video Summarization and Glaucoma Detection. In: Cunha, A., M. Garcia, N., Marx Gómez, J., Pereira, S. (eds) Wireless Mobile Communication and Healthcare. MobiHealth 2022. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 484. Springer, Cham. https://doi.org/10.1007/978-3-031-32029-3_14

Download citation

DOI: https://doi.org/10.1007/978-3-031-32029-3_14
Published: 14 May 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-32028-6
Online ISBN: 978-3-031-32029-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

A Review on the Video Summarization and Glaucoma Detection