Abstract
Glaucoma is a severe disease that arises from low intraocular pressure, it is asymptomatic in the initial stages and can lead to blindness, due to its degenerative characteristic. There isn’t any available cure for it, and it is the second most common cause of blindness in the world. Regular visits to the ophthalmologist are the best way to prevent or contain it, with a precise diagnosis performed with professional equipment. From another perspective, for some individuals or populations, this task can be difficult to accomplish, due to several restrictions, such as low incoming resources, geographical adversities, and traveling restrictions (distance, lack of means of transportation, etc.). Also, logistically, due to its dimensions, relocating the professional equipment can be expensive, thus becoming inviable to bring them to remote areas. As an alternative, some low-cost products are available in the market that copes with this need, namely the D-Eye lens, which can be attached to a smartphone and enables the capture of fundus images, presenting as major drawback lower quality imaging when compared to professional equipment. Some techniques rely on video capture to perform summarization and build a full image with the desired features. In this context, the goal of this paper is to present a review of the methods that can perform video summarization and methods for glaucoma detection, combining both to indicate if individuals present glaucoma symptoms, as a pre-screening approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Cowan, C.S., et al.: Cell types of the human retina and its organoids at single-cell resolution. Cell 182(6), 1623–1640 (2020)
Xu, L., Zhang, K., Yang, G., Chu, J.: Gesture recognition using dual-stream CNN based on fusion of sEMG energy kernel phase portrait and IMU amplitude image. Biomed. Sig. Process. Control 73, 103364 (2022). https://doi.org/10.1016/j.bspc.2021.103364
Atila, O., Şengür, A.: Attention guided 3D CNN-LSTM model for accurate speech based emotion recognition. Appl. Acoust. 182, 108260 (2021)
Lin, J., Zhong, S.H., Fares, A.: Deep hierarchical LSTM networks with attention for video summarization. Comput. Electr. Eng. 97, 107618 (2022). https://doi.org/10.1016/j.compeleceng.2021.107618
Zhao, B., Gong, M., Li, X.: Hierarchical multimodal transformer to summarize videos. Neurocomputing 468, 360–369 (2022). https://doi.org/10.1016/j.neucom.2021.10.039
Liang, G., Lv, Y., Li, S., Wang, X., Zhang, Y.: Video summarization with a dual-path attentive network. Neurocomputing 467, 1–9 (2022). https://doi.org/10.1016/j.neucom.2021.09.015
Hussain, T., Muhammad, K., Ding, W., Lloret, J., Baik, S.W., de Albuquerque, V.H.C.: A comprehensive survey of multi-view video summarization. Pattern Recogn. 109, 107567 (2021). https://doi.org/10.1016/j.patcog.2020.107567
Fu, H., Wang, H.: Self-attention binary neural tree for video summarization. Pattern Recogn. Lett. 143, 19–26 (2021). https://doi.org/10.1016/j.patrec.2020.12.016
Harakannanavar, S.S., Sameer, S.R., Kumar, V., Behera, S.K., Amberkar, A.V., Puranikmath, V.I.: Robust video summarization algorithm using supervised machine learning. Global Transitions Proc. 3(1), 131–135 (2022). https://doi.org/10.1016/j.gltp.2022.04.009
Li, P., Ye, Q., Zhang, L., Yuan, L., Xu, X., Shao, L.: Exploring global diverse attention via pairwise temporal relation for video summarization. Pattern Recogn. 111, 107677 (2021). https://doi.org/10.1016/j.patcog.2020.107677
Feng, X., Zhu, Y., Yang, C.: Video summarization based on fusing features and shot segmentation. In: Proceedings of 2021 7th IEEE International Conference on Network Intelligence and Digital Content, IC-NIDC 2021, pp. 383–387 (2021)
Badre, S.R., Thepade, S.D.: Summarization with key frame extraction using thepade’s sorted n-ary block truncation coding applied on haar wavelet of video frame. In: 2016 Conference on Advances in Signal Processing, CASP, pp. 332–336 (2016)
Fei, M., Jiang, W., Mao, W.: Memorable and rich video summarization. J. Vis. Commun. Image Represent. 42, 207–217 (2017). https://doi.org/10.1016/j.jvcir.2016.12.001
Mehmood, I., Sajjad, M., Rho, S., Baik, S.W.: Divide-and-conquer based summarization framework for extracting affective video content. Neurocomputing 174, 393–403 (2016). https://doi.org/10.1016/j.neucom.2015.05.126
Huang, C., Wang, H.: A novel key-frames selection framework for comprehensive video summarization. IEEE Trans. Circ. Syst. Video Technol. 30(2), 577–589 (2020)
Zhu, W., Lu, J., Han, Y., Zhou, J.: Learning multiscale hierarchical attention for video summarization. Pattern Recogn. 122, 108312 (2022). https://doi.org/10.1016/j.patcog.2021.108312
Chai, C., et al.: Graph-based structural difference analysis for video summarization. Inf. Sci. 577, 483–509 (2021). https://doi.org/10.1016/j.ins.2021.07.012
De Avila, S.E.F., Lopes, A.P.B., Da Luz, A., De Albuquerque Araújo, A.: VSUMM: a mechanism designed to produce static video summaries and a novel evaluation method. Pattern Recogn. Lett. 32(1), 56–68 (2011). https://doi.org/10.1016/j.patrec.2010.08.004
Huang, S., Li, X., Zhang, Z., Wu, F., Han, J.: User-ranking video summarization with multi-stage spatio-temporal representation. IEEE Trans. Image Process. 28(6), 2654–2664 (2019)
Agyeman, R., Muhammad, R., Choi, G.S.: Soccer video summarization using deep learning. In: Proceedings - 2nd International Conference on Multimedia Information Processing and Retrieval, MIPR 2019, pp. 270–273 (2019)
Riahi, A., Elharrouss, O., Al-Maadeed, S.: EMD-3DCNN-based method for COVID-19 detection. Comput. Biol. Med. 142, 105188 (2022). https://doi.org/10.1016/j.compbiomed.2021.105188
Apostolidis, E., Adamantidou, E., Metsai, A.I., Mezaris, V., Patras, I.: Video summarization using deep neural networks: a survey. Proc. IEEE 109(11), 1838–1863 (2021)
Lei, Z., Zhang, C., Zhang, Q., Qiu, G.: FrameRank: a text processing approach to video summarization. In: Proceedings - IEEE International Conference on Multimedia and Expo, vol. 2019, pp. 368–373 (2019)
Gygli, M., Grabner, H., Riemenschneider, H., Van Gool, L.: Creating summaries from user videos. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8695, pp. 505–520. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10584-0_33
Song, Y., Vallmitjana, J., Stent, A., Jaimes, A.: TVSum: summarizing web videos using titles. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 07–12 June, pp. 5179–5187 (2015)
VTW Dataset. http://aliensunmin.github.io/project/%0Avideo-language/
Mehta, P., et al.: Automated detection of glaucoma with interpretable machine learning using clinical data and multimodal retinal images. Am. J. Ophthalmol. 231, 154–169 (2021). https://doi.org/10.1016/j.ajo.2021.04.021
Sudlow, C., et al.: UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 12(3), 1–10 (2015)
Nayak, D.R., Das, D., Majhi, B., Bhandary, S.V., Acharya, U.R.: ECNet: an evolutionary convolutional network for automated glaucoma detection using fundus images. Biomed. Sig. Process. Control 67, 102559 (2021). https://doi.org/10.1016/j.bspc.2021.102559
Li, L., Xu, M., Wang, X., Jiang, L., Liu, H.: Attention based glaucoma detection: a large-scale database and CNN model. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 10563–10572 (2019)
Li, L., et al.: A large-scale database and a CNN model for attention-based glaucoma detection. IEEE Trans. Med. Imaging 39(2), 413–424 (2020). https://ieeexplore.ieee.org/document/8756196/
Venugopal, N., Mari, K., Manikandan, G., Sekar, K.R.: Phase quantized polar transformative with cellular automaton for early glaucoma detection. Ain Shams Eng. J. 12(4), 4145–4155 (2021). https://doi.org/10.1016/j.asej.2021.04.018
Zulfira, F.Z., Suyanto, S., Septiarini, A.: Segmentation technique and dynamic ensemble selection to enhance glaucoma severity detection. Comput. Biol. Med. 139, 104951 (2021). https://doi.org/10.1016/j.compbiomed.2021.104951
RIM-ONE (2020). https://www.ias-iss.org/ojs/IAS/article/view/2346
García, G., Colomer, A., Naranjo, V.: Glaucoma detection from raw SD-OCT volumes: a novel approach focused on spatial dependencies. Comput. Methods Programs Biomed. 200, 105855 (2021)
Gupta, N., Garg, H., Agarwal, R.: A robust framework for glaucoma detection using CLAHE and EfficientNet. Vis. Comput. 1–14 (2021). https://doi.org/10.1007/s00371-021-02114-5
Pizer, S.M., et al.: Adaptive histogram equalization and its variations. Comput. Vis. Graph. Image Process. 39(3), 355–368 (1987). https://linkinghub.elsevier.com/retrieve/pii/S0734189X8780186X
Acknowledgements
This work is funded by FCT/MEC through national funds and, when applicable, co-funded by the FEDER-PT2020 partnership agreement under the project UIDB/00308/2020.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Correia, T., Cunha, A., Coelho, P. (2023). A Review on the Video Summarization and Glaucoma Detection. In: Cunha, A., M. Garcia, N., Marx Gómez, J., Pereira, S. (eds) Wireless Mobile Communication and Healthcare. MobiHealth 2022. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 484. Springer, Cham. https://doi.org/10.1007/978-3-031-32029-3_14
Download citation
DOI: https://doi.org/10.1007/978-3-031-32029-3_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-32028-6
Online ISBN: 978-3-031-32029-3
eBook Packages: Computer ScienceComputer Science (R0)