Abstract
The recent challenge faced by the users from the multimedia area is to collect the relevant object or unique image from the collection of huge data. During the classification of semantics, the media was allowed to access the text by merging the media with the text or content before the emergence of content based retrieval. After its presence, media retrieval process is made easier than earlier stages by adding the attributes to the media in the database using multi-dimensional feature vectors which are termed as descriptors. The identification this features has become major challenges, so to overcome this issue this paper focuses on a deep learning techniques named as Modified Visual Geometry Group _16, and the result of this techniques have been compared with the existing other feature extraction techniques such as conventional histogram of oriented gradients (HOG), local binary patterns (LBP) and convolution neural network (CNN) methods. In this scheme the video frame image retrieval is performed by assigning the indexing to the all video files in the database in order to perform the system more efficiently. Thus the system produces the top result matches for the similar query in comparison with the existing techniques based on accuracy, precision, recall and F1 score in optimized video frame retrieval.












Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Ahanger G, Little TD (1996) A survey of technologies for parsing and indexing digital video1. J Vis Commun Image Represent 7(1):28–43
Fablet R, Bouthemy P, Pérez P (2002) Nonparametric motion characterization using causal probabilistic models for video indexing and retrieval. IEEE Trans Image Process 11(4):393–407
Ferman AM, Tekalp AM, Mehrotra R (2002) Robust color histogram descriptors for video segment retrieval and identification. IEEE Trans Image Process 11(5):497–508
Flickner M, Sawhney H, Niblack W, Ashley J, Huang Q, Dom B, Yanker P (1995) Query by image and video content: the QBIC system. Computer 28(9):23–32
Hong S, Im W, Yang HS (2018) Cbvmr: content-based video-music retrieval using soft intra-modal structure constraint. In: Proceedings of the 2018 ACM on international conference on multimedia retrieval, pp 353–361
Jones S, Shao L (2013) Content-based retrieval of human actions from realistic video databases. Inf Sci 236:56–65
Liu H, Lu H, Xue X (2012) A segmentation and graph-based video sequence matching method for video copy detection. IEEE Trans Knowl Data Eng 25(8):1706–1718
Mühling M, Korfhage N, Müller E, Otto C, Springstein M, Langelage T, Freisleben B (2017) Deep learning for content-based video retrieval in film and television production. Multimed Tools Appl 76(21):22169–22194
Pimentel Filho CA, Santos CAS (2010) A new approach for video indexing and retrieval based on visual features. J Inf Data Manag 1(2):293–293
Rossetto L, Giangreco I, Schuldt H, Dupont S, Seddati O, SezginM, Sahillioğlu Y (2015) IMOTION—a content-based video retrieval engine. In: International conference on multimedia modeling. pp 255–260
Roy PP, Bhunia AK, Pal U (2018) Date-field retrieval in scene image and video frames using text enhancement and shape coding. Neurocomputing 274:37–49
Sahoo PK, Kanungo P, Mishra S (2018) A fast valley-based segmentation for detection of slowly moving objects. SIViP 12(7):1265–1272
Song J, Gao L, Liu L, Zhu X, Sebe N (2018) Quantization-based hashing: a general framework for scalable image and video retrieval. Pattern Recognit 75:175–187
Wang M, Ming Y, Liu Q, Yin J (2017) Image-based video retrieval using deep feature. In: 2017 IEEE international conference on smart computing (SMARTCOMP). pp 1–6
Yang H, Meinel C (2014) Content based lecture video retrieval using speech and video text information. IEEE Trans Learn Technol 7(2):142–154
Zhu Y, Huang X, Huang Q, Tian Q (2016) Large-scale video copy retrieval with temporal-concentration SIFT. Neurocomputing 187:83–91
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Kumar, B.S., Seetharaman, K. Content based video retrieval using deep learning feature extraction by modified VGG_16. J Ambient Intell Human Comput 13, 4235–4247 (2022). https://doi.org/10.1007/s12652-022-03869-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12652-022-03869-y