Skip to main content

Advertisement

Log in

RETRACTED ARTICLE: Content based video retrieval system based on multimodal feature grouping by KFCM clustering algorithm to promote human–computer interaction

  • Original Research
  • Published:
Journal of Ambient Intelligence and Humanized Computing Aims and scope Submit manuscript

This article was retracted on 06 June 2022

This article has been updated

Abstract

Content Based Video Retrieval (CBVR) is so popular these days, because of the increased utilization of video based analytical systems. Video based analytics is quite effective than image analysis, as a series of actions are captured by the video. This ends up with better decision making ability. The CBVR systems play an important role in boosting the human–computer interaction. This paper presents a multimodal CBVR that takes both the visual and audio information into account for retrieving relevant videos to the user. Two modules are employed by this work to deal with video and audio data. The video data is processed to detect the significant frame from shots and is achieved by Lion Optimization Algorithm (LOA). The features are extracted from the visual data and with respect to the audio data, MHEC and LPCC features are extracted. The extracted features are clustered by Kernelized Fuzzy C Mean (KFCM) algorithm. Finally, the feature database is formed and is utilized in the query matching process during the testing phase. The performance of the proposed work is tested in terms of precision, recall, F-measure and time consumption rates. The proposed CBVR system proves better performance than the existing approaches and is evident through attained results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Change history

References

  • Antoniol G, Rollo VF, Venturi G (2005) Linear predictive coding and cepstrum coefficients for mining time variant information from software repositories. In: Proceedings of the 2005 international workshop on mining software repositories

  • Araujo A, Girod B (2017) Large-scale video retrieval using image queries. IEEE Trans Circuits Syst Video Technol 28(6):1406–1420

    Article  Google Scholar 

  • Baroffio L, Canclini A, Cesana M, Redondi A, Tagliasacchi M, Tubaro S (2015) Coding local and global binary visual features extracted from video sequences. IEEE Trans Image Process 24(11):3546–3560

    Article  MathSciNet  Google Scholar 

  • Bi C, Yuan Y, Zhang J, Shi Y, Xiang Y, Wang Y, Zhang R (2018) Dynamic mode decomposition based video shot detection. IEEE Access 6:21397–21407

    Article  Google Scholar 

  • Castañón G, Elgharib M, Saligrama V, Jodoin PM (2015) Retrieval in long-surveillance videos using user-described motion and object attributes. IEEE Trans Circuits Syst Video Technol 26(12):2313–2327

    Article  Google Scholar 

  • Chiu CY, Chao SP, Wu MY, Yang SN, Lin HC (2004) Content-based retrieval for human motion data. J Vis Commun Image Represent 15(3):446–466

    Article  Google Scholar 

  • Chou CL, Chen HT, Lee SY (2015) Pattern-based near-duplicate video retrieval and localization on web-scale videos. IEEE Trans Multimed 17(3):382–395

    Article  Google Scholar 

  • Dong J, Li X, Snoek CG (2018) Predicting visual features from text for image and video caption retrieval. IEEE Trans Multimed 20(12):3377–3388

    Article  Google Scholar 

  • Feng Y, Zhou P, Xu J, Ji S, Wu DO (2018) Video big data retrieval over media cloud: a context-aware online learning approach. IEEE Trans Multimed 21(7):1762–1777. https://doi.org/10.1109/TMM.2018.2885237

  • Fernandes FC, van Spaendonck RL, Burrus CS (2004) Multidimensional, mapping-based complex wavelet transforms. IEEE Trans Image Process 14(1):110–124

    Article  MathSciNet  Google Scholar 

  • Git clone. https://github.com/gtoderici/sports-1m-dataset.git. Accessed 10 Feb 2020

  • Hu W, Xie N, Li L, Zeng X, Maybank S (2011) A survey on visual content-based video indexing and retrieval. IEEE Trans Syst Man Cybern Part C Appl Rev 41(6):797–819

    Article  Google Scholar 

  • Jawahar CV, Chennupati B, Paluri B, Jammalamadaka N (2005) Video retrieval based on textual queries. In: Proceedings of the thirteenth international conference on advanced computing and communications, Coimbatore, pp 1–6

  • Kekre HB, Mishra D, Rege MP (2015) Survey on recent techniques in content based video retrieval. Int J Eng Tech Res IJETR 3(5):69–73

    Google Scholar 

  • Khotanzad A, Hong YH (1990) Invariant image recognition by Zernike moments. IEEE Trans Pattern Anal Mach Intell 12(5):489–498

    Article  Google Scholar 

  • Kurzhals K, John M, Heimerl F, Kuznecov P, Weiskopf D (2016) Visual movie analytics. IEEE Trans Multimed 18(11):2149–2160

    Article  Google Scholar 

  • Li C, Zhou B (2020) Fast key-frame image retrieval of intelligent city security video based on deep feature coding in high concurrent network environment. J Ambient Intell Human Comput. https://doi.org/10.1007/s12652-020-01679-8

  • Li K, Li S, Oh S, Fu Y (2017) Videography-based unconstrained video analysis. IEEE Trans Image Process 26(5):2261–2273

    Article  MathSciNet  Google Scholar 

  • Lokoc J, Bailer W, Schoeffmann K, Muenzer B, Awad G (2018) On influential trends in interactive video retrieval: video browser showdown 2015–2017. IEEE Trans Multimed 20(12):3361–3376

    Article  Google Scholar 

  • Marques O, Furht B (2002) Content-based image and video retrieval, vol 21. Springer Science & Business Media, New York

    Book  Google Scholar 

  • Mccomb K et al (1993) Female lions can identify potentially infanticidal males from their roars. Proc R Soc Lond Ser B Biol Sci 252(1333):59–64

    Article  Google Scholar 

  • Sadjadi SO, Hasan T, Hansen JHL (2012) Mean Hilbert Envelope Coefficients (MHEC) for robust speaker recognition. In: Interspeech, pp 1696–1699

  • Selesnick IW (2006) A higher density discrete wavelet transform. IEEE Trans Signal Process 54(8):3039–3048

    Article  MathSciNet  Google Scholar 

  • Shen RK, Lin YN, Juang TTY, Shen VR, Lim SY (2017) Automatic Detection of video shot boundary in social media using a hybrid approach of HLFPN and keypoint matching. IEEE Trans Comput Soc Syst 5(1):210–219

    Article  Google Scholar 

  • Thomas SS, Gupta S, Subramanian VK (2018) Context driven optimized perceptual video summarization and retrieval. In: IEEE transactions on circuits and systems for video technology

  • Veltkamp RC, Burkhardt H, Kriegel HP (eds) (2013) State-of-the-art in content-based image and video retrieval, vol 22. Springer Science & Business Media, New York

  • Wang Y, Zhang Y, Sheng M, Guo K (2019) On the interaction of video caching and retrieving in multi-server mobile-edge computing systems. IEEE Wirel Commun Lett 8(5):1444–1447

    Article  Google Scholar 

  • Yang H, Meinel C (2014) Content based lecture video retrieval using speech and video text information. IEEE Trans Learn Technol 7(2):142–154

    Article  Google Scholar 

  • Yao J, Zhang YT (2001) Bionic wavelet transform: a new time–frequency method based on an auditory model. IEEE Trans Biomed Eng 48(8):856–863

    Article  Google Scholar 

  • Yazdani M, Jolai F (2016) Lion optimization algorithm (LOA): a nature-inspired metaheuristic algorithm. J Comput Des Eng 3:24–36

    Google Scholar 

  • Zhang X, Ma S, Wang S, Zhang X, Sun H, Gao W (2016) A joint compression scheme of video feature descriptors and visual content. IEEE Trans Image Process 26(2):633–647

    Article  MathSciNet  Google Scholar 

  • Zhang P, Thomas T, Zhuo T, Huang W, Huang H (2017) Object coding based video authentication for privacy protection in immersive communication. J Ambient Intell Humaniz Comput 8(6):871–884

    Article  Google Scholar 

  • Zhenjiang M (2000) Zernike moment—based image shape analysis and its application. Pattern Recogn Lett 21:169–177

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to T. Prathiba.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article has been retracted. Please see the retraction notice for more detail: https://doi.org/10.1007/s12652-022-04085-4"

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Prathiba, T., Kumari, R.S.S. RETRACTED ARTICLE: Content based video retrieval system based on multimodal feature grouping by KFCM clustering algorithm to promote human–computer interaction. J Ambient Intell Human Comput 12, 6215–6229 (2021). https://doi.org/10.1007/s12652-020-02190-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12652-020-02190-w

Keywords

Navigation