Abstract
Searching an image or a video in a huge volume of graphical data is a tedious time-consuming process. If this search is performed using the conventional element matching technique, the complexity of the search will render the system useless. To overcome this problem, the current paper proposes a Content-Based Image Retrieval (CBIR) and a Content-Based Video Retrieval (CBVR) technique using clustering algorithms based on neural networks. Neural networks have proved to be quite powerful for dimensionality reduction due to their parallel computations. Retrieval of images in a large database on the basis of the content of the query image has been proved fast and efficient through practical results. Two images of the same object, but taken from different camera angles or have rotational and scaling transforms is also matched effectively. In medical domain, CBIR has proved to be a boon to the doctors. The tumor, cancer etc can be easily deducted comparing the images with normal to the images with diseases. Java and Weka have been used for implementation. The thumbnails extracted from the video facilitates the video search in a large videos database. The unsupervised nature of Self Organizing Maps (SOM) has made the software all the more robust.
Similar content being viewed by others
Data availability
The source code of the implementation and the data-set used to draw observations are openly available on Github at the URL “https://github.com/pintojoey/imivid”. Readers are encouraged to reuse and extend the implementation of the project under the MIT open source License. The original data-set of input images was taken from the work “A Biologically Inspired Algorithm for the Recovery of Shading and Reflectance Images” by Adriana Olmos, Frederick A A Kingdom (https://doi.org/10.1068/p5321).
References
Abbas OA (2008) Comparisons Between Data Clustering Algorithms., International Arab Journal of Information Technology (IAJIT), vol. 5
Ansari A, Mohammed MH (2015) Article: Content based Video Retrieval Systems - Methods, Techniques, Trends and Challenges, International Journal of Computer Applications, vol. 112, pp. 13–22, 2
Bautista PA, Yagi Y (2012) Digital simulation of staining in histopathology multispectral images: enhancement and linear transformation of spectral transmittance, Journal of Biomedical Optics, vol. 17, pp. 056013–1–056013-10
Bay H, Ess A, Tuytelaars T, Gool LV (2008) Speeded-up robust features (SURF). Comput Vis Image Underst 110:346–359
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection, in Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01, Washington
Déniz O, Bueno G, Salido J, De la Torre F (2011) Face recognition using histograms of oriented gradients. Pattern Recogn Lett 32:1598–1603
Egmont-Petersen M, de Ridder D, Handels H (2002) Image processing with neural networks - a review
Grabner M, Grabner H, Bischof H (2006) Fast approximated SIFT, in Proceedings of the 7th Asian Conference on Computer Vision - Volume Part I, Berlin
Gu C, Gao Y (2012) A content-based image retrieval system based on Hadoop and Lucene, in Proceedings of the 2012 Second International Conference on Cloud and Green Computing, Washington
Hassaballah M, Abdelmgeid AA, Alshazly HA (2016) Image features detection, description and matching," in Image Feature Detectors and Descriptors : Foundations and Applications, Cham, Springer International Publishing, p. 11–45.
Hong Nguyen TK, Belleudy C, Pham TV (2014) Power evaluation of Sobel filter on Xilinx platform, in 2014 IEEE Faible Tension Faible Consommation
Ke Y, Sukthankar R (2004) PCA-SIFT: a more distinctive representation for local image descriptors, in Proceedings of the 2004 IEEE computer society conference on computer vision and pattern recognition, 2004. CVPR 2004
Kohonen T (1990) The self-organizing map, Proceedings of the IEEE, vol. 78, pp. 1464–1480, 9
Latif A, Rasheed A, Sajid U, Ahmed J, Ali N, Ratyal NI, Zafar B, Dar SH, Sajid M, Khalil T (2019) Content-based image retrieval and feature extraction: a comprehensive review, Mathematical Problems in Engineering, vol 2019
Leutenegger S, Chli M, Siegwart RY (2011) BRISK: Binary Robust invariant scalable keypoints, in Computer Vision (ICCV), 2011 IEEE international conference on, IEEE, p. 2548–2555
Lowe DG (2004) Distinctive Image Features from Scale-Invariant Keypoints, Int. J. Comput. Vision, vol. 60, p. 91–110, 11
Niblack CW, Barber R, Equitz W, Flickner MD, Glasman EH, Petkovic D, Yanker P, Faloutsos C, Taubin G (1993) QBIC project: querying images by content, using color, texture, and shape, in Proceedings of SPIE - The International Society for Optical Engineering
Olmos A, Kingdom FAA (2004) A biologically inspired algorithm for the recovery of shading and reflectance images. Perception 33:1463–1473
Park SB, Lee JW, Kim SK (2004) Content-based image classification using a neural network. Pattern Recogn Lett 25:287–300
Prasad JD (2013) Gabor filter visualization, J Atmos Sci
Rosten E, Drummond T (2005) Fusing points and lines for high performance tracking., in ICCV
Sakr NA, ELdesouky AI, Arafat H (2016) An Efficient Fast-response Content-based Image Retrieval Framework for Big Data, Comput. Electr. Eng., vol. 54, p. 522–538, 8
Sima AA, Buckley SJ (2013) Optimizing SIFT for matching of short wave infrared and visible wavelength images. Remote Sens 5:2037–2056
Smeulders AWM, Worring M, Santini S, Gupta A, Jain R (2000) Content-based image retrieval at the end of the early years, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 22, pp. 1349–1380, 12
Tuytelaars T, Mikolajczyk K (2008) Local Invariant Feature Detectors: A Survey, Found. Trends. Comput. Graph. Vis., vol. 3, p. 177–280, 7
Yang Y, Lovell BC, Dadgostar F (2009) Content-based video retrieval (CBVR) system for CCTV surveillance videos, in Proceedings of the 2009 Digital Image Computing: Techniques and Applications, Washington
Acknowledgments
The authors would like to acknowledge the organization Indian Institute of Information Technology, Nagpur, for giving the necessary support for carrying out the research work.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that there is no conflict of interest regarding the publication of this paper.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Pinto, J., Jain, P. & Kumar, T. A content based image information retrieval and video thumbnail extraction framework using SOM. Multimed Tools Appl 80, 16683–16709 (2021). https://doi.org/10.1007/s11042-020-10227-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-020-10227-7