Abstract
In this paper we depict an implemented system for medical image retrieval. Our system performs retrieval based on both textual and visual content, separately and combined, using advanced encoding and quantization techniques. The text-based retrieval subsystem uses textual data acquired from an image’s corresponding article to generate a suitable representation. Using a vector space model, the generated representations structure is altered to increase performance. Query expansion with pseudo-relevance feedback is applied to fine-tune the results. The content-based retrieval subsystem performs retrieval based on visual features extracted from the images. A Gaussian Mixture Model is constructed from the extracted visual features, in our case - RGB histograms, and is used in encoding the same features into Fisher Vectors. With scalability and speed in mind, we utilized a product quantization technique over the generated vectors, which provides fast response times over large image collections. Product quantization drastically reduces the size of the image representation at almost no cost to accuracy, thus improving the scalability factor of our system. Our system uses modality classification to further improve retrieval results. This subsystem labels the image modality based on their visual content. The images are described using state-of-the-art opponentSIFT visual features. Classification was performed using Support Vector Machines (SVMs). The predictions from the SVMs are used for re-ranking the resulting images based on their modality and the modality of the query. The system was evaluated against the standardized ImageCLEF 2013, 2012 and 2011 medical datasets and it reported state-of-the-art performance for all datasets.
Similar content being viewed by others
References
Amati G, Van Rijsbergen CJ (2002) Probabilistic models of information retrieval based on measuring the divergence from randomness. ACM Trans Inf Syst (TOIS) 20 (4):357–389
Atrey PK, Hossain MA, El Saddik A, Kankanhalli MS (2010) Multimodal fusion for multimedia analysis: a survey. Multimedia Systems 16(6):345–379
Brazier H, Begley CM (1996) Selecting a database for literature searches in nursing: Medline or cinahl? J Adv Nurs 24(4):868–875
Chang C-C, Lin C-J (2001) LIBSVM: a library for support vector machines, software available at, http://www.csie.ntu.edu.tw/cjlin/libsvm
Chatfield K, Lempitsky V, Vedaldi A, Zisserman A The devil is in the details: an evaluation of recent feature encoding methods
Clough P, Sanderson M, Müller H The clef cross language image retrieval track (imageclef) 2004. In: Image and Video Retrieval, Springer, 2004, pp. 243–251
de Herrera AGS, Markonis D, Eggel I, Müller H (2012) The medgift group in imageclefmed 2012. In: CLEF (Online Working Notes/Labs/Workshop)
de Herrera AGS, Kalpathy-Cramer J, Fushman DD, Antani S, Müller H (2013) Overview of the imageclef 2013 medical tasks. In: Working notes of CLEF 2013
Dimitrovski I, Kocev D, Kitanovski I, Loskovska S, DŻeroski S (2015) Improved medical image modality classification using a combination of visual and textual features. Comput Med Imaging Graph 39:14–26
Douze M, Ramisa A, Schmid C Combining attributes and fisher vectors for efficient image retrieval. In: 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, 2011, pp 745–752
Dye C, Reeder JC, Terry RF (2013) Research for universal health coverage. World Health Organization
El-Naqa I, Yang Y, Galatsanos NP, Nishikawa RM, Wernick MN (2004) A similarity learning approach to content-based image retrieval: application to digital mammography, Medical Imaging. Trans IEEE 23(10):1233–1244
Escalante HJ, Hérnadez CA, Sucar LE, Montes M (2008) Late fusion of heterogeneous methods for multimedia image retrieval. In: Proceedings of the 1st ACM international conference on Multimedia information retrieval, ACM, pp 172–179
Ghosh P, Antani S, Long LR, Thoma GR Review of medical image retrieval systems and future directions. In: 2011 24th International Symposium on Computer-Based Medical Systems (CBMS), IEEE, 2011, pp. 1–6
Gonalves N, Oje E, Ricardo V Document mining combining image exploration and text characterization, note
Guld MO, Kohnen M, Keysers D, Schubert H, Wein B. B, Bredno J, Lehmann T. M Quality of DICOM header information for image categorization. In: SPIE vol. 4685 - Medical Imaging 2002: PACS and Integrated Medical Information Systems: Design and Evaluation, 2002, pp. 280–287
Hearst MA, Divoli A, Guturu H, Ksikes A, Nakov P, Wooldridge MA, Ye J (2007) Biotext search engine: beyond abstract search. Bioinformatics 23(16):2196–2197
Ide NC, Loane RF, Demner-Fushman D (2007) Essie: a concept-based search engine for structured biomedical text. J Am Med Inform Assoc 14(3):253–263
Jegou H, Douze M, Schmid C (2011) Product quantization for nearest neighbor search. IEEE Trans Pattern Anal Mach Intell 33(1):117–128
Kahn Jr CE, Thao C (2007) Goldminer: a radiology image search engine. AJR Am J Roentgenol 188(6):1475–1478
Kalpathy-Cramer J, Müller H, Bedrick S, Eggel I, de Herrera AGS, Tsikrika T (2011) Overview of the clef 2011 medical image classification and retrieval tasks. In: CLEF (Notebook Papers/Labs/Workshop)
Kalpathy-Cramer J, Hersh W (2008) Effectiveness of global features for automatic medical image classification and retrieval–the experiences of ohsu at imageclefmed. Pattern Recogn Lett 29(15):2032–2038
Kitanovski I, Dimitrovski I, Loskovska S (2013) Fcse at medical tasks of imageclef2013. In: CLEF (Online Working Notes/Labs/Workshop)
Kitanovski I, Trojacanec K, Dimitrovski I, Loshkovska S (2013) Merging words and concepts for medical articles retrieval. In: Proceedings of the 10th Conference on Open Research Areas in Information Retrieval, LE CENTRE DE HAUTES ETUDES INTERNATIONALES D’INFORMATIQUE DOCUMENTAIRE, pp 25–28
Kitanovski I, Trojacanec K, Dimitrovski I, Loskovska S (2013) Multimodal medical image retrieval. In: ICT Innovations 2012, Springer, pp 81–89
Kitanovski I, Dimitrovski I, Madjarov G, Loskovska S (2014) Medical image retrieval using multimodal data. In: Discovery Science, Springer, pp 144–155
Kumar A, Kim J, Cai W, Fulham M, Feng D (2013) Content-based medical image retrieval: A survey of applications to multidimensional and multimodality data. J Digit Imaging 26(6):1025–1039
Lehmann TM, Wein BB, Dahmen J, Bredno J, Vogelsang F, Kohnen M Content-based image retrieval in medical applications: a novel multistep approach. In: Proceedings of SPIE: Storage and Retrieval for Media Databases, Vol. 3972, 2000, pp. 312–320
Lin H-T, Lin C-J, Weng RC (2007) A note on Platt’s probabilistic outputs for support vector machines. Mach Learn 68:267–276
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110
Macdonald C, Plachouras V, He B, Lioma C, Ounis I (2006) University of glasgow at webclef 2005: Experiments in per-field normalisation and language specific stemming. In: Accessing Multilingual Information Repositories, Springer, p 898–907
Mazin B, Delon J, Gousseau Y (2012) Combining color and geometry for local image matching. In: 2012 21st International Conference on Pattern Recognition (ICPR), IEEE, pp 2667–2680
Montague M, Aslam JA (2001) Relevance score normalization for metasearch. In: Proceedings of the tenth international conference on Information and knowledge management, ACM, pp 427–433
Müller H, Kalpathy-Cramer J, Kahn Jr JCE, Hersh W Comparing the quality of accessing medical literature using content-based visual and textual information retrieval. In: SPIE Medical Imaging, International Society for Optics and Photonics, 2009, pp. 726405–726405
Müller H, de Herrera AGS, Kalpathy-Cramer J, Demner-Fushman D, Antani S, Eggel I (2012) Overview of the imageclef 2012 medical image retrieval and classification tasks. In: CLEF (Online Working Notes/Labs/Workshop)
Medical retrieval task. http://www.imageclef.org/node/104/, accessed: 2014-07-03
Névéol A, Deserno TM, Darmoni SJ, Güld MO, Aronson AR (2009) Natural language processing versus content-based image analysis for medical document retrieval. J Am Soc Inf Sci Technol 60(1):123–134
Okan Ozturkmenoglu NMC, Alpkocak A (2013) Demir at imageclefmed 2013: The effects of modality classification to information retrieval. In: CLEF (Online Working Notes/Labs/Workshop)
Ounis I, Amati G, Plachouras V, He B, Macdonald C, Johnson D (2005) Terrier information retrieval platform. In: Advances in Information Retrieval, Springer, pp 517–519
Pubmed. http://www.ncbi.nlm.nih.gov/pubmed, accessed: 2015-03-30
Rahman MM, You D, Simpson MS, Antani SK, Demner-Fushman D, Thoma GR (2013) Multimodal biomedical image retrieval using hierarchical classification and modality fusion. International Journal of Multimedia Information Retrieval 2(3):159–173
Science direct. http://www.elsevier.com/online-tools/sciencedirect, accessed: 2015-03-30
Simonyan K, Modat M, Ourselin S, Cash D, Criminisi A, Zisserman A (2012) Immediate roi search for 3-d medical images. In: MICCAI International Workshop on Content-Based Retrieval for Clinical Decision Support
Spyridon Stathopoulos AK, Ismini Lourentzou, Kalamboukis T (2013) Ipl at clef 2013 medical retrieval task. In: CLEF (Online Working Notes/Labs/Workshop)
Tommasi T, Orabona F, Caputo B (2008) Discriminative cue integration for medical image annotation. Pattern Recogn Lett 29(15):1996–2002
van de Sande K, Gevers T, Snoek C (2010) Evaluating color descriptors for object and scene recognition. IEEE Trans Pattern Anal Mach Intell 32(9):1582–1596
van Gemert JC, Veenman CJ, Smeulders AWM, Geusebroek JM Visual word ambiguity, IEEE Transactions on Pattern Analysis and Machine Intelligence 99 (1)
Weiss Y, Torralba A, Fergus R (2009) Spectral hashing. In: Advances in neural information processing systems, pp 1753–1760
Xu S, McCusker J, Krauthammer M (2008) Yale image finder (yif): a new search engine for retrieving biomedical images. Bioinformatics 24(17):1968–1970
Zheng L, Wetzel AW, Gilbertson J, Becich MJ (2003) Design and analysis of a content-based pathology image retrieval system. IEEE Trans Inf Technol Biomed 7(4):249–255
Acknowledgments
We would like to acknowledge the support of the European Commission through the project MAESTRA - Learning from Massive, Incompletely annotated, and Structured Data (Grant number ICT-2013-612944).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Kitanovski, I., Strezoski, G., Dimitrovski, I. et al. Multimodal medical image retrieval system. Multimed Tools Appl 76, 2955–2978 (2017). https://doi.org/10.1007/s11042-016-3261-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-016-3261-1