skip to main content
10.1145/3297001.3297007acmotherconferencesArticle/Chapter ViewAbstractPublication PagescodsConference Proceedingsconference-collections
research-article

An Approach for Multimodal Medical Image Retrieval using Latent Dirichlet Allocation

Authors Info & Claims
Published:03 January 2019Publication History

ABSTRACT

Modern medical practices are increasingly dependent on Medical Imaging for clinical analysis and diagnoses of patient illnesses. A significant challenge when dealing with the extensively available medical data is that it often consists of heterogeneous modalities. Existing works in the field of Content based medical image retrieval (CBMIR) have several limitations as they focus mainly on visual or textual features for retrieval. Given the unique manifold of medical data, we seek to leverage both the visual and textual modalities to improve the image retrieval. We propose a Latent Dirichlet Allocation (LDA) based technique for encoding the visual features and show that these features effectively model the medical images. We explore early fusion and late fusion techniques to combine these visual features with the textual features. The proposed late fusion technique achieved a higher mAP than the state-of-the-art on the ImageCLEF 2009 dataset, underscoring its suitability for effective multimodal medical image retrieval.

References

  1. David M Blei, Andrew Y Ng, and Michael I Jordan. 2003. Latent dirichlet allocation. Journal of machine Learning research 3, Jan (2003), 993--1022. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Yu Cao, Shawn Steffey, Jianbiao He, Degui Xiao, Cui Tao, Ping Chen, and Henning Müller. 2014. Medical image retrieval: a multimodal approach. Cancer informatics 13 (2014), CIN-S14053.Google ScholarGoogle Scholar
  3. Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. Imagenet: A large-scale hierarchical image database. In Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on. Ieee, 248--255.Google ScholarGoogle ScholarCross RefCross Ref
  4. Hayit Greenspan and Adi T Pinhas. 2007. Medical image categorization and retrieval for PACS using the GMM-KL framework. IEEE Transactions on Information Technology in Biomedicine 11, 2 (2007). Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Thomas Hofmann. 1999. Probabilistic latent semantic analysis. In Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence. Morgan Kaufmann Publishers Inc., 289--296. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Yonggang Huang, Jun Zhang, Yongwang Zhao, and Dianfu Ma. 2010. Medical image retrieval with query-dependent feature fusion based on one-class SVM. In Computational Science and Engineering (CSE), 2010 IEEE 13th International Conference on. IEEE, 176--183. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Thomas Lehmann et al. 2003. The IRMA project: A state of the art report on content-based image retrieval in medical applications. In Korea-Germany Workshop on Advanced Medical Image. 161--171.Google ScholarGoogle Scholar
  8. Rainer Lienhart, Stefan Romberg, and Eva Hörster. 2009. Multilayer pLSA for multimodal image retrieval. In Proceedings of the ACM International Conference on Image and Video Retrieval. ACM, 9. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. David G Lowe. 2004. Distinctive image features from scale-invariant keypoints. International journal of computer vision 60, 2 (2004), 91--110. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Sandy A Napel, Christopher F Beaulieu, Cesar Rodriguez, Jingyu Cui, Jiajing Xu, Ankit Gupta, Daniel Korenblum, Hayit Greenspan, Yongjun Ma, and Daniel L Rubin. 2010. Automated retrieval of CT images of liver lesions on the basis of image similarity: method and preliminary results. Radiology 256, 1 (2010), 243--252.Google ScholarGoogle ScholarCross RefCross Ref
  11. Trong-Ton Pham, Nicolas Eric Maillot, Joo-Hwee Lim, and J Chevallet. 2007. Latent semantic fusion model for image retrieval and annotation. In Proceedings of the 16th ACM conference on Conference on information and knowledge management. ACM, 439--444. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Joel Pyykkö and Dorota Głowacka. 2016. Interactive content-based image retrieval with deep neural networks. In International Workshop on Symbiotic Interaction. Springer, 77--88.Google ScholarGoogle Scholar
  13. Adnan Qayyum, Syed Muhammad Anwar, Muhammad Awais, and Muhammad Majid. 2017. Medical image retrieval using deep convolutional neural network. Neurocomputing 266 (2017), 8--20. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Md Mahmudur Rahman, Sameer K Antani, Rodney L Long, Dina Demner-Fushman, and George R Thoma. 2009. Multi-modal query expansion based on local analysis for medical image retrieval. In MICCAI International Workshop on Medical Content-Based Retrieval for Clinical Decision Support. Springer, 110--119. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).Google ScholarGoogle Scholar
  16. Xiaoying Tai and Weihua Song. 2007. An improved approach based on FCM using feature fusion for medical image retrieval. In Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on, Vol. 2. IEEE, 336--342. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. An Approach for Multimodal Medical Image Retrieval using Latent Dirichlet Allocation

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            CODS-COMAD '19: Proceedings of the ACM India Joint International Conference on Data Science and Management of Data
            January 2019
            380 pages
            ISBN:9781450362078
            DOI:10.1145/3297001

            Copyright © 2019 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 3 January 2019

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article
            • Research
            • Refereed limited

            Acceptance Rates

            CODS-COMAD '19 Paper Acceptance Rate62of198submissions,31%Overall Acceptance Rate197of680submissions,29%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader