Abstract
Medical social networks have become an exchange of opinions between patients and health professionals. However, patients are anxious to quickly find a reliable analysis and a concise explanation of their medical images and express their queries through a textual description or a visual description or both sets. For this, we present in this paper a multimodal research model to research medical images based on multimedia information that is extracted from a radiological collaborative social network. Indeed, the opinions shared on a medical image in a medico-social network are a textual description which in most cases requires cleaning by using a medical thesaurus. In addition, we describe the textual description and medical image in a TF-IDF weight vector using an approach of “bag of words”. We use latent semantic analysis to establish relationships between textual terms and visual terms in shared opinions on the medical image. The multimodal modeling researches the medical information through multimodal queries. Our model is evaluated against the ImageCLEFMed’2015 baseline, which is the ground truth for our experiments. We have conducted numerous experiments with different descriptors and many combinations of modalities. The analysis of results shows that the model based on two methods can increase the performance of a research system based on a single modality, visual or textual.
Similar content being viewed by others
Notes
Singular value decomposition (SVD).
References
Barnard K, Duygulu P, Freitas JFG, Blei DM, Jordan MI (2003) Matching words and pictures. J Mach Learn Res 3:1107–1135
Bouslimi R, Akaichi J (2015) Automatic medical image annotation on social network of physician collaboration. J Netw Model Anal Health Inform Bioinform 4(10):219–228
Bouslimi R, Akaichi J, Ayadi MG, Hedhli H (2016) A medical collaboration network for medical image analysis. J Netw Model Anal Health Inform Bioinform 5(10):145–165
Chang YC, Chen HH (2008) Using an image-text parallel corpus and the web for query expansion in cross-language image retrieval. In: Workshop of the Cross-Language Evaluation Forum for European Languages CLEF 2007: Advances in multilingual and multimodal information retrieval, Lecture notes in Computer Science (LNCS), vol 5152. Springer, Heidelberg, pp 504–511
Clinchant S, Csurka G, Ah-Pine J (2011) Semantic combination of textual and visual information in multimedia retrieval. In: Proceedings of 1st ACM International Conference on multimedia retrieval, New York, NY, USA
Csurka G, Dance C, Fan L, Willamowski J, Bray C (2004) Visual categorization with bags of keypoints. In: ECCV’04 workshop on statistical learning in computer vision, pp 59–74
de Herrera AGS, Muller H, Bromuri S (2015) Overview of the ImageCLEF 2015 medical classification task. In: Working notes of CLEF 2015 (Cross Language Evaluation Forum)
Duan L, Yuan B, Wu C, Li J, Guo Q (2014) Text-image separation and indexing in historic patent document image based on extreme learning machine. In: Proceedings of ELM-2014 Volume 2, Volume 4 of the series Proceedings in Adaptation, Learning and Optimization, pp 299–307
Duygulu P, Barnard K, Freitas JFG, Forsyth DA (2002) Object recognition as machine translation: learning a lexicon for a fixed image vocabulary. In Proceedings of ECCV’02, Springer-Verlag, London, UK, pp 97–112
Gordo A, Rusinol M, Karatzas D, Bagdanov AD (2013) Document classification and page stream segmentation for digital mailroom applications. In: 2013 12th international conference on document analysis and recognition (ICDAR), IEEE, pp 621–625
Gorgevik D, Cakmakov D (2005) Handwritten digit recognition by combining SVM classifiers. In: The international conference on computer as a tool, EUROCON 2005, IEEE, vol 2, pp 1393–1396
Guillaumin M, Mensink T, Verbeek JJ, Schmid C (2009) TagProp: discriminative metric learning in nearest neighbor models for image auto-annotation. In: Proceedings of ICCV, pp 309–316
Ho TK, Hull JJ, Srihari SN (1994) Decision combination in multiple classifier systems. IEEE Trans Pattern Anal Mach Intell 16(1):66–75
Hofmann T (1998) Learning and representing topic. A hierarchical mixture model for word occurrences in document databases. In: Proceedings of CONALD’98, Pittsburgh
Jeon J, Lavrenko V, Manmatha R (2003) Automatic image annotation and retrieval using cross-media relevance models. In: Proceedings SIGIR’03 (ACM, New York, NY, USA, 2003), pp 119–126
Joachims T (2002) Learning to classify text using support vector machines. In: The springer international series in engineering and computer science, vol 668. Springer, US
Jurie F, Triggs W (2005) Creating efficient codebooks for visual recognition. In: ICCV
Kittler J, Hatef M, Duin RP, Matas J (1998) On combining classifiers. IEEE Trans Pattern Anal Mach Intell 20(3):226–239
Larlus D, Dorkó G, Jurie F (2006) Création de vocabulaires visuels efficaces pour la catégorisation d’images. In: Reconnaissance des Formes et Intelligence Artificielle, Tours
Lavrenko V, Manmatha R, Jeon J (2004) A model for learning the semantics of pictures. NIPS, Mathura, pp 553–560
Lin WC, Chang YC, Chen HH (2007) Integrating textual and visual information for cross-language image retrieval: a trans-media dictionary approach. J Inf Process Manag 43(2):488–502
Liu B (2012) Sentiment analysis and opinion mining. Synth Lect Hum Lang Technol 5(1):1–167
Lowe D (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110
Makadia A, Pavlovic V, Kumar S (2010) Baselines for image annotation. Int J Comput Vis 90:88–105
Matas J, Chum O, Martin U, Pajdla T (2002) Robust wide baseline stereo from maximally stable extremal regions. In: Proceedings of the British machine vision conference, BMVA, pp 384–393
Metzler D, Manmatha R (2004), An inference network approach to image retrieval. In: CIVR’04, pp 42–50
Meüller H, Clough P, Deselaers T, Caputo B (2010) ImageCLEF: experimental evaluation in visual information retrieval, vol 32. Springer, Berlin
Monay F, Gatica-Perez D (2003) On image auto-annotation with latent space models. In: Proceedings of MULTIMEDIA’03 (ACM, New York, NY, USA), pp 275–278
Mori Y, Takahashi H, Oka R (1999) Image-to-word transformation based on dividing and vector quantizing images with words. In: Proceedings of first international workshop multimedia intelligent storage and retrieval management (Orlando, Florida, USA, 1999), pp 405–409
Priyatharshini R, Chitrakala S (2012) Association based image retrieval: a survey. second international joint conference, AIM/CCPE 2012, Bangalore, India, pp 17–26
Rahman NA, Mabni Z, Omar N, Fairuz H, Hanum M, Amirah N Nur, Rahim TM (2015) A parallel latent semantic indexing (LSI) algorithm for malay hadith translated document retrieval. In: First international conference, SCDS 2015, Putrajaya, Malaysia, pp 154–163
Robertson S, Walker S, Hancock-Beaulieu M, Gull A, Lau M, Okapi (1994) at trec-3. In: Text REtrieval conference, pp 21–30
Rusinol M, Lladós J (2009) Logo spotting by a bag-of-words approach for document categorization. In: 2009 10th International conference on document analysis and recognition, IEEE, pp 111–115
Rusinol M, Karatzas D, Bagdanov AD, Lladós J (2012) Multipage document retrieval by textual and visual representations. In: 2012 21st international conference on pattern recognition (ICPR), IEEE, pp 521–524
Salton G, Wong A, Yang C (1975) A vector space model for automatic indexing. Commun ACM 18(11):613–620
Sebastiani F (2002) Machine learning in automated text categorization. ACM Comput Surv 34(1):1– 47
Shekhar R, Jawahar C (2012) Word image retrieval using bag of visual words. In: 2012 10th IAPR international workshop on document analysis systems (DAS), IEEE, pp 297–301
Song W, Uchida S, Liwicki M (2011) Look Inside the world of parts of handwritten characters. In: 2011 international conference on document analysis and recognition (ICDAR), IEEE, pp 784–788
Terrades OR, Valveny E, Tabbone S (2009) Optimal classifier fusion in a non-bayesian probabilistic framework. IEEE Trans Pattern Anal Mach Intell 31(9):1630–1644
Valle E, Cord M, (2009) Advanced techniques in CBIR: local descriptors, visual dictionaries and bags of features. In: Tutorials of the XXII Brazilian symposium on computer graphics and image processing, IEEE, pp 72–78
Vidal-Naquet M, Ullman S (2003) Object recognition with informative features and linear classification. In: ICCV, pp 281–288
Wang S, Pan P, Lu Y, Xie L (2013) Improving cross-modal and multi-modal retrieval combining content and semantics similarities with probabilistic model. J Multimed Tools Appl 74(6):2009–2032
Yang J, Jiang YG, Hauptmann A, Ngo CW (2007) Evaluating bag-of-visual-words representations in scene classification. In: MIR '07: Proceedings of the international workshop on Workshop on multimedia information retrieval, pp 197–206
Ye G, Liu D, Jhuo I.-H., Chang S.-F (2012) Robust late fusion with rank minimization. In: 2012 IEEE conference on computer vision and pattern recognition (CVPR), IEEE, pp 3021–3028
Zhai C (2001) Notes on the lemur TFIDF model. Technical report. Carnegie Mellon University
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Bouslimi, R., Ayadi, M.G. & Akaichi, J. Semantic medical image retrieval in a medical social network. Soc. Netw. Anal. Min. 7, 2 (2017). https://doi.org/10.1007/s13278-016-0420-3
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13278-016-0420-3