skip to main content
10.1145/2578153.2578196acmconferencesArticle/Chapter ViewAbstractPublication PagesetraConference Proceedingsconference-collections
research-article

Infusing perceptual expertise and domain knowledge into a human-centered image retrieval system: a prototype application

Published: 26 March 2014 Publication History

Abstract

Traditional content-based image retrieval techniques, which primarily rely on image content at the pixel level, are not effective in accessing images at the semantic level. Defining approaches to incorporate experts' perceptual and conceptual capabilities of image understanding in their domain of expertise into the retrieval processes promises to help bridge this semantic gap. Towards accomplishing this, we design and implement a novel multimodal interactive system for image retrieval. To incorporate human expertise, the system stores expert-derived information extracted from two human sensor modalities that intuitively relate to image search, eye movements and verbal descriptions, both generated by medical experts. Experimental evaluation of the system shows that by transferring experts' perceptual expertise and domain knowledge into image-based computational procedures, our system can take advantage of the different human-centered modalities' respective strengths and improve the retrieval performance over just using image-based features.

References

[1]
Audet, S., Okutomi, M., and Tanaka, M. 2010. Direct image alignment of projector-camera systems with planar surfaces. In Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, IEEE, 303--310.
[2]
Ballerini, L., Li, X., Fisher, R. B., and Rees, J. 2010. A query-by-example content-based image retrieval system of non-melanoma skin lesions. In Medical content-based retrieval for clinical decision support. Springer, 31--38.
[3]
Coddington, J., Xu, J., Sridharan, S., Rege, M., and Bailey, R. 2012. Gaze-based image retrieval system using dual eye-trackers. In Emerging Signal Processing Applications (ESPA), 2012 IEEE International Conference on, IEEE, 37--40.
[4]
Faro, A., Giordano, D., Pino, C., and Spampinato, C. 2010. Visual attention for implicit relevance feedback in a content based image retrieval. In Proceedings of the 2010 Symposium on Eye-Tracking Research & Applications, ACM, 73--76.
[5]
Li, R., Shi, P., and Haake, A. R. 2013. Image understanding from experts eyes by modeling perceptual skill of diagnostic reasoning processes. In Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, IEEE Computer Society, 2187--2194.
[6]
Lowe, D. G. 2004. Distinctive image features from scale-invariant keypoints. International journal of computer vision 60, 2, 91--110.
[7]
Manning, C. D., Raghavan, P., and Schütze, H. 2008. Introduction to information retrieval, vol. 1. Cambridge University Press Cambridge.
[8]
McInnes, B. T., Pedersen, T., and Pakhomov, S. V. 2009. Umls-interface and umls-similarity: open source software for measuring paths and semantic similarity. In AMIA Annual Symposium Proceedings, vol. 2009, American Medical Informatics Association, 431.
[9]
Patel, V. L., Arocha, J. F., and Kaufman, D. R. 1994. Diagnostic reasoning and medical expertise. The psychology of learning and motivation 31, 187--252.
[10]
Squire, D. M., Müller, W., Müller, H., and Raki, J. 1998. Content-based query of image databases, inspirations from text retrieval: inverted files, frequency-based weights and relevance feedback.
[11]
Vaidyanathan, P., Pelz, J., Li, R., Mulpuru, S., Wang, D., Shi, P., Calvelli, C., and Haake, A. 2011. Using human experts' gaze data to evaluate image processing algorithms. In IVMSP Workshop, 2011 IEEE 10th, IEEE, 129--134.
[12]
Westheimer, G. 1972. Visual acuity and spatial modulation thresholds. In Visual psychophysics. Springer, 170--187.
[13]
Womack, K., Alm, C., Calvelli, C., Pelz, J., Shi, P., and Haake, A. 2013. Using linguistic analysis to characterize conceptual units of thought in spoken medical narratives. In Proceedings of Interspeech 2013, Lyon, France, 3722--3726.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ETRA '14: Proceedings of the Symposium on Eye Tracking Research and Applications
March 2014
394 pages
ISBN:9781450327510
DOI:10.1145/2578153
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 March 2014

Check for updates

Author Tags

  1. content-based image retrieval
  2. eye tracking
  3. local invariant features
  4. multi-modal interaction
  5. semantic analysis
  6. visual attention

Qualifiers

  • Research-article

Funding Sources

Conference

ETRA '14
ETRA '14: Eye Tracking Research and Applications
March 26 - 28, 2014
Florida, Safety Harbor

Acceptance Rates

Overall Acceptance Rate 69 of 137 submissions, 50%

Upcoming Conference

ETRA '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)1
Reflects downloads up to 25 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2020)Eye-Tracking as a Method for Enhancing Research on Information SearchUnderstanding and Improving Information Search10.1007/978-3-030-38825-6_9(161-181)Online publication date: 30-May-2020
  • (2017)GEO matching regionsMultimedia Tools and Applications10.1007/s11042-016-3834-z76:14(15377-15411)Online publication date: 1-Jul-2017
  • (2016)Language as Sensor in Human‐Centered Computing: Clinical Contexts as Use CasesLanguage and Linguistics Compass10.1111/lnc3.1217110:3(105-119)Online publication date: 2-Mar-2016
  • (2016)Intelligent medical image grouping through interactive learningInternational Journal of Data Science and Analytics10.1007/s41060-016-0021-22:3-4(95-105)Online publication date: 25-Aug-2016
  • (2016)An Expert-in-the-loop Paradigm for Learning Medical Image GroupingAdvances in Knowledge Discovery and Data Mining10.1007/978-3-319-31753-3_38(477-488)Online publication date: 12-Apr-2016
  • (2015)Multimodal Interactive Machine Learning for User UnderstandingCompanion Proceedings of the 20th International Conference on Intelligent User Interfaces10.1145/2732158.2732166(129-132)Online publication date: 29-Mar-2015
  • (2014)Fusing Multimodal Human Expert Data to Uncover Hidden SemanticsProceedings of the 7th Workshop on Eye Gaze in Intelligent Human Machine Interaction: Eye-Gaze & Multimodality10.1145/2666642.2666649(21-26)Online publication date: 16-Nov-2014
  • (2014)Unsupervised feature approach for content based image retrieval using principal component analysis2014 11th International Computer Conference on Wavelet Actiev Media Technology and Information Processing(ICCWAMTIP)10.1109/ICCWAMTIP.2014.7073406(271-275)Online publication date: Dec-2014

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media