ABSTRACT
Digital libraries have now become by far the preferred method for storing large image collections, making it increasingly important to improve issues of search and discovery within these databases. Content-based annotations of images, for classification purposes, have been a particularly significant area of research; however, even the best of such methods are prone to error -- potentially associating incorrect text to image features and overlooking significant features. In the broader field of classification and search, Newman et al. demonstrated benefits of using statistical topic models to enrich document metadata. [Newman et al. 2007] This assumes that a document can be described with sufficient accuracy using a "bag of words" model, i.e. described by the words it contains (instances from a large vocabulary of words), and the frequency with which they occur. [Horster et al. 2007] applied this idea to large-scale image datasets, using SIFT features as data for creating a topic model.
Supplemental Material
Available for Download
- Blei, D., Ng, A., Jordan, M., and Lafferty, J. 2003. Latent dirichlet allocation. Journal of Machine Learning Research 3, 993--1022. Google ScholarDigital Library
- Duda, R., and Hart, P. 1972. Use of the hough transformation to detect lines and curves in pictures. Commun. ACM 15, 11--15. Google ScholarDigital Library
- Horster, E., Lienhart, R., and Slaney, M. 2007. Image retrieval on large-scale image databases. In CIVR '07: Proc. of the 6th ACM international conference on image and video retrieval, 17--24. Google ScholarDigital Library
- Newman, D., Hagedorn, K., Chemudugunta, C., and Smyth, P. 2007. Subject metadata enrichment using statistical topic models. In JCDL '07: Proceedings of the 7th ACM/IEEECS Joint Conference on Digital Libraries, 366--375. Google ScholarDigital Library
- Nister, D., and Stewenius, H. 2006. Scalable recognition with a vocabulary tree. In CVPR '06: Proc. of the IEEE Conference on Comp. Vision and Pattern Recognition, vol. 2, 2161--8. Google ScholarDigital Library
Index Terms
Using statistical topic models to organize and visualize large-scale architectural image databases
Recommendations
Statistical word sense aware topic models
LDA has been proved effective in modeling the semantic relation between surface words. This semantic information in the document collection is useful to measure the topic distribution for a document. In general, a surface word may significantly ...
Multilingual Topic Models for Bilingual Dictionary Extraction
A machine-readable bilingual dictionary plays a crucial role in many natural language processing tasks, such as statistical machine translation and cross-language information retrieval. In this article, we propose a framework for extracting a bilingual ...
Knowledge-based word sense disambiguation using topic models
AAAI'18/IAAI'18/EAAI'18: Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence and Thirtieth Innovative Applications of Artificial Intelligence Conference and Eighth AAAI Symposium on Educational Advances in Artificial IntelligenceWord Sense Disambiguation is an open problem in Natural Language Processing which is particularly challenging and useful in the unsupervised setting where all the words in any given text need to be disambiguated without using any labeled data. Typically ...
Comments