ABSTRACT
We propose a new method to select relevant images to the given keywords from images gathered from theWeb based on the Probabilistic Latent Semantic Analysis (PLSA) model which is a probabilistic latent topic model originally proposed for text document analysis. The experimental results shows that the results by the proposed method is almost equivalent to or outperforms the results by existing methods. In addition, it is proved that our method can select more various images compared to the existing SVM-based methods.
- Fergus, L. Fei-Fei, P. Perona, and A. Zisserman. Learning object categories from google's image search. In Proc. of IEEE International Conference on Computer Vision, pages 1816--1823, 2005. Google ScholarDigital Library
- G. Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60(2):91--110, 2004. Google ScholarDigital Library
- Monay and D. Gatica-Perez. Modeling semantic aspects for cross-media image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 29(10):1802--1817, 2007. Google ScholarDigital Library
- Yanai. Generic image classification using visual knowledge on the web. In Proc. of ACM International Conference Multimedia, pages 67--76, 2003. Google ScholarDigital Library
- Yanai. Image collector III: A web image-gathering system with bag-of-keypoints. In Proc. of the International World Wide Web Conference, 2007. Google ScholarDigital Library
- Yanai and K. Barnard. Probabilistic Web image gathering. In Proc. of ACM SIGMM International Workshop on Multimedia Information Retrieval, pages 57--64, 2005. Google ScholarDigital Library
Index Terms
- Automatic web image selection with a probabilistic latent topic model
Recommendations
Structural topic model for latent topical structure analysis
HLT '11: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1Topic models have been successfully applied to many document analysis tasks to discover topics embedded in text. However, existing topic models generally cannot capture the latent topical structures in documents. Since languages are intrinsically ...
Topic-based Amharic text summarization with probabilistic latent semantic analysis
MEDES '12: Proceedings of the International Conference on Management of Emergent Digital EcoSystemsThis paper investigates the problem of building a concept-based single-document Amharic text summarization system. Because local languages like Amharic lack extensive linguistic resources, we propose to use statistical approaches called topic modeling ...
Cross-lingual latent topic extraction
ACL '10: Proceedings of the 48th Annual Meeting of the Association for Computational LinguisticsProbabilistic latent topic models have recently enjoyed much success in extracting and analyzing latent topics in text in an unsupervised way. One common deficiency of existing topic models, though, is that they would not work well for extracting cross-...
Comments