Abstract
Social Multimedia computing is a new approach which combines the contextual information available in the social networks with available multimedia content to achieve greater accuracy in traditional multimedia problems like face and landmark recognition. Tian et al.[12] introduce this concept and suggest various fields where this approach yields significant benefits. In this paper, this approach has been applied to the landmark recognition problem. The dataset of flickr.com was used to select a set of images for a given landmark. Then image processing techniques were applied on the images and text mining techniques were applied on the accompanying social metadata to determine independent rankings. These rankings were combined using models similar to meta search engines to develop an improved integrated ranking system. Experiments have shown that the recombination approach gives better results than the separate analysis.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Wan, H.L., Chowdhury, M.U.: Image Semantic Classification by Using SVM. Journal of Software 14, 1891–1899 (2003)
Zhang, D., Wong, A., Indrawan, M., Lu, G.: Content-based Image Retrieval Using Gabor Texture Feature. In: Proceedings of First IEEE Pacific-Rim Conference on Multimedia (PCM 2000), Sydney, Australia, pp. 392–395 (2000)
Lowe, D.: Distinctive Image Features from Scale-Invariant Keypoints. Int’l J. Computer Vision 2(60), 91–110 (2004)
Kennedy, L.S., Naaman, M.: Generating diverse and representative image search results for landmarks. In: Proceeding of the 17th International Conference on World Wide Web, WWW 2008, pp. 297–306. ACM Press, New York (2008)
Beis, J., Lowe, D.G: Shape indexing using approximate nearest-neighbour search in high-dimensional spaces. In: Conference on Computer Vision and Pattern Recognition, pp. 1000–1006, Puerto Rico (1997)
Oztekin, B., Karypis, G., Kumar, V.: Expert Agreement and Content Based Reranking in a Meta-search Environment Using Mearf. In: Proceedings of the 11th International World Wide Web Conference, pp. 333–344, Honolulu, Hawaii, USA, May 7-11 (2002)
Wiguna, W.S., Fernández-Tébar, J.J., GarcÃa-Serrano, A.: Using a Fuzzy Model for Combining Search Results from Different Information Sources to Build a Metasearch Engine. In: International Conference 9th Fuzzy Daysin Dortmund, Germany, pp. 325–334 (2006)
Zheng, Y., Zhao, M., Song, H., Adam, H., Buddemeier, U., Bissacco, A., Brucher, F., Chua, T., Neven, H.: Tour the World: building a web-scale landmark recognition engine. In: Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR (2009)
Kennedy, L., Naaman, M., Ahern, S., Nair, R., Rattenbury, T.: How flickr helps us make sense of the world: context and content in community-contributed media collections. In: Proceedings of the 15th International Conference on Multimedia, Augsburg, Germany, September 25-29, pp. 631–640 (2007)
Simon, I., Snavely, N., Seitz, S.M.: Scene summarization for online image collections. In: Proceedings of the 11th IEEE International Conference on Computer Vision. IEEE, Los Alamitos (2007)
Tsai, C., Qamra, A., Chang, E.Y., Wang, Y.: Extent: Inferring Image Metadata from Context and Content. In: IEEE International Conference on Multimedia and Expo., pp. 1270–1273 (2005)
Tian, Y., Srivastava, J., Huang, T., Contractor, N.: Social Multimedia Computing. In: Computer IEEE Computer Society Digital Library, June 30. IEEE Computer Society, Los Alamitos (2010)
Ramos, J.: Using TF-IDF to Determine Word Relevance in Document Queries .In: First International Conference on. Machine Learning (2003)
Hoffman, T.: Probabilistic Latent Semantic Indexing. In: Uncertainity in Artificial Intelligence, UAI 1999, Stockholm (1999)
Explore About Interestingness, http://www.flickr.com/explore/interesting
Yager, R.R.: Induced aggregation operators. Fuzzy Sets and Systems 137, 59–69 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mahapatra, A., Wan, X., Tian, Y., Srivastava, J. (2011). Augmenting Image Processing with Social Tag Mining for Landmark Recognition. In: Lee, KT., Tsai, WH., Liao, HY.M., Chen, T., Hsieh, JW., Tseng, CC. (eds) Advances in Multimedia Modeling. MMM 2011. Lecture Notes in Computer Science, vol 6523. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17832-0_26
Download citation
DOI: https://doi.org/10.1007/978-3-642-17832-0_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17831-3
Online ISBN: 978-3-642-17832-0
eBook Packages: Computer ScienceComputer Science (R0)