Skip to main content

Augmenting Image Processing with Social Tag Mining for Landmark Recognition

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6523))

Abstract

Social Multimedia computing is a new approach which combines the contextual information available in the social networks with available multimedia content to achieve greater accuracy in traditional multimedia problems like face and landmark recognition. Tian et al.[12] introduce this concept and suggest various fields where this approach yields significant benefits. In this paper, this approach has been applied to the landmark recognition problem. The dataset of flickr.com was used to select a set of images for a given landmark. Then image processing techniques were applied on the images and text mining techniques were applied on the accompanying social metadata to determine independent rankings. These rankings were combined using models similar to meta search engines to develop an improved integrated ranking system. Experiments have shown that the recombination approach gives better results than the separate analysis.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   99.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Wan, H.L., Chowdhury, M.U.: Image Semantic Classification by Using SVM. Journal of Software 14, 1891–1899 (2003)

    MATH  Google Scholar 

  2. Zhang, D., Wong, A., Indrawan, M., Lu, G.: Content-based Image Retrieval Using Gabor Texture Feature. In: Proceedings of First IEEE Pacific-Rim Conference on Multimedia (PCM 2000), Sydney, Australia, pp. 392–395 (2000)

    Google Scholar 

  3. Lowe, D.: Distinctive Image Features from Scale-Invariant Keypoints. Int’l J. Computer Vision 2(60), 91–110 (2004)

    Article  Google Scholar 

  4. Kennedy, L.S., Naaman, M.: Generating diverse and representative image search results for landmarks. In: Proceeding of the 17th International Conference on World Wide Web, WWW 2008, pp. 297–306. ACM Press, New York (2008)

    Google Scholar 

  5. Beis, J., Lowe, D.G: Shape indexing using approximate nearest-neighbour search in high-dimensional spaces. In: Conference on Computer Vision and Pattern Recognition, pp. 1000–1006, Puerto Rico (1997)

    Google Scholar 

  6. Oztekin, B., Karypis, G., Kumar, V.: Expert Agreement and Content Based Reranking in a Meta-search Environment Using Mearf. In: Proceedings of the 11th International World Wide Web Conference, pp. 333–344, Honolulu, Hawaii, USA, May 7-11 (2002)

    Google Scholar 

  7. Wiguna, W.S., Fernández-Tébar, J.J., García-Serrano, A.: Using a Fuzzy Model for Combining Search Results from Different Information Sources to Build a Metasearch Engine. In: International Conference 9th Fuzzy Daysin Dortmund, Germany, pp. 325–334 (2006)

    Google Scholar 

  8. Zheng, Y., Zhao, M., Song, H., Adam, H., Buddemeier, U., Bissacco, A., Brucher, F., Chua, T., Neven, H.: Tour the World: building a web-scale landmark recognition engine. In: Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR (2009)

    Google Scholar 

  9. Kennedy, L., Naaman, M., Ahern, S., Nair, R., Rattenbury, T.: How flickr helps us make sense of the world: context and content in community-contributed media collections. In: Proceedings of the 15th International Conference on Multimedia, Augsburg, Germany, September 25-29, pp. 631–640 (2007)

    Google Scholar 

  10. Simon, I., Snavely, N., Seitz, S.M.: Scene summarization for online image collections. In: Proceedings of the 11th IEEE International Conference on Computer Vision. IEEE, Los Alamitos (2007)

    Google Scholar 

  11. Tsai, C., Qamra, A., Chang, E.Y., Wang, Y.: Extent: Inferring Image Metadata from Context and Content. In: IEEE International Conference on Multimedia and Expo., pp. 1270–1273 (2005)

    Google Scholar 

  12. Tian, Y., Srivastava, J., Huang, T., Contractor, N.: Social Multimedia Computing. In: Computer IEEE Computer Society Digital Library, June 30. IEEE Computer Society, Los Alamitos (2010)

    Google Scholar 

  13. Ramos, J.: Using TF-IDF to Determine Word Relevance in Document Queries .In: First International Conference on. Machine Learning (2003)

    Google Scholar 

  14. Hoffman, T.: Probabilistic Latent Semantic Indexing. In: Uncertainity in Artificial Intelligence, UAI 1999, Stockholm (1999)

    Google Scholar 

  15. Explore About Interestingness, http://www.flickr.com/explore/interesting

  16. Yager, R.R.: Induced aggregation operators. Fuzzy Sets and Systems 137, 59–69 (2003)

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Mahapatra, A., Wan, X., Tian, Y., Srivastava, J. (2011). Augmenting Image Processing with Social Tag Mining for Landmark Recognition. In: Lee, KT., Tsai, WH., Liao, HY.M., Chen, T., Hsieh, JW., Tseng, CC. (eds) Advances in Multimedia Modeling. MMM 2011. Lecture Notes in Computer Science, vol 6523. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17832-0_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-17832-0_26

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-17831-3

  • Online ISBN: 978-3-642-17832-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics