Skip to main content

Cross-Modal Information Retrieval – A Case Study on Chinese Wikipedia

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7713))

Abstract

Probability models have been used in cross-modal multimedia information retrieval recently by building conjunctive models bridging the text and image components. Previous studies have shown that cross-modal information retrieval system using the topic correlation model (TCM) outperforms state-of-the-art models in English corpus. In this paper, we will focus on the Chinese language, which is different from western languages composed by alphabets. Words and characters will be chosen as the basic structural units of Chinese, respectively. We also set up a test database, named Ch-Wikipedia, in which documents with paired image and text are extracted from Chinese website of Wikipedia. We investigate the problems of retrieving texts (ranked by semantic closeness) given an image query, and vice versa. The capabilities of the TCM model is verified by experiments across the Ch-Wikipedia dataset.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Blei, D.M., Ng, A., Jordan, M.: Latent Dirichlet allocation. Journal of Machine Learning Research 3, 993–1022 (2003)

    MATH  Google Scholar 

  2. Blei, D.M., Lafferty, J.D.: Topic models. Chapman & Hall/CRC Data Mining and Knowledge Discovery Series (2009)

    Google Scholar 

  3. Barnard, K., Duygulu, P., Forsyth, D., Freitas, N., Blei, D., Jordan, M.: Matching words and pictures. Journal of Machine Learning Research 3, 1107–1135 (2003)

    MATH  Google Scholar 

  4. Jeon, J., Lavreko, V., Manmatha, R.: Automatic image annotation and retrieval using cross-media relevance models. In: ACM SIGIR Conf. Research and Development in Information Retrieval, New York, pp. 119–126 (2003)

    Google Scholar 

  5. Jiang, Y., Ngo, C., Yang, J.: Towards optimal Bag-of-features for object categorization and semantic video retrieval. In: CIVR, pp. 494–501 (2007)

    Google Scholar 

  6. Metzler, D., Manmatha, R.: An inference network approach to image retrieval. In: Image and Video Retrieval, pp. 42–50 (2005)

    Google Scholar 

  7. Qin, Z., Thint, M., Huang, Z.: Ranking Answers by Hierarchical Topic Models. In: Chien, B.-C., Hong, T.-P., Chen, S.-M., Ali, M. (eds.) IEA/AIE 2009. LNCS, vol. 5579, pp. 103–112. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  8. Rasiwasia, N., Pereira, J.C., Coviello, E., Doyle, G., Lanckriet, G.R.G., Levy, R., Vasconcelos, N.: A new approach to cross-modal multimedia retrieval. In: ACM Multimedia (MM), pp. 251–260 (2010)

    Google Scholar 

  9. Rasiwasia, N., Moreno, P., Vasconcelos, N.: Bridging the gap: Query by semantic example. IEEE Transactions on Multimedia 9(5), 923–938 (2007)

    Article  Google Scholar 

  10. Schmid, C., Mikolajczyk, K.: A performance evaluation of local descriptors. In: ICPR, vol. 2, pp. 257–263 (2003)

    Google Scholar 

  11. Snoek, C., Worring, M.: Multimodal video indexing: A review of the state-of-the-art. Multimedia Tools and Applications 25(1), 5–35 (2005)

    Article  Google Scholar 

  12. Westerveld, T.: Probabilistic multimedia retrieval. ACM 25, 438 (2002)

    Google Scholar 

  13. Xu, T.Q.: Fundamental structural principles of Chinese semantic syntax in terms of Chinese Characters. Applied Linguistics 1, 3–13 (2001) (in Chinese)

    Google Scholar 

  14. Yu, J., Cong, Y., Qin, Z., Wan, T.: Cross-modal topic correlations for multimedia retrieval. To appear in ICPR (2012)

    Google Scholar 

  15. Yuan, X., Yu, J., Qin, Z., Wan, T.: A SIFT-LBP image retrieval model based on bag-of features. In: International Conference on Image Processing (ICIP), pp. 1061–1064 (2011)

    Google Scholar 

  16. Zhang, Y., Qin, Z.: A topic model of observing Chinese characters. In: Proceedings of the International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC), vol. 2, pp. 7–10 (2010)

    Google Scholar 

  17. Zhao, Q., Qin, Z., Wan, T.: What Is the Basic Semantic Unit of Chinese Language? A Computational Approach Based on Topic Models. In: Kanazawa, M., Kornai, A., Kracht, M., Seki, H. (eds.) MOL 12. LNCS, vol. 6878, pp. 143–157. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  18. Zhao, Q., Qin, Z., Wan, T.: Topic Modeling of Chinese Language Using Character-Word Relations. In: Lu, B.-L., Zhang, L., Kwok, J. (eds.) ICONIP 2011, Part III. LNCS, vol. 7064, pp. 139–147. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Cong, Y., Qin, Z., Yu, J., Wan, T. (2012). Cross-Modal Information Retrieval – A Case Study on Chinese Wikipedia. In: Zhou, S., Zhang, S., Karypis, G. (eds) Advanced Data Mining and Applications. ADMA 2012. Lecture Notes in Computer Science(), vol 7713. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35527-1_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-35527-1_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-35526-4

  • Online ISBN: 978-3-642-35527-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics