skip to main content
10.1145/2463676.2465274acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

Inter-media hashing for large-scale retrieval from heterogeneous data sources

Authors Info & Claims
Published:22 June 2013Publication History

ABSTRACT

In this paper, we present a new multimedia retrieval paradigm to innovate large-scale search of heterogenous multimedia data. It is able to return results of different media types from heterogeneous data sources, e.g., using a query image to retrieve relevant text documents or images from different data sources. This utilizes the widely available data from different sources and caters for the current users' demand of receiving a result list simultaneously containing multiple types of data to obtain a comprehensive understanding of the query's results. To enable large-scale inter-media retrieval, we propose a novel inter-media hashing (IMH) model to explore the correlations among multiple media types from different data sources and tackle the scalability issue. To this end, multimedia data from heterogeneous data sources are transformed into a common Hamming space, in which fast search can be easily implemented by XOR and bit-count operations. Furthermore, we integrate a linear regression model to learn hashing functions so that the hash codes for new data points can be efficiently generated. Experiments conducted on real-world large-scale multimedia datasets demonstrate the superiority of our proposed method compared with state-of-the-art techniques.

References

  1. A. Andoni and P. Indyk. Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions. In FOCS, pages 459--468, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. JMLR, 3:993--1022, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. T. Bozkaya and Z. M. Özsoyoglu. Distance-based indexing for high-dimensional metric spaces. In SIGMOD, pages 357--368, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. M. M. Bronstein, A. M. Bronstein, F. Michel, and N. Paragios. Data fusion through cross-modality metric learning using similarity-sensitive hashing. In CVPR, pages 3594--3601, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  5. T.-S. Chua, J. Tang, R. Hong, H. Li, Z. Luo, and Y.-T. Zheng. Nus-wide: A real-world web image database from national university of singapore. In CIVR, pages 48:1--48:9, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. P. Ciaccia, M. Patella, and P. Zezula. M-tree: An efficient access method for similarity search in metric spaces. In VLDB, pages 426--435, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M. Datar, N. Immorlica, P. Indyk, and V. S. Mirrokni. Locality-sensitive hashing scheme based on p-stable distributions. In SCG, pages 253--262, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. R. Datta, D. Joshi, J. Li, and J. Z. Wang. Image retrieval: Ideas, influences, and trends of the new age. ACM Comput. Surv., 40(2), 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. K. Deschacht and M.-F. Moens. Finding the best picture: Cross-media retrieval of content. In ECIR, pages 539--546, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. J. Gan, J. Feng, Q. Fang, and W. Ng. Locality-sensitive hashing scheme based on dynamic collision counting. In SIGMOD, pages 541--552. ACM, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Y. Gong and S. Lazebnik. Iterative quantization: A procrustean approach to learning binary codes. In CVPR, pages 817--824, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. D. R. Hardoon, S. R. Szedmak, and J. R. Shawe-taylor. Canonical correlation analysis: An overview with application to learning methods. Neural Computing, 16(12):2639--2664, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. X. He and P. Niyogi. Locality preserving projections. In NIPS, 2003.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. J.-P. Heo, Y. Lee, J. He, S.-F. Chang, and S.-E. Yoon. Spherical hashing. In CVPR, pages 2957--2964, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Z. Huang, H. T. Shen, J. Liu, and X. Zhou. Effective data co-reduction for multimedia similarity search. In SIGMOD Conference, pages 1021--1032, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. H. V. Jagadish, B. C. Ooi, K.-L. Tan, C. Yu, and R. Z. 0003. idistance: An adaptive b+-tree based indexing method for nearest neighbor search. TODS, 30(2):364--397, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. P. Jain, B. Kulis, and K. Grauman. Fast image search for learned metrics. In CVPR, pages 1--8, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  18. W. Kong and W.-J. Li. Isotropic hashing. In NIPS, pages 1655--1663, 2012.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. W. Kong, W.-J. Li, and M. Guo. Manhattan hashing for large-scale image retrieval. In SIGIR, pages 45--54, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. B. Kulis and T. Darrell. Learning to hash with binary reconstructive embeddings. NIPS, 22:1042--1050, 2009.Google ScholarGoogle Scholar
  21. S. Kumar and R. Udupa. Learning hash functions for cross-view similarity search. In IJCAI, pages 1360--1365, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Y.-Y. Lin, T.-L. Liu, and H.-T. Chen. Semantic manifold learning for image retrieval. In ACM MM, pages 249--258, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. J. Liu, C. Xu, and H. Lu. Cross-media retrieval: state-of-the-art and open issues. IJMIS, 1(1):33--52, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  24. Q. Lv, W. Josephson, Z. Wang, M. Charikar, and K. Li. Multi-probe lsh: Efficient indexing for high-dimensional similarity search. In VLDB, pages 950--961, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. F. Nie, D. Xu, I. W.-H. Tsang, and C. Zhang. Flexible manifold embedding: a framework for semi-supervised and unsupervised dimension reduction. TIP, 19(7):1921--1932, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. H. T. Shen, B. C. Ooi, and X. Zhou. Towards effective indexing for very large video sequence database. In SIGMOD, pages 730--741, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. J. Song, Y. Yang, Z. Huang, H. T. Shen, and R. Hong. Multiple feature hashing for real-time large scale near-duplicate video retrieval. In ACM Multimedia, pages 423--432, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. C. Strecha, A. M. Bronstein, M. M. Bronstein, and P. Fua. Ldahash: Improved matching with smaller descriptors. TPAMI, 34(1):66--78, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Y. Tao, K. Yi, C. Sheng, and P. Kalnis. Efficient and accurate nearest neighbor and closest pair search in high-dimensional space. TODS, 35(3), 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. J. Wang, O. Kumar, and S.-F. Chang. Semi-supervised hashing for scalable image retrieval. In CVPR, pages 3424--3431, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  31. R. Weber, H.-J. Schek, and S. Blott. A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces. In VLDB, pages 194--205, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Y. Weiss, A. Torralba, and R. Fergus. Spectral hashing. In NIPS, pages 1753--1760, 2008.Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Y. Yang, D. Xu, F. Nie, J. Luo, and Y. Zhuang. Ranking with local regression and global alignment for cross media retrieval. In ACM Multimedia, pages 175--184, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Y. Yang, Y. Yang, Z. Huang, H. T. Shen, and F. Nie. Tag localization with spatial correlations and joint group sparsity. In CVPR, pages 881--888, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Y. Yang, Y. Zhuang, F. Wu, and Y. Pan. Harmonizing hierarchical manifolds for multimedia document semantics understanding and cross-media retrieval. TMM, 10(3):437--446, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. D. Zhang, J. Wang, D. Cai, and J. Lu. Self-taught hashing for fast similarity search. In SIGIR, pages 18--25, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Z. Zhou, Y. Tian, Y. Li, T. Huang, and W. Gao. Large-scale cross-media retrieval of wikipediamm images with textual and visual query expansion. In CLEF, pages 763--770, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Y. Zhuang, Y. Yang, and F. Wu. Mining semantic correlation of heterogeneous multimedia data for cross-media retrieval. TMM, 10(2):221--229, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Inter-media hashing for large-scale retrieval from heterogeneous data sources

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          SIGMOD '13: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
          June 2013
          1322 pages
          ISBN:9781450320375
          DOI:10.1145/2463676

          Copyright © 2013 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 22 June 2013

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          SIGMOD '13 Paper Acceptance Rate76of372submissions,20%Overall Acceptance Rate785of4,003submissions,20%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader