skip to main content
10.1145/3240876.3240877acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicimcsConference Proceedingsconference-collections
research-article

Deep click feature based query merging for robust image recognition

Authors Info & Claims
Published:17 August 2018Publication History

ABSTRACT

We address the problem of image representation with user click data, wherein each image is represented as a count vector based on its clicked queries. As the query set obtained from search engines is large-scale and redundant, this image representation is extremely high-dimensional and with low discriminative ability. To deal with this issue, we propose a deep click feature based query clustering approach, and construct a compact and low-dimensional click feature with merged queries. Specially, to learn the deep click feature, we construct a smooth image-click graph instead of the direct image-click vector to represent each query, and use it as the input of the convolutional network. A similarity graph based re-sorting and propagation method is applied to construct the click graph. We evaluate our method on the public Clickture-Dog dataset. Experimental results show that: 1) Query merging with image-click graph outperforms that with image-click vector, since it improves the click-unbalance among categories and captures more structured information; 2) The deep model helps to generate a powerful hierarchical click feature for queries, making an improved clustering result.

References

  1. Rudi L. Cilibrasi and Paul M. B. Vitanyi. 2007. The Google Similarity Distance. IEEE Transactions on Knowledge & Data Engineering 19, 3 (2007), 370--383. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Kamran Ghasedi Dizaji, Amirhossein Herandi, Cheng Deng, Weidong Cai, and Heng Huang. 2017. Deep Clustering via Joint Convolutional Autoencoder Embedding and Relative Entropy Minimization. (2017).Google ScholarGoogle Scholar
  3. Linan Feng and Bir Bhanu. 2016. Semantic Concept Co-occurrence Patterns for Image Annotation and Retrieval. IEEE Trans Pattern Anal Mach Intell 38, 4 (2016), 785--799. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Wu Feng and Dong Liu. 2017. Fine-Grained Image Recognition from Click-Through Logs Using Deep Siamese Network. In International Conference on Multimedia Modeling. 127--138.Google ScholarGoogle ScholarCross RefCross Ref
  5. Xian Sheng Hua, Linjun Yang, Jingdong Wang, Jing Wang, Ming Ye, Kuansan Wang, Yong Rui, and Jin Li. 2013. Clickage: towards bridging semantic and intent gaps via mining click logs of search engines. In ACM International Conference on Multimedia. 243--252. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Hong Shao, Shuang Chen, Jie Yi Zhao, Wen Cheng Cui, and Y. U. Tian-Shu. 2015. Face recognition based on subset selection via metric learning on manifold. Frontiers of Information Technology & Electronic Engineering 16, 12 (2015), 1046--1058.Google ScholarGoogle ScholarCross RefCross Ref
  7. Min Tan, Zhenfang Hu, Baoyuan Wang, Jieyi Zhao, and Yueming Wang. 2016. Robust object recognition via weakly supervised metric and template learning. Neurocomputing 101 (2016), 96--107. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Min Tan, Gang Pan, Yueming Wang, Yuting Zhang, and Zhaohui Wu. 2014. L1-norm latent SVM for compact features in object detection. Neurocomputing 139, 139 (2014), 56--64. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. Tan, B. Wang, Z. Wu, J. Wang, and G. Pan. 2016. Weakly Supervised Metric Learning for Traffic Sign Recognition in a LIDAR-Equipped Vehicle. IEEE Transactions on Intelligent Transportation Systems 17, 5 (2016), 1415--1427. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Min Tan, Yueming Wang, and Gang Pan. 2012. Feature reduction for efficient object detection via l1-norm latent SVM. In Sino-Foreign-Interchange Conference on Intelligent Science and Intelligent Data Engineering. 322--329. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Min Tan, Jun Yu, Qingming Huang, and Weichen Wu. 2018. Click data guided query modeling with click propagation and sparse coding. Multimedia Tools and Applications 3 (2018), 1--14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Min Tan, Jun Yu, Zhou Yu, Fei Gao, Yong Rui, and Dacheng Tao. {n. d.}. User Click Data Based Fine-grained Image Recognition via Weakly Supervised Metric Learning. ACM Transactions on Multimedia Computing Communications and Applications (Accepted) ({n. d.}). Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Min Tan, Jun Yu, Guangjian Zheng, Weichen Wu, and Kejia Sun. 2016. Deep Neural Network Boosted Large Scale Image Recognition Using User Click Data. In International Conference on Internet Multimedia Computing and Service. 118--121. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Weichen Wu, Min Tan, Guangjian Zheng, and Jun Yu. 2017. Query Modeling for Click Data Based Image Recognition Using Graph Based Propagation and Sparse Coding. In International Conference on Internet Multimedia Computing and Service. 191--199.Google ScholarGoogle Scholar
  15. Jianwei Yang, Devi Parikh, and Dhruv Batra. 2016. Joint Unsupervised Learning of Deep Representations and Image Clusters. In Computer Vision and Pattern Recognition. 5147--5156.Google ScholarGoogle Scholar
  16. Jun Yu, Yong Rui, and Bo Chen. 2013. Exploiting Click Constraints and Multi-view Features for Image Re-ranking. IEEE Transactions on Multimedia 16, 1 (2013), 159--168.Google ScholarGoogle ScholarCross RefCross Ref
  17. J. Yu, Y. Rui, and D. Tao. 2014. Click prediction for web image reranking using multimodal sparse coding. Image Processing IEEE Transactions on 23, 5 (2014), 2019--2032.Google ScholarGoogle ScholarCross RefCross Ref
  18. Guangjian Zheng, Min Tan, Jun Yu, Qing Wu, and Jianping Fan. 2017. Fine-grained image recognition via weakly supervised click data guided bilinear CNN model. In IEEE International Conference on Multimedia and Expo. 661--666.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Deep click feature based query merging for robust image recognition

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        ICIMCS '18: Proceedings of the 10th International Conference on Internet Multimedia Computing and Service
        August 2018
        243 pages
        ISBN:9781450365208
        DOI:10.1145/3240876

        Copyright © 2018 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 17 August 2018

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        ICIMCS '18 Paper Acceptance Rate46of116submissions,40%Overall Acceptance Rate163of456submissions,36%
      • Article Metrics

        • Downloads (Last 12 months)2
        • Downloads (Last 6 weeks)0

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader