skip to main content
10.1145/2671188.2749413acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
research-article
Best Demo

Incremental Multimodal Query Construction for Video Search

Authors Info & Claims
Published:22 June 2015Publication History

ABSTRACT

Recent improvements in content-based video search have led to systems with promising accuracy, thus opening up the possibility for interactive content-based video search to the general public. We present an interactive system based on a state-of-the-art content-based video search pipeline which enables users to do multimodal text-to-video and video-to-video search in large video collections, and to incrementally refine queries through relevance feedback and model visualization. Also, the comprehensive functionalities enhance a flexible formulation of multimodal queries with different characteristics. Quantitative and qualitative analysis shows that our system is capable of assisting users to incrementally build effective queries over complex event topics.

References

  1. P. Natarajan, S. Wu, S. Vitaladevuni, X. Zhuang, S. Tsakalidis, U. Park, and R. Prasad. Multimodal feature fusion for robust event detection in web videos. In CVPR, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. A. Tamrakar, S. Ali, Q. Yu, J. Liu, O. Javed, A. Divakaran, H. Cheng, and H. Sawhney. Evaluation of low-level features and their combinations for complex event detection in open source videos. In CVPR, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  3. S.-I. Yu, L. Jiang, Z. Mao, et al. Cmu-informedia @ trecvid. In TRECVID Video Retrieval Evaluation Workshop, 2014.Google ScholarGoogle Scholar
  4. A. Habibian, M. Mazloom, and C. G. Snoek. On-the-fly video event search by semantic signatures. In Proceedings of International Conference on Multimedia Retrieval. ACM, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. A. G. Hauptmann, M. G. Christel, and R. Yan. Video retrieval based on semantic concepts. Proceedings of the IEEE, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  6. L. Jiang, D. Meng, T. Mitamura, and A. G. Hauptmann. Easy samples first: self-paced reranking for zero-example multimedia search. In Proceedings of the ACM International Conference on Multimedia, pages 547--556. ACM, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. L. Jiang, D. Meng, S.-I. Yu, Z. Lan, S. Shan, and A. Hauptmann. Self-paced learning with diversity. In Advances in Neural Information Processing Systems 27. 2014.Google ScholarGoogle Scholar
  8. A. Karpathy, G. Toderici, S. Shetty, T. Leung, R. Sukthankar, and L. Fei-Fei. Large-scale video classification with convolutional neural networks. In CVPR, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In Advances in Neural Information Processing Systems, 2013.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. P. Over, G. Awad, J. Fiscus, and G. Sanders. Trecvid 2013 - an introduction to the goals, tasks, data, evaluation mechanisms, and metrics. TRECVID Workshop, 2013.Google ScholarGoogle Scholar
  11. S. Strassel, A. Morris, J. G. Fiscus, C. Caruso, H. Lee, P. Over, J. Fiumara, B. Shaw, B. Antonishek, and M. Michel. Creating havic: Heterogeneous audio visual internet collection. In LREC. Citeseer, 2012.Google ScholarGoogle Scholar
  12. Y. Miao, F. Metze, and S. Rawat. Deep maxout networks for low-resource speech recognition. In ASRU, 2013.Google ScholarGoogle ScholarCross RefCross Ref
  13. H. Wang and C. Schmid. Action recognition with improved trajectories. In IEEE International Conference on Computer Vision, Sydney, Australia, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. C.-C. Chang and C.-J. Lin. LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. H. Jegou, M. Douze, and C. Schmid. Product quantization for nearest neighbor search. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Incremental Multimodal Query Construction for Video Search

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          ICMR '15: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval
          June 2015
          700 pages
          ISBN:9781450332743
          DOI:10.1145/2671188

          Copyright © 2015 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 22 June 2015

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Badges

          Qualifiers

          • research-article

          Acceptance Rates

          ICMR '15 Paper Acceptance Rate48of127submissions,38%Overall Acceptance Rate254of830submissions,31%

          Upcoming Conference

          ICMR '24
          International Conference on Multimedia Retrieval
          June 10 - 14, 2024
          Phuket , Thailand

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader