skip to main content
10.1145/1631058.1631069acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

From text question-answering to multimedia QA on web-scale media resources

Authors Info & Claims
Published:23 October 2009Publication History

ABSTRACT

With the proliferation of text and multimedia information, users are now able to find answers to almost any questions on the Web. Meanwhile, they are also bewildered by the huge amount of information routinely presented to them. Question-answering (QA) is a natural direction to address this information over-loading problem. The aim of QA is to return precise answers to users' questions. Text-based QA research has been carried out for the past 15 years with good success especially for answering fact-based questions. The aim of this paper is to extend the text-based QA research to multimedia QA to tackle a range of factoid, definition and "how-to" QA in a common framework. The system will be designed to find multimedia answers from Web-scale media resources such as Flicker and YouTube. This paper describes the architecture and our recent research on various types of multimedia QA for a range of applications. The paper also discusses directions for future research.

References

  1. A. P. Natsev, A. Haubold, J. Tesic, L. Xie, and R. Yan, Semantic concept-based query expansion and re-ranking for multimedia retrieval, ACM Multimedia, pp. 991--1000, Augsburg, Germany, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Cees G. M. Snoek and Marcel Worring, Concept-Based Video Retrieval, Foundations and Trends in Information Retrieval, vol. 4, iss. 2, 215--322, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. C. Fellbaum, ed., WordNet: An Electronic Lexical Database. Cambridge, USA: The MIT Press, 1998.Google ScholarGoogle Scholar
  4. D. Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer, 60:91--110, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Dave Kor and Tat-Seng Chua. Interesting Nuggets and Their Impact on Definitional Question Answering. ACM SIGIR 2007. Amsterdam, Netherlands. July 2007. 335--342. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. eHow: http://www.ehow.com/videos.htmlGoogle ScholarGoogle Scholar
  7. Hui Yang and Tat-Seng Chua, Shuguang Wang and Chun-Keat Koh. Structured use of external knowledge for event-based open-domain question--answering. 26th Int'l ACM SIGIR Conference' 03. Canada, Jul/Aug 2003. 33--40. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Hang Cui, Min-Yen Kan and Tat--Seng Chua. Soft Pattern Matching Models for Definitional Question Answering. ACM Transactions on Information Systems (ACM TOIS). Vol 25(2), April 2007. 30 pages. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. John M. Prager: Open-Domain Question-Answering. Foundations and Trends in Information Retrieval 1(2): 91--231 (2006) Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Jinwei Cao, Jay F. Nunamaker. Question Answering On Lecture Videos: A Multifaceted Approach, ACM/IEEE Joint Conference on Digital Libraries, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Kai Wang, Zhaoyan Ming, Tat-Seng Chua. A Syntactic Tree Matching Approach to Finding Similar Questions in Community-based QA Services. To appear in ACM SIGIR 2009, Boston, Massachusetts, USA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. K.-Y. Chen, L. Luesukprasert, and S. T. Chou. Hot topic extraction based on timeline analysis and multidimensional sentence modeling. IEEE transactions on knowledge and data engineering. 19(8):1016--1025. 2007 Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. M. Cha, H. Kwak, P. Rodriguez, YY. Ahn, and S. Moon. I tube, you tube, everybody tubes: analyzing the world's largest user generated content video system. Proceedings of the 7th ACM SIGCOMM conference on Internet measurement. San Diego, California, USA. 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Powerset: a commercial factoid-based search engine that was acquired by Microsoft. See http://www.powerset.com/Google ScholarGoogle Scholar
  15. Guangda Li, Zhaoyan Ming, Haojie Li, Yantao Zheng, Tat-Seng Chua. Video reference: question answering on YouTube. To appear in ACM Multimedia 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. R. Hong, J. Tang, H. Tan, S. Yan, C.-W. Ngo, T.-C. Chua. Event driven summarization for web videos. Submitted to ACM Multimedia 1st Workshop on Social Media (ACM-MM-WSM 2009). Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Shiren Ye and Tat-Seng Chua and Jie Lu. Summarizing Definition fromWikipedia. To appear in ACL'09, Singapore. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. S.-Y. Neo, J. Zhao, M.-Y. Kan, and T.-S. Chua, Video retrieval using high level features: Exploiting query matching and confidence-based weighting, in CIVR, (H. Sundaram et al., eds.), pp. 143--152, Heidelberg, Germany: Springer-Verlag, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. S.-F. Chang, W. Hsu, W. Jiang, L.S. Kennedy, D. Xu, A. Yanagawa and E. Zavesky. Columbia University TRECVID-2006 video search and high-level feature extraction. Proceedings of TRECVID Workshop, Gaithersburg, USA, 2006.Google ScholarGoogle Scholar
  20. TREC: The Text Retrieval Conference. See http://trec.nist.gov/.Google ScholarGoogle Scholar
  21. TRECVID: a video evaluation forum organized in conjunction with TREC. See http://trecvid.nist.org/.Google ScholarGoogle Scholar
  22. T. Yeh, J. J. Lee, T. Darrell. "Photo-based Question Answering", ACM Multimedia, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. E. M. Voorhees. 2001. Overview of the TREC 2001 Question Answering Track. In Proceedings of TREC.Google ScholarGoogle Scholar
  24. W. H. Hsu, L. S. Kennedy, and S. F. Chang. Video search reranking through random walk over document-level context graph. In Proceeding of ACM 14th international conference on Multimedia, Augsburg, Germany, October 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. X. Wu, A. G. Hauptmann, and C.-W. Ngo. Practical elimination of near-duplicates from web video search. Proceedings of the 15th international ACM conference on Multimedia, Augsburg, Germany. 2007 Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Y. C. Wu, Y. S. Lee, C.H. Chang. "CLVQ: cross-language video question/answering system", The 6th IEEE international symposium on multimedia software engineering, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Y. C. Wu, J. C. Yang. "A Robust Passage Retrieval Algorithm for Video Question Answering", IEEE Trans. on Circuits and Systems for Video Technology, Vol. 18, No. 10, Oct. 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Y. S. Lee, Y. C. Wu, J.C. Yang. "BVideoQA: Online English/Chinese Bilingual Video Question Answering", Journal of the American Society for Information Science and Technology, 60(3):509--525, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Yahoo alpha search: http://au.alpha.yahoo.com/.Google ScholarGoogle Scholar

Index Terms

  1. From text question-answering to multimedia QA on web-scale media resources

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        LS-MMRM '09: Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
        October 2009
        144 pages
        ISBN:9781605587561
        DOI:10.1145/1631058

        Copyright © 2009 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 23 October 2009

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Upcoming Conference

        MM '24
        MM '24: The 32nd ACM International Conference on Multimedia
        October 28 - November 1, 2024
        Melbourne , VIC , Australia

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader