ABSTRACT
With the proliferation of text and multimedia information, users are now able to find answers to almost any questions on the Web. Meanwhile, they are also bewildered by the huge amount of information routinely presented to them. Question-answering (QA) is a natural direction to address this information over-loading problem. The aim of QA is to return precise answers to users' questions. Text-based QA research has been carried out for the past 15 years with good success especially for answering fact-based questions. The aim of this paper is to extend the text-based QA research to multimedia QA to tackle a range of factoid, definition and "how-to" QA in a common framework. The system will be designed to find multimedia answers from Web-scale media resources such as Flicker and YouTube. This paper describes the architecture and our recent research on various types of multimedia QA for a range of applications. The paper also discusses directions for future research.
- A. P. Natsev, A. Haubold, J. Tesic, L. Xie, and R. Yan, Semantic concept-based query expansion and re-ranking for multimedia retrieval, ACM Multimedia, pp. 991--1000, Augsburg, Germany, 2007. Google ScholarDigital Library
- Cees G. M. Snoek and Marcel Worring, Concept-Based Video Retrieval, Foundations and Trends in Information Retrieval, vol. 4, iss. 2, 215--322, 2009. Google ScholarDigital Library
- C. Fellbaum, ed., WordNet: An Electronic Lexical Database. Cambridge, USA: The MIT Press, 1998.Google Scholar
- D. Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer, 60:91--110, 2004. Google ScholarDigital Library
- Dave Kor and Tat-Seng Chua. Interesting Nuggets and Their Impact on Definitional Question Answering. ACM SIGIR 2007. Amsterdam, Netherlands. July 2007. 335--342. Google ScholarDigital Library
- eHow: http://www.ehow.com/videos.htmlGoogle Scholar
- Hui Yang and Tat-Seng Chua, Shuguang Wang and Chun-Keat Koh. Structured use of external knowledge for event-based open-domain question--answering. 26th Int'l ACM SIGIR Conference' 03. Canada, Jul/Aug 2003. 33--40. Google ScholarDigital Library
- Hang Cui, Min-Yen Kan and Tat--Seng Chua. Soft Pattern Matching Models for Definitional Question Answering. ACM Transactions on Information Systems (ACM TOIS). Vol 25(2), April 2007. 30 pages. Google ScholarDigital Library
- John M. Prager: Open-Domain Question-Answering. Foundations and Trends in Information Retrieval 1(2): 91--231 (2006) Google ScholarDigital Library
- Jinwei Cao, Jay F. Nunamaker. Question Answering On Lecture Videos: A Multifaceted Approach, ACM/IEEE Joint Conference on Digital Libraries, 2004. Google ScholarDigital Library
- Kai Wang, Zhaoyan Ming, Tat-Seng Chua. A Syntactic Tree Matching Approach to Finding Similar Questions in Community-based QA Services. To appear in ACM SIGIR 2009, Boston, Massachusetts, USA. Google ScholarDigital Library
- K.-Y. Chen, L. Luesukprasert, and S. T. Chou. Hot topic extraction based on timeline analysis and multidimensional sentence modeling. IEEE transactions on knowledge and data engineering. 19(8):1016--1025. 2007 Google ScholarDigital Library
- M. Cha, H. Kwak, P. Rodriguez, YY. Ahn, and S. Moon. I tube, you tube, everybody tubes: analyzing the world's largest user generated content video system. Proceedings of the 7th ACM SIGCOMM conference on Internet measurement. San Diego, California, USA. 2007. Google ScholarDigital Library
- Powerset: a commercial factoid-based search engine that was acquired by Microsoft. See http://www.powerset.com/Google Scholar
- Guangda Li, Zhaoyan Ming, Haojie Li, Yantao Zheng, Tat-Seng Chua. Video reference: question answering on YouTube. To appear in ACM Multimedia 2009. Google ScholarDigital Library
- R. Hong, J. Tang, H. Tan, S. Yan, C.-W. Ngo, T.-C. Chua. Event driven summarization for web videos. Submitted to ACM Multimedia 1st Workshop on Social Media (ACM-MM-WSM 2009). Google ScholarDigital Library
- Shiren Ye and Tat-Seng Chua and Jie Lu. Summarizing Definition fromWikipedia. To appear in ACL'09, Singapore. Google ScholarDigital Library
- S.-Y. Neo, J. Zhao, M.-Y. Kan, and T.-S. Chua, Video retrieval using high level features: Exploiting query matching and confidence-based weighting, in CIVR, (H. Sundaram et al., eds.), pp. 143--152, Heidelberg, Germany: Springer-Verlag, 2006. Google ScholarDigital Library
- S.-F. Chang, W. Hsu, W. Jiang, L.S. Kennedy, D. Xu, A. Yanagawa and E. Zavesky. Columbia University TRECVID-2006 video search and high-level feature extraction. Proceedings of TRECVID Workshop, Gaithersburg, USA, 2006.Google Scholar
- TREC: The Text Retrieval Conference. See http://trec.nist.gov/.Google Scholar
- TRECVID: a video evaluation forum organized in conjunction with TREC. See http://trecvid.nist.org/.Google Scholar
- T. Yeh, J. J. Lee, T. Darrell. "Photo-based Question Answering", ACM Multimedia, 2008. Google ScholarDigital Library
- E. M. Voorhees. 2001. Overview of the TREC 2001 Question Answering Track. In Proceedings of TREC.Google Scholar
- W. H. Hsu, L. S. Kennedy, and S. F. Chang. Video search reranking through random walk over document-level context graph. In Proceeding of ACM 14th international conference on Multimedia, Augsburg, Germany, October 2007. Google ScholarDigital Library
- X. Wu, A. G. Hauptmann, and C.-W. Ngo. Practical elimination of near-duplicates from web video search. Proceedings of the 15th international ACM conference on Multimedia, Augsburg, Germany. 2007 Google ScholarDigital Library
- Y. C. Wu, Y. S. Lee, C.H. Chang. "CLVQ: cross-language video question/answering system", The 6th IEEE international symposium on multimedia software engineering, 2004. Google ScholarDigital Library
- Y. C. Wu, J. C. Yang. "A Robust Passage Retrieval Algorithm for Video Question Answering", IEEE Trans. on Circuits and Systems for Video Technology, Vol. 18, No. 10, Oct. 2008. Google ScholarDigital Library
- Y. S. Lee, Y. C. Wu, J.C. Yang. "BVideoQA: Online English/Chinese Bilingual Video Question Answering", Journal of the American Society for Information Science and Technology, 60(3):509--525, 2009. Google ScholarDigital Library
- Yahoo alpha search: http://au.alpha.yahoo.com/.Google Scholar
Index Terms
- From text question-answering to multimedia QA on web-scale media resources
Recommendations
Quality-aware collaborative question answering: methods and evaluation
WSDM '09: Proceedings of the Second ACM International Conference on Web Search and Data MiningCommunity Question Answering (QA) portals contain questions and answers contributed by hundreds of millions of users. These databases of questions and answers are of great value if they can be used directly to answer questions from any user. In this ...
Beyond Text QA: Multimedia Answer Generation by Harvesting Web Information
Community question answering (cQA) services have gained popularity over the past years. It not only allows community members to post and answer questions but also enables general users to seek information from a comprehensive set of well-answered ...
Question Answering System Based on Web
ICICTA '12: Proceedings of the 2012 Fifth International Conference on Intelligent Computation Technology and AutomationThis paper summarizes the classification, implementation and evaluation of question answering system (QA). QA is divided into four categories: chat robot, QA based knowledge base, QA retrieval system and QA based on free text. Web QA system is composed ...
Comments