research-article

From text question-answering to multimedia QA on web-scale media resources

Authors:
Tat-Seng Chua

School of Computing, National University of Singapore, Singapore, Singapore

School of Computing, National University of Singapore, Singapore, Singapore
View Profile

,
Richang Hong

School of Computing, National University of Singapore, Singapore, Singapore

School of Computing, National University of Singapore, Singapore, Singapore
View Profile

,
Guangda Li

School of Computing, National University of Singapore, Singapore, Singapore

School of Computing, National University of Singapore, Singapore, Singapore
View Profile

,
Jinhui Tang

School of Computing, National University of Singapore, Singapore, Singapore

School of Computing, National University of Singapore, Singapore, Singapore
View Profile

LS-MMRM '09: Proceedings of the First ACM workshop on Large-scale multimedia retrieval and miningOctober 2009Pages 51–58https://doi.org/10.1145/1631058.1631069

Published:23 October 2009Publication History

LS-MMRM '09: Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining

Pages 51–58

ABSTRACT

With the proliferation of text and multimedia information, users are now able to find answers to almost any questions on the Web. Meanwhile, they are also bewildered by the huge amount of information routinely presented to them. Question-answering (QA) is a natural direction to address this information over-loading problem. The aim of QA is to return precise answers to users' questions. Text-based QA research has been carried out for the past 15 years with good success especially for answering fact-based questions. The aim of this paper is to extend the text-based QA research to multimedia QA to tackle a range of factoid, definition and "how-to" QA in a common framework. The system will be designed to find multimedia answers from Web-scale media resources such as Flicker and YouTube. This paper describes the architecture and our recent research on various types of multimedia QA for a range of applications. The paper also discusses directions for future research.

References

A. P. Natsev, A. Haubold, J. Tesic, L. Xie, and R. Yan, Semantic concept-based query expansion and re-ranking for multimedia retrieval, ACM Multimedia, pp. 991--1000, Augsburg, Germany, 2007. Google ScholarDigital Library
Cees G. M. Snoek and Marcel Worring, Concept-Based Video Retrieval, Foundations and Trends in Information Retrieval, vol. 4, iss. 2, 215--322, 2009. Google ScholarDigital Library
C. Fellbaum, ed., WordNet: An Electronic Lexical Database. Cambridge, USA: The MIT Press, 1998.Google Scholar
D. Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer, 60:91--110, 2004. Google ScholarDigital Library
Dave Kor and Tat-Seng Chua. Interesting Nuggets and Their Impact on Definitional Question Answering. ACM SIGIR 2007. Amsterdam, Netherlands. July 2007. 335--342. Google ScholarDigital Library
eHow: http://www.ehow.com/videos.htmlGoogle Scholar
Hui Yang and Tat-Seng Chua, Shuguang Wang and Chun-Keat Koh. Structured use of external knowledge for event-based open-domain question--answering. 26th Int'l ACM SIGIR Conference' 03. Canada, Jul/Aug 2003. 33--40. Google ScholarDigital Library
Hang Cui, Min-Yen Kan and Tat--Seng Chua. Soft Pattern Matching Models for Definitional Question Answering. ACM Transactions on Information Systems (ACM TOIS). Vol 25(2), April 2007. 30 pages. Google ScholarDigital Library
John M. Prager: Open-Domain Question-Answering. Foundations and Trends in Information Retrieval 1(2): 91--231 (2006) Google ScholarDigital Library
Jinwei Cao, Jay F. Nunamaker. Question Answering On Lecture Videos: A Multifaceted Approach, ACM/IEEE Joint Conference on Digital Libraries, 2004. Google ScholarDigital Library
Kai Wang, Zhaoyan Ming, Tat-Seng Chua. A Syntactic Tree Matching Approach to Finding Similar Questions in Community-based QA Services. To appear in ACM SIGIR 2009, Boston, Massachusetts, USA. Google ScholarDigital Library
K.-Y. Chen, L. Luesukprasert, and S. T. Chou. Hot topic extraction based on timeline analysis and multidimensional sentence modeling. IEEE transactions on knowledge and data engineering. 19(8):1016--1025. 2007 Google ScholarDigital Library
M. Cha, H. Kwak, P. Rodriguez, YY. Ahn, and S. Moon. I tube, you tube, everybody tubes: analyzing the world's largest user generated content video system. Proceedings of the 7th ACM SIGCOMM conference on Internet measurement. San Diego, California, USA. 2007. Google ScholarDigital Library
Powerset: a commercial factoid-based search engine that was acquired by Microsoft. See http://www.powerset.com/Google Scholar
Guangda Li, Zhaoyan Ming, Haojie Li, Yantao Zheng, Tat-Seng Chua. Video reference: question answering on YouTube. To appear in ACM Multimedia 2009. Google ScholarDigital Library
R. Hong, J. Tang, H. Tan, S. Yan, C.-W. Ngo, T.-C. Chua. Event driven summarization for web videos. Submitted to ACM Multimedia 1st Workshop on Social Media (ACM-MM-WSM 2009). Google ScholarDigital Library
Shiren Ye and Tat-Seng Chua and Jie Lu. Summarizing Definition fromWikipedia. To appear in ACL'09, Singapore. Google ScholarDigital Library
S.-Y. Neo, J. Zhao, M.-Y. Kan, and T.-S. Chua, Video retrieval using high level features: Exploiting query matching and confidence-based weighting, in CIVR, (H. Sundaram et al., eds.), pp. 143--152, Heidelberg, Germany: Springer-Verlag, 2006. Google ScholarDigital Library
S.-F. Chang, W. Hsu, W. Jiang, L.S. Kennedy, D. Xu, A. Yanagawa and E. Zavesky. Columbia University TRECVID-2006 video search and high-level feature extraction. Proceedings of TRECVID Workshop, Gaithersburg, USA, 2006.Google Scholar
TREC: The Text Retrieval Conference. See http://trec.nist.gov/.Google Scholar
TRECVID: a video evaluation forum organized in conjunction with TREC. See http://trecvid.nist.org/.Google Scholar
T. Yeh, J. J. Lee, T. Darrell. "Photo-based Question Answering", ACM Multimedia, 2008. Google ScholarDigital Library
E. M. Voorhees. 2001. Overview of the TREC 2001 Question Answering Track. In Proceedings of TREC.Google Scholar
W. H. Hsu, L. S. Kennedy, and S. F. Chang. Video search reranking through random walk over document-level context graph. In Proceeding of ACM 14th international conference on Multimedia, Augsburg, Germany, October 2007. Google ScholarDigital Library
X. Wu, A. G. Hauptmann, and C.-W. Ngo. Practical elimination of near-duplicates from web video search. Proceedings of the 15th international ACM conference on Multimedia, Augsburg, Germany. 2007 Google ScholarDigital Library
Y. C. Wu, Y. S. Lee, C.H. Chang. "CLVQ: cross-language video question/answering system", The 6th IEEE international symposium on multimedia software engineering, 2004. Google ScholarDigital Library
Y. C. Wu, J. C. Yang. "A Robust Passage Retrieval Algorithm for Video Question Answering", IEEE Trans. on Circuits and Systems for Video Technology, Vol. 18, No. 10, Oct. 2008. Google ScholarDigital Library
Y. S. Lee, Y. C. Wu, J.C. Yang. "BVideoQA: Online English/Chinese Bilingual Video Question Answering", Journal of the American Society for Information Science and Technology, 60(3):509--525, 2009. Google ScholarDigital Library
Yahoo alpha search: http://au.alpha.yahoo.com/.Google Scholar

Index Terms

From text question-answering to multimedia QA on web-scale media resources
1. Information systems
  1. World Wide Web
    1. Web applications
    2. Web services

Recommendations

Quality-aware collaborative question answering: methods and evaluation
WSDM '09: Proceedings of the Second ACM International Conference on Web Search and Data Mining

Community Question Answering (QA) portals contain questions and answers contributed by hundreds of millions of users. These databases of questions and answers are of great value if they can be used directly to answer questions from any user. In this ...
Read More
Beyond Text QA: Multimedia Answer Generation by Harvesting Web Information

Community question answering (cQA) services have gained popularity over the past years. It not only allows community members to post and answer questions but also enables general users to seek information from a comprehensive set of well-answered ...
Read More
Question Answering System Based on Web
ICICTA '12: Proceedings of the 2012 Fifth International Conference on Intelligent Computation Technology and Automation

This paper summarizes the classification, implementation and evaluation of question answering system (QA). QA is divided into four categories: chat robot, QA based knowledge base, QA retrieval system and QA based on free text. Web QA system is composed ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
LS-MMRM '09: Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
October 2009
144 pages
ISBN:9781605587561
DOI:10.1145/1631058
General Chairs:
Rong Yan
IBM TJ Watson Research Center
,
Qi Tian
Microsoft Research Asia and University of Texas, San Antonio
,
John R. Smith
IBM TJ Watson Research Center
,
Rahul Sukthankar
Intel Research and Carnegie Mellon
Copyright © 2009 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 23 October 2009
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
multimedia question-answering.
Qualifiers
- research-article
Conference
Upcoming Conference
MM '24

Sponsor:

sigmm

MM '24: The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 21
  Total Citations
  View Citations
- 361
  Total Downloads
- Downloads (Last 12 months)5
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

From text question-answering to multimedia QA on web-scale media resources

LS-MMRM '09: Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining

ABSTRACT

References

Cited By

Index Terms

Recommendations

Quality-aware collaborative question answering: methods and evaluation

Beyond Text QA: Multimedia Answer Generation by Harvesting Web Information

Question Answering System Based on Web

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Upcoming Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

From text question-answering to multimedia QA on web-scale media resources

LS-MMRM '09: Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining

ABSTRACT

References

Cited By

Index Terms

Recommendations

Quality-aware collaborative question answering: methods and evaluation

Beyond Text QA: Multimedia Answer Generation by Harvesting Web Information

Question Answering System Based on Web

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Upcoming Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media