A New Passage Ranking Algorithm for Video Question Answering

Wu, Yu-Chieh; Lee, Yue-Shi; Yang, Jie-Chi; Yen, Show-Jane

doi:10.1007/11949534_56

Yu-Chieh Wu¹⁸,
Yue-Shi Lee²⁰,
Jie-Chi Yang¹⁹ &
…
Show-Jane Yen²⁰

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4319))

Included in the following conference series:

Pacific-Rim Symposium on Image and Video Technology

1158 Accesses
2 Citations

Abstract

Developing a question answering (Q/A) system involves in integrating abundant linguistic resources such as syntactic parsers, named entity recognizers which are not only impose time cost but also unavailable in other languages. Ranking-based approaches take the advantage of both efficiency and multilingual portability but most of them bias to high frequent words. In this paper, we propose a new passage ranking algorithm for extending textQ/A toward videoQ/A based on searching lexical information in videos. This method takes both N-gram match and word density into account and finds the optimal match sequence using dynamic programming techniques. Besides, it is very efficient to handle real time tasks for online video question answering. We evaluated our method with 150 actual user’s questions on the 45GB video collections. Nevertheless, four well-known but multilingual portable ranking approaches were adopted to compare. Experimental results show that our method outperforms the second best approach with relatively 25.64% MRR score.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Cai, M., Song, J., Lyu, M.R.: A new approach for video text detection. In: Proceedings of International Conference on Image Processing, pp. 117–120 (2002)
Google Scholar
Cao, J., Nunamaker, J.F.: Question answering on lecture videos: a multifaceted approach. In: International Conference on Digital Libraries, pp. 214–215 (2004)
Google Scholar
Chang, F., Chen, G.C., Lin, C.C., Lin, W.H.: Caption analysis and recognition for building video indexing systems. ACM Multimedia systems 10(4), 344–355 (2005)
Article Google Scholar
Cui, H., Sun, R., Li, K., Kan, M., Chua, T.: Question answering passage retrieval using dependency relations. In: Proceedings of the 28th ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 400–407 (2005)
Google Scholar
Fan, J., Yau, D.K.Y., Elmagarmid, A.K., Aref, W.G.: Automatic image segmentation by integrating color-edge extraction and seeded region growing. IEEE Trans. On Image Processing 10(10), 1454–1464 (2001)
Article MATH Google Scholar
Hong, T., Lam, S.W., Hull, J.J., Srihari, S.N.: The design of a nearest-neighbor classifier and its use for japanese character recognition. In: Proceedings of Third International Conference on Document Analysis and Recognition, pp. 270–291 (1995)
Google Scholar
Lee, G.G., Seo, J.Y., Lee, S.W., Jung, H.M., Cho, B.H., Lee, C.K., Kwak, B.K., Cha, J.W., Kim, D.S., An, J.H., Kim, H.S.: SiteQ: Engineering high performance QA system using lexico-semantic pattern matching and shallow NLP. In: Proceedings of the 10th Text Retrieval Conference, pp. 437–446 (2001)
Google Scholar
Lienhart, R., Wernicke, A.: Localizing and segmenting text in images and videos. IEEE Trans. Circuits and Systems for Video Technology 12(4), 243–255 (2002)
Article Google Scholar
Lin, C.J., Liu, C.C., Chen, H.H.: A simple method for Chinese video OCR and its application to question answering. Computational linguistics and Chinese language processing 6(2), 11–30 (2001)
MathSciNet Google Scholar
Lin, J., Quan, D., Sinha, V., Bakshi, K., Huynh, D., Katz, B., Karger, D.R.: What makes a good answer? the role of context in question answering. In: Proceedings of the 9th international conference on human-computer interaction (INTERACT), pp. 25–32 (2003)
Google Scholar
Lyu, M.R., Song, J., Cai, M.: A comprehensive method for multilingual video text detection, localization, and extraction. IEEE Trans. Circuits and Systems for Video Technology 15(2), 243–255 (2005)
Article Google Scholar
Pasca, M., Harabagiu, S.: High-performance question answering. In: Proceedings of the 24th ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 366–374 (2001)
Google Scholar
Robertson, E., Walker, S., Beaulieu, M.: Okapi at TREC-7: automatic ad hoc, filter-ing, VLC and interactive track. In: Proceedings of the 7th Text Retrieval Conference (1998)
Google Scholar
Rus, V., Moldovan, D.: High precision logic form transformation. International Journal on Artificial Intelligence Tools 11(3), 437–454 (2002)
Article Google Scholar
Savoy, J.: Comparative study on monolingual and multilingual search models for use with Asian languages. ACM transactions on Asian language information processing (TALIP) 4(2), 163–189 (2005)
Article Google Scholar
Tellex, S., Katz, B., Lin, J.J., Fernandes, A., Marton, G.: Quantitative evaluation of passage retrieval algorithms for question answering. In: Proceedings of the 26th ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 41–47 (2003)
Google Scholar
Voorhees, E.M.: Overview of the TREC 2001 question answering track. In: Proceedings of the 10th Text Retrieval Conference, pp. 42–52 (2001)
Google Scholar
Wu, Y.C., Lee, Y.S., Chang, C.H.: CLVQ: Cross-language video question/answering system. In: Proceedings of 6th IEEE International Symposium on Multimedia Software Engineering, pp. 294–301 (2004)
Google Scholar
Yang, H., Chaison, L., Zhao, Y., Neo, S.Y., Chua, T.S.: VideoQA: Question answering on news video. In: Proceedings of the 11th ACM International Conference on Multimedia, pp. 632–641 (2003a)
Google Scholar
Yang, H., Chua, T.S., Wang, S.G., Koh, C.K.: Structural use of external knowledge for event-based open domain question answering. In: Proceedings of the 26th ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 33–40 (2003b)
Google Scholar
Zhang, D., Nunamaker, J.: A natural language approach to content-based video indexing and retrieval for interactive E-learning. IEEE Transactions on Multimedia 6(3), 450–458 (2004)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Information Engineering, National Central University,
Yu-Chieh Wu
Graduate Institute of Network Learning Technology, National Central University, No.300, Jhong-Da Rd., Jhongli City, Taoyuan County, 32001, Taiwan, R.O.C.
Jie-Chi Yang
Department of Computer Science and Information Engineering, Ming Chuan University, No.5, De-Ming Rd, Gweishan District, Taoyuan, 333, Taiwan, R.O.C.
Yue-Shi Lee & Show-Jane Yen

Authors

Yu-Chieh Wu
View author publications
You can also search for this author in PubMed Google Scholar
Yue-Shi Lee
View author publications
You can also search for this author in PubMed Google Scholar
Jie-Chi Yang
View author publications
You can also search for this author in PubMed Google Scholar
Show-Jane Yen
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, National Tsing Hua University, HsinChu, Taiwan
Long-Wen Chang
Department of Electrical Engineering, National Chung Cheng University, 621, Chia-Yi, Taiwan, ROC
Wen-Nung Lie

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wu, YC., Lee, YS., Yang, JC., Yen, SJ. (2006). A New Passage Ranking Algorithm for Video Question Answering. In: Chang, LW., Lie, WN. (eds) Advances in Image and Video Technology. PSIVT 2006. Lecture Notes in Computer Science, vol 4319. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11949534_56

Download citation

DOI: https://doi.org/10.1007/11949534_56
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68297-4
Online ISBN: 978-3-540-68298-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics