From symbolic to sub-symbolic information in question classification

Silva, João; Coheur, Luísa; Mendes, Ana Cristina; Wichert, Andreas

doi:10.1007/s10462-010-9188-4

From symbolic to sub-symbolic information in question classification

Published: 03 November 2010

Volume 35, pages 137–154, (2011)
Cite this article

Artificial Intelligence Review Aims and scope Submit manuscript

João Silva¹,
Luísa Coheur¹,
Ana Cristina Mendes¹ &
…
Andreas Wichert¹

962 Accesses
98 Citations
3 Altmetric
Explore all metrics

Abstract

Question Answering (QA) is undoubtedly a growing field of current research in Artificial Intelligence. Question classification, a QA subtask, aims to associate a category to each question, typically representing the semantic class of its answer. This step is of major importance in the QA process, since it is the basis of several key decisions. For instance, classification helps reducing the number of possible answer candidates, as only answers matching the question category should be taken into account. This paper presents and evaluates a rule-based question classifier that partially founds its performance in the detection of the question headword and in its mapping into the target category through the use of WordNet. Moreover, we use the rule-based classifier as a features’ provider of a machine learning-based question classifier. A detailed analysis of the rule-base contribution is presented. Despite using a very compact feature space, state of the art results are obtained.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Amaral C, Cassan A, Figueira H, Martins A, Mendes A, Mendes P, Pinto C, Vidal D (2008) Priberam’s question answering system in QA@CLEF 2007. In: Advances in multilingual and multimodal information retrieval: 8th workshop of the cross-language evaluation forum, CLEF 2007, Budapest, Hungary, September 19–21, 2007. Revised Selected Papers, pp 364–371, Berlin, Heidelberg, 2008. Springer. ISBN 978-3-540-85759-4 http://dx.doi.org/10.1007/978-3-540-85760-0_46
Bhagat R, Leuski A, Hovy E (2005) Shallow semantic parsing despite little training data. In: Proceedings of the ACL/SIGPARSE 9th international workshop on parsing technologies. Vancouver, Canada
Blunsom P, Kocik K, Curran JR (2006) Question classification with log-linear models. In: SIGIR ’06: proceedings of the 29th annual international ACM SIGIR conference on research and development in information retrieval. ACM, New York, USA, pp 615–616. ISBN 1-59593-369-7. http://doi.acm.org/10.1145/1148170.1148282
Carlson AJ, Cumby CM, Rosen JL, Roth D (1999) Snow user guide. Technical report UIUC-DCS-R-99-210, Champaign, IL
Chang C-C, Lin C-J (2001) LIBSVM: a library for support vector machines, Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm
Collins MJ (1999) Head-driven statistical models for natural language parsing. PhD thesis, Philadelphia, PA, USA
Fellbaum C (ed) (1998) WordNet: an electronic lexical database. MIT, Cambridge. URL http://books.google.es/books?hl=es&lr=&id=Rehu8OOzMIMC&oi=fnd&pg=PR11
Hermjakob U, Hovy E, Lin C-Y (2002) Automated question answering in Webclopedia: a demonstration. In: Proceedings of the second international conference on human language technology research, Morgan Kaufmann Publishers Inc, San Francisco, pp 370–371
Huang Z, Thint M, Qin Z (2008) Question classification using head words and their hypernyms. In: EMNLP, pp 927–936
Judge J, Cahill A, van Genabith J (2006) Questionbank: creating a corpus of parse-annotated questions. In: ACL-44: proceedings of the 21st international conference on computational linguistics and the 44th annual meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Morristown, NJ, USA, pp 497–504. http://dx.doi.org/10.3115/1220175.1220238
Krishnan V, Das S, Chakrabarti S (2005) Enhanced answer type inference from questions using sequential models. In: HLT ’05: proceedings of the conference on human language technology and empirical methods in natural language processing, Association for Computational Linguistics, Morristown, NJ, USA, pp 315–322. http://dx.doi.org/10.3115/1220575.1220615
Kwok CCT, Etzioni O, Weld DS (2001) Scaling question answering to the web. In: WWW ’01: proceedings of the 10th international conference on World Wide Web, ACMNew York, NY, USA, pp 150–161. ISBN 1-58113-348-0. http://doi.acm.org/10.1145/371920.371973
Li Fangtao, Zhang Xian, Yuan Jinhui, Zhu Xiaoyan (August 2008) Classifying what-type questions by head noun tagging. In Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008), pages 481–488, Manchester, UK, Coling 2008 Organizing Committee. URL http://www.aclweb.org/anthology/C08-1061.
Li X, Roth D (2002) Learning question classifiers. In: Proceedings of the 19th international conference on computational linguistics, Association for Computational Linguistics, Morristown, NJ, USA, pp 1–7. http://dx.doi.org/10.3115/1072228.1072378
Mendes A, Coheur L, Mamede NJ, Ribeiro RD, de Matos DM, Batista F (2008) QA@L2F, first steps at QA@CLEF. In: Advances in multilingual and multimodal information retrieval, volume 5152 of lecture notes in computer science. Springer, Berlin
Metzler D, Croft WB (2005) Analysis of statistical question classification for fact-based questions. Inf Retr 8(3): 481–504
Article Google Scholar
Moldovan D, Paşca M, Harabagiu S, Surdeanu Mihai (2003) Performance issues and error analysis in an open-domain question answering system. ACM Trans Inf Syst 21(2): 133–154
Article Google Scholar
Moldovan DI, Harabagiu SM, Paşca M, Mihalcea R, Girju R, Goodrum R, Rus V (2000) The structure and performance of an open-domain question answering system. In: ACL
Newell A (1990) Unified theories of cognition. Harvard University Press, Harvard
Google Scholar
Pan Y, Tang Y, Lin L, Luo Y (2008) Question classification with semantic tree kernel. In: SIGIR ’08: proceedings of the 31st annual international ACM SIGIR conference on research and development in information retrieval, ACM, New York, NY, USA, pp 837–838. ISBN:978-1-60558-164-4 http://doi.acm.org/10.1145/1390334.1390530
Petrov S, Klein D (2007) Improved inference for unlexicalized parsing. In: Human language technologies 2007: the conference of the North American chapter of the Association for Computational Linguistics; proceedings of the main conference, Association for Computational Linguistics, Rochester, New York, pp 404–411. URL http://www.aclweb.org/anthology/N/N07/N07-1051
Saquete E, Vicedo JL, Martínez-Barco P, Muñoz R, Llorens H (2009) Enhancing QA systems with complex temporal question processing capabilities. J Artif Intell Res 35: 299–330
Google Scholar
Sharada BA, Girish PM (2004) Wordnet has no ‘recycle bin’
Simon Herbert A (1969) The sciences of the artificial, 1st edn. MIT, Cambridge
Google Scholar
Sun R (1995) A two-level hybrid architecture for structuring knowledge for commonsense reasoning. In: Sun R, Bookman LA (eds) Computational architectures integrating neural and symbolic processing, chap 8. Kluwer, Dordrecht, pp 247–282
Google Scholar
Tarski A. (1956) Logic, semantics, metamathematics. Oxford University Press, London
Google Scholar
Zhang D, Lee WS (2003) Question classification using support vector machines. In: SIGIR ’03: proceedings of the 26th annual international ACM SIGIR conference on research and development in information retrieval, ACM, New York, NY, USA, pp 26–32. ISBN:1-58113-646-3. http://doi.acm.org/10.1145/860435.860443

Download references

Author information

Authors and Affiliations

INESC-ID Lisboa/IST, Av. Prof. Cavaco Silva, 2780-990, Porto Salvo Tagus Park, Portugal
João Silva, Luísa Coheur, Ana Cristina Mendes & Andreas Wichert

Authors

João Silva
View author publications
You can also search for this author in PubMed Google Scholar
Luísa Coheur
View author publications
You can also search for this author in PubMed Google Scholar
Ana Cristina Mendes
View author publications
You can also search for this author in PubMed Google Scholar
Andreas Wichert
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Luísa Coheur.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Silva, J., Coheur, L., Mendes, A.C. et al. From symbolic to sub-symbolic information in question classification. Artif Intell Rev 35, 137–154 (2011). https://doi.org/10.1007/s10462-010-9188-4

Download citation

Published: 03 November 2010
Issue Date: February 2011
DOI: https://doi.org/10.1007/s10462-010-9188-4

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

From symbolic to sub-symbolic information in question classification

Abstract

Access this article

Similar content being viewed by others

Research on Question Classification Based on Bi-LSTM

A Forward-Selection Algorithm for SVM-Based Question Classification in Cognitive Systems

Learning to Classify Marathi Questions and Identify Answer Type Using Machine Learning Technique

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

From symbolic to sub-symbolic information in question classification

Abstract

Access this article

Similar content being viewed by others

Research on Question Classification Based on Bi-LSTM

A Forward-Selection Algorithm for SVM-Based Question Classification in Cognitive Systems

Learning to Classify Marathi Questions and Identify Answer Type Using Machine Learning Technique

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation