Skip to main content

Taxonomy Building and Machine Learning Based Automatic Classification for Knowledge-Oriented Chinese Questions

  • Conference paper
Book cover Advances in Intelligent Computing (ICIC 2005)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3644))

Included in the following conference series:

Abstract

In this paper, we propose a taxonomy for knowledge-oriented question, and study the machine learning based classification for knowledge-oriented Chinese questions. By knowledge-oriented questions, we mean questions carrying information or knowledge about something, which cannot be well described by previous taxonomies. We build the taxonomy after the study of previous work and analysis of 6776 Chinese knowledge-oriented questions collected from different realistic sources. Then we investigate the new task of knowledge-oriented Chinese questions classification based on this taxonomy. In our approach, the popular SVM learning method is employed as classification algorithm. We explore different features and their combinations and different kernel functions for the classification, and use different performance metrics for evaluation. The results demonstrate that the proposed approach is desirable and robust. Thorough error analysis is also conduced.

This work was supported by NSF of China (Grant No. 60473136), the National High Technology Research and Development Major Program of China (863 Program) (Grant No. 2004AA1Z2280), the Doctoral Program Foundation of the China Ministry of Education (Grant No. 20040698028) and the Project of Tackling Key Problems in Science and Technology of Shaanxi province in China (Grant No. 2003K05-G25).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Hermjakob, U.: Parsing and Question Classification for Question Answering. In: Proceedings of the Association for Computational Linguistics 2001 Workshop on Open-Domain Question Answering, pp. 17–22 (2001)

    Google Scholar 

  2. Hovy, E., Hermjakob, U., Ravichandran, D.: A Question/Answer Typology with Surface Text Patterns. In: Proceeding of the Human Language Technology Conference (2002)

    Google Scholar 

  3. Roth, D., Cumby, C., Li, X., Morie, P., Nagarajan, R., Rizzolo, N., Small, K., Yih, W.: Question-Answering via Enhanced Understanding of Questions. In: Proceeding of the 11th Text Retrieval Conference (2002)

    Google Scholar 

  4. Kelly, D., Murdock, V., Yuan, X.J., Croft, W.B., Belkin, N.J.: Features of Documents Relevant to Task- and Fact- Oriented Questions. In: Proceeding of the Eleventh International Conference on Information and Knowledge Management, pp. 645–647 (2002)

    Google Scholar 

  5. Lehnert, W.G.: A Conceptual Theory of Question Answering. In: Natural Language Processing, pp. 651–658 (1986)

    Google Scholar 

  6. Hovy, E., Gerber, L., Hermjakob, U., Junk, M., Lin, C.: Question Answering in Webclopedia. In: Proceedings of the Ninth Text Retrieval Conference, pp. 655–664 (2002)

    Google Scholar 

  7. Li, X., Roth, D.: Learning Question Classifiers. In: Proceedings of the 19th International Conference on Computational Linguistics, pp. 556–562 (2002)

    Google Scholar 

  8. Suzuki, J., Taira, H., Sasaki, Y., Maeda, E.: Question Classification using HDAG Kernel. In: Proceeding of the 41st Annual Meeting of the Association for Computational Linguistics, pp. 61–68 (2003)

    Google Scholar 

  9. http://www.broward.k12.fl.us/learnresource/Info_literacy/Bloom’s_Taxonomy.pdf

  10. Singhal, A., Abney, S., Bacchiani, M., Collins, M., Hindle, D., Pereira, F.: AT&T at TREC-8. In: Proceedings of the 8th Text Retrieval Conference, pp. 317–330 (2000)

    Google Scholar 

  11. Zhang, D., Lee, W.S.: Question Classification using Support Vector Machines. In: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 26–32 (2003)

    Google Scholar 

  12. Metzler, D., Croft, W.B.: Analysis of Statistical Question Classification for Fact-based Questions. Journal of Information Retrieval, 481–504 (2005)

    Google Scholar 

  13. Zheng, Q., Hu, Y., Zhang, S.: The Research and Implementation of Nature Language Web Answer System. Mini-Micro Systems, 554–560 (2005)

    Google Scholar 

  14. http://www.cwirf.org/

  15. http://www.keenage.com

  16. Cortes, C., Vapnik, V.: Support-Vector Networks. Machine Learning, 273–297 (1995)

    Google Scholar 

  17. Joachims, T.: Estimating the Generalization Performance of a SVM Efficiently. In: Proceeding of the International Conference on Machine Learning, pp. 431–438 (2000)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hu, Y., Zheng, Q., Bai, H., Sun, X., Dang, H. (2005). Taxonomy Building and Machine Learning Based Automatic Classification for Knowledge-Oriented Chinese Questions. In: Huang, DS., Zhang, XP., Huang, GB. (eds) Advances in Intelligent Computing. ICIC 2005. Lecture Notes in Computer Science, vol 3644. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11538059_51

Download citation

  • DOI: https://doi.org/10.1007/11538059_51

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-28226-6

  • Online ISBN: 978-3-540-31902-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics