Abstract
Question classification plays an important role in question answering (QA) system, and its results directly affect the quality of QA. Traditional methods of question classification include rule-based methods and statistical machine learning methods. They need to manually summarize rules or extract the features of questions. The rule definition and feature selection are subjective and one-sided, which are not conducive to fully understand the semantic information of questions. Based on the above problem, this paper proposes a question classification model based on Bi-LSTM. This model combines words, part of speech (POS) and position information of words to generate embedded representation of words, and uses Bi-LSTM to classify questions. The method can efficiently extract the local features of questions and simplify feature engineering. The accuracy of coarse-grained classification on the question classification data set of Harbin Institute of Technology (HIT) has reached 92.38%.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Zheng, F.S., Liu, T., Qin, B., et al.: Overview of question-answering. J. Chin. Inf. 16(6), 46–52 (2002)
Zhou, X.P.: Research on question classification based on deep learning. Harbin Institute of Technology (2016)
Zhen, L.H., Wang, X.L., Yang, S.C.: Overview on question classification in question—answering system. J. Anhui Univ. Technol. (Nat. Sci. Edn.) 32(1), 48–54 (2015). (In Chinese)
Li, W.: Question classification using language modeling. CIIR Technical report (2007)
Li, X., Roth, D.: Learning question classifiers. In: COLING-2002, pp. 556–562 (2012)
Li, X., Du, Y.P., Huang, X.J., et al.: Question classification using syntactic and semantic information. In: National Conference on Information Retrieval and Content Security (2004). (In Chinese)
Zhang, D., Lee, W.S.: Question classification using support vector machines, pp. 26–32. ACM (2003)
Dan, R., Small, K.: The role of semantic information in learning question classifiers. In: Conference First International Joint Conference on Natural Language Processing, pp. 184–187 (2004)
Li, C., Chai, Y.M., Nan, X.F., et al.: Research on question classification method based on deep learning. Comput. Sci. 43(12), 115–119 (2016). (In Chinese)
Magnini, B., Negri, M., Prevete, R., et al.: Mining knowledge from repeated co-occurrences. DIOGENE at TREC 2002 (2002)
Yang, H., Chua, T.S., Wang, S., Koh, C.K: Structured use of external knowledge for event-based open domain question answering. In: Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 33–40. ACM (2003)
Zhang, Y., Liu, T., Wen, X.: Modified Bayesian model based question classification. J. Chin. Inf. Process. 19(2), 101–106 (2005). (In Chinese)
Tian, W.D., Gao, Y.Y., Zu, Y.L.: Question classification based on self-learning rules and modified Bayes. Res. Comput. Appl. 27(8), 2869–2871 (2010). (In Chinese)
Wen, X., Zhang, Y., Liu, T., et al.: Syntactic structure parsing based Chinese question classification 20(2), 35–41 (2006). (In Chinese)
Sun, J.G., Cai, D.F., Lv, D.X., et al.: HowNet based Chinese question automatic classification. J. Chin. Inf. Process. 21(1), 90–95 (2007). (In Chinese)
Yu, Z.T., Fan, X.Z., Guo, J.Y.: Chinese question classification based on support vector machines. J. South China Univ. Technol. (Nat. Sci. Edn.) 33(9), 25–29 (2005). (In Chinese)
Krizhevsky, A., Sutskever, I., Hinton, G.: ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems 25, pp. 1106–1114 (2012)
Graves, A., Mohamed, A.R., Hinton, G.: Speech recognition with deep recurrent neural networks. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6645–6649. IEEE (2013)
Kim, Y.: Convolutional neural networks for sentence classification. Eprint Arxiv (2014)
Pang, B., Lee, L.: A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts. In: Proceedings of the 42nd annual meeting on Association for Computational Linguistics, p. 271. Association for Computational Linguistics (2004)
Zong, C.Q.: Statistical Natural Language Processing. Tsinghua University Press, Beijing (2013). (In Chinese)
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Zhang, Q., Mu, L., Zhang, K., Zan, H., Li, Y. (2018). Research on Question Classification Based on Bi-LSTM. In: Hong, JF., Su, Q., Wu, JS. (eds) Chinese Lexical Semantics. CLSW 2018. Lecture Notes in Computer Science(), vol 11173. Springer, Cham. https://doi.org/10.1007/978-3-030-04015-4_44
Download citation
DOI: https://doi.org/10.1007/978-3-030-04015-4_44
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-04014-7
Online ISBN: 978-3-030-04015-4
eBook Packages: Computer ScienceComputer Science (R0)