Abstract
Question classification is an important research content in automatic question-answering system. Chinese question sentences are different from long texts and those short texts like comments on product. They generally contain interrogative words such as who, which, where or how to specify the information required, and include complete grammatical components in the sentence. Based on these characteristics, we propose a more effective feature extraction method for Chinese question classification in this paper. We first extract the head verb of the sentence and its dependency words combined with interrogative words of the sentence as our base features. And then we use latent semantic analysis to help remove semantic noises from the base features. In the end, we expand those features to be semantic representation features by our weighted word-embedding method. Several experimental results show that our semantic joint feature extraction method outperforms classical syntactic based or content vector based method and superior to convolutional neural network based sentence classification method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Mao, X.L., Li, X.M.: A survey on question and answering systems. J. Front. Comput. Sci. Technol. 6(3), 193–207 (2012)
Kim, Y.: Convolutional neural networks for sentence classification. arXiv preprint arXiv:1408.5882 (2014)
Zhang, L., Huang, H.Y., Hu, C.L.: On question classification in an ontology-based Chinese question-answering System. J. Libr. Sci. China 2(02), 60–65 (2006)
Zhang, W., Chen, J.J.: Method of information entropy and its application in Chinese question classification. Comput. Eng. Appl. 49(10), 129–131 (2013)
Pan, Z.A.: Research on ontology based problem feature model in Chinese problem classification. Taiyuan University of Technology (2010)
Li, X., Du, Y., Huang, X., Wu, L.: Problem classification based on syntactic information and semantic information. In: National Conference on Information Retrieval and Content Security (2004)
Lin, X.D., Sun, A.D., Lin, P.P., Liu, H.X.: Chinese question classification using SVM based on dependency relations. J. Zhengzhou Univ. 41(1), 69–73 (2009)
Ji, Y., Wang, R.B., Chen, Z.Q.: Question classification in restricted domain using syntactic parsing-based quadratic-Bayesian model. J. Comput. Appl. 32(6), 1685–1687 (2012)
Wen, X., Zhang, Y., Liu, T., Ma, J.S.: Syntactic structure parsing based Chinese question classification. J. Chin. Inf. Process. 20(2), 33–39 (2006)
Ye, Z.L., Yang, Y., Jiang, Z., Ying, H.F.: Short question classification based on semantic extensions. J. Comput. Appl. 35(3), 792–796 (2015)
Duan, L., Chen, J., Niu, Y.: Study on question classification approach mixing multiple semantics characteristics. J. Taiyuan Univ. Technol. 42(5), 494–498 (2011)
Li, X., Roth, D.: Learning question classifiers: the role of semantic information. J. Natl. Lang. Eng. 12(3), 229–250 (2006)
Zhang, D., Lee, W.: Question classification using support vector machines. In: Proceedings of the 26th Annual International ACM SIGIR Conference On Research and Development in Information Retrieval, pp. 26–32. ACM Press, New York (2003)
Liu, T., Che, W., Li, Z.: Language technology platform. J. Chin. Inf. Process. 25(6), 53–62 (2011)
Mikolov, T., Sutskever, I., Chen, K.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)
Prager, J., Radev, D., Brown, E., Coden, A.: The use of predictive annotation for question answering in TREC8. In: The Eighth Text Retrieval Conference (TREC 8), pp. 500–246. NIST Special Publication (1999)
Hull, D.: Xerox TREC-8 question answering track report. TREC (1999)
Li, X., Roth, D.: Learning question classifiers: the role of semantic information. Nat. Lang. Eng. 12(3), 229–249 (2006)
Li, B., Liu, Y., Ram, A.: Exploring question subjectivity prediction in community QA. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ACM, pp. 735–736 (2008)
Abraham, A., Pedregosa, F., Eickenberg, M.: Machine learning for neuroimaging with scikit-learn. arXiv preprint arXiv:1412.3919 (2014)
Acknowledgment
This work is supported by the National Science Foundation of China (61402119, 61572145).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Li, X., Liu, H., Jiang, S. (2018). Chinese Question Classification Based on Semantic Joint Features. In: Huang, X., Jiang, J., Zhao, D., Feng, Y., Hong, Y. (eds) Natural Language Processing and Chinese Computing. NLPCC 2017. Lecture Notes in Computer Science(), vol 10619. Springer, Cham. https://doi.org/10.1007/978-3-319-73618-1_10
Download citation
DOI: https://doi.org/10.1007/978-3-319-73618-1_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-73617-4
Online ISBN: 978-3-319-73618-1
eBook Packages: Computer ScienceComputer Science (R0)