Abstract
Deep learning models such as RoBERTa and Bi-LSTM are widely used for user intent classification. In the medical domain, however, recognizing user intents is difficult due to the complexity of medical query expressions and domain-specific terminology. This paper proposes an alignment strategy named EP-LSA, based on early prediction and label smoothing, to classify the intents of medical text queries. EP-LSA uses the Chinese pre-trained model RoBERTa to encode sentence features rich in semantic information, makes predictions from the early features of the Bi-LSTM in an RCNN, and aligns these early features with the output features. The early knowledge obtained from early prediction is trained with a cross-entropy loss incorporating label smoothing, which injects a small amount of random information into the early knowledge and helps the strategy extract more fine-grained features related to intent labels. Evaluation was performed on two publicly available datasets, KUAKE and CMID. The results show that the proposed EP-LSA strategy outperforms the baseline methods, demonstrating its effectiveness.
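As a concrete illustration of the loss described above (a minimal sketch in plain Python, not the authors' implementation), cross-entropy with label smoothing mixes the one-hot target with a uniform distribution over the K classes, so each wrong class receives a small probability mass eps/K:

```python
import math

def log_softmax(logits):
    # Numerically stable log-softmax over a list of raw scores.
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]

def label_smoothing_ce(logits, target, eps=0.1):
    # Smoothed target: (1 - eps) on the true class, eps/K spread uniformly.
    k = len(logits)
    log_probs = log_softmax(logits)
    smoothed = [(1.0 - eps) * (1.0 if i == target else 0.0) + eps / k
                for i in range(k)]
    # Cross-entropy between the smoothed target and the predicted distribution.
    return -sum(t * lp for t, lp in zip(smoothed, log_probs))
```

With eps = 0 this reduces to the standard cross-entropy; a small positive eps (the smoothing factor, a hyperparameter) penalizes over-confident predictions, which is the mechanism the strategy relies on to regularize the early knowledge.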
Acknowledgements
This work was supported by the Natural Science Foundation of Guangdong Province (2021A1515011339).
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
Cite this paper
Luo, Y., Huang, Z., Wong, LP., Zhan, C., Wang, F.L., Hao, T. (2022). An Early Prediction and Label Smoothing Alignment Strategy for User Intent Classification of Medical Queries. In: Zhang, H., et al. Neural Computing for Advanced Applications. NCAA 2022. Communications in Computer and Information Science, vol 1637. Springer, Singapore. https://doi.org/10.1007/978-981-19-6142-7_9