DOI: 10.1145/3477495.3531789

Recognizing Medical Search Query Intent by Few-shot Learning

Published: 07 July 2022

Abstract

Online healthcare services can provide users with around-the-clock, timely medical information, promoting social good and removing geographic barriers to care. However, understanding the user intent behind medical queries is challenging. Medical search queries are usually short and noisy, lack strict syntactic structure, and require professional background knowledge to interpret the medical terms. Medical intents are also fine-grained, which makes them hard to recognize, and many intents have only a few labeled examples. To handle these problems, we propose MEDIC, a few-shot learning method for medical search query intent recognition. We extract co-click queries from user search logs as weak supervision to compensate for the lack of labeled data. We also design a new query encoder that learns to represent queries as a combination of semantic knowledge recorded in an external medical knowledge graph, syntactic knowledge that marks the grammatical role of each word in the query, and generic knowledge captured by language models pretrained on large-scale text corpora. Experimental results on a real medical search query intent recognition dataset validate the effectiveness of MEDIC.
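
A minimal PyTorch sketch of the two ideas the abstract describes is given below. It is not the authors' implementation: the abstract does not spell out the exact few-shot objective, so the sketch uses a standard prototypical-network episode as a stand-in, and FusedQueryEncoder, its embedding tables, dimensions, and the toy episode data are all hypothetical placeholders for the semantic (knowledge-graph), syntactic (grammatical-role), and generic (pretrained language model) views of a query.

import torch
import torch.nn as nn
import torch.nn.functional as F

class FusedQueryEncoder(nn.Module):
    """Hypothetical encoder that fuses three views of a query."""
    def __init__(self, vocab_size, n_entities, n_pos_tags, dim=128):
        super().__init__()
        # Stand-ins for the three knowledge sources named in the abstract:
        self.token_emb = nn.Embedding(vocab_size, dim)    # generic: would be a pretrained LM in practice
        self.entity_emb = nn.Embedding(n_entities, dim)   # semantic: medical KG entities linked to the query
        self.pos_emb = nn.Embedding(n_pos_tags, dim)      # syntactic: grammatical role of each word
        self.fuse = nn.Linear(3 * dim, dim)

    def forward(self, token_ids, entity_ids, pos_ids):
        # Mean-pool each view over the query tokens, then fuse.
        t = self.token_emb(token_ids).mean(dim=1)
        e = self.entity_emb(entity_ids).mean(dim=1)
        p = self.pos_emb(pos_ids).mean(dim=1)
        return self.fuse(torch.cat([t, e, p], dim=-1))

def prototypical_loss(support_emb, support_labels, query_emb, query_labels, n_way):
    # Class prototype = mean support embedding per intent class.
    prototypes = torch.stack(
        [support_emb[support_labels == c].mean(dim=0) for c in range(n_way)]
    )
    # Negative squared Euclidean distance to each prototype acts as the logit.
    logits = -torch.cdist(query_emb, prototypes) ** 2
    return F.cross_entropy(logits, query_labels)

# Toy 3-way / 2-shot episode with random ids, only to show the shapes.
enc = FusedQueryEncoder(vocab_size=1000, n_entities=500, n_pos_tags=20)

def random_queries(n, seq_len=6):
    return (torch.randint(0, 1000, (n, seq_len)),   # token ids
            torch.randint(0, 500, (n, seq_len)),    # linked KG entity ids
            torch.randint(0, 20, (n, seq_len)))     # grammatical-role tags

support_labels = torch.tensor([0, 0, 1, 1, 2, 2])
query_labels = torch.tensor([0, 1, 2])
loss = prototypical_loss(enc(*random_queries(6)), support_labels,
                         enc(*random_queries(3)), query_labels, n_way=3)
loss.backward()

In the paper's setting, the token view would come from a pretrained language model, the entity and role ids from knowledge-graph linking and syntactic tagging of the query, and co-click queries mined from search logs would supply additional weakly labeled episodes; the snippet above only illustrates how the three views can be fused and trained episodically.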

Supplementary Material

MP4 File (SIGIR-fp1370.mp4)
Presentation video for MEDIC.

Published In

SIGIR '22: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
July 2022
3569 pages
ISBN: 9781450387323
DOI: 10.1145/3477495
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. co-click query analysis
  2. few-shot learning
  3. graph representation learning
  4. knowledge graph
  5. medical search query understanding
  6. query intent recognition

Qualifiers

  • Research-article

Conference

SIGIR '22

Acceptance Rates

Overall Acceptance Rate: 792 of 3,983 submissions, 20%

Article Metrics

  • Downloads (Last 12 months): 81
  • Downloads (Last 6 weeks): 6
Reflects downloads up to 28 Feb 2025

Cited By

  • (2024) Warming Up Cold-Start CTR Prediction by Learning Item-Specific Feature Interactions. Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 3233-3244. DOI: 10.1145/3637528.3671784. Online publication date: 25-Aug-2024.
  • (2024) Reinforced Self-Supervised Training for Few-Shot Learning. IEEE Signal Processing Letters, 31, 731-735. DOI: 10.1109/LSP.2024.3370488. Online publication date: 2024.
  • (2024) Optimizing Question Answering Systems in Education: Addressing Domain-Specific Challenges. IEEE Access, 12, 156572-156587. DOI: 10.1109/ACCESS.2024.3483224. Online publication date: 2024.
  • (2024) APPN: An Attention-based Pseudo-label Propagation Network for few-shot learning with noisy labels. Neurocomputing, 602, 128212. DOI: 10.1016/j.neucom.2024.128212. Online publication date: Oct-2024.
  • (2024) Meta-learning in Healthcare: A Survey. SN Computer Science, 5:6. DOI: 10.1007/s42979-024-03166-9. Online publication date: 12-Aug-2024.
  • (2024) Understanding user intent modeling for conversational recommender systems: a systematic literature review. User Modeling and User-Adapted Interaction. DOI: 10.1007/s11257-024-09398-x. Online publication date: 6-Jun-2024.
  • (2023) An Intent Taxonomy of Legal Case Retrieval. ACM Transactions on Information Systems, 42:2, 1-27. DOI: 10.1145/3626093. Online publication date: 11-Dec-2023.
  • (2023) SCHash: Speedy Simplicial Complex Neural Networks via Randomized Hashing. Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 1609-1618. DOI: 10.1145/3539618.3591762. Online publication date: 19-Jul-2023.
  • (2023) From Instance to Metric Calibration: A Unified Framework for Open-World Few-Shot Learning. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45:8, 9757-9773. DOI: 10.1109/TPAMI.2023.3244023. Online publication date: 1-Aug-2023.
  • (2023) Joint agricultural intent detection and slot filling based on enhanced heterogeneous attention mechanism. Computers and Electronics in Agriculture, 207:C. DOI: 10.1016/j.compag.2023.107756. Online publication date: 1-Apr-2023.
