ABSTRACT
This study takes an investigation on intent detection and slot filling, which are two critical tasks for spoken language understanding, of low-resource languages, for instance, the Vietnamese language. Specifically, recent joint models for intent detection and slot filling have achieved state-of-the-art performance, which has proved the strong relationship between the two tasks. In this paper, we focus on the problem of the joint models for two tasks on low resource language, in which the data resource is limited, which leads to the problem of similarity between samples (e.g., in the same domain). Particularly, we present a new method for the aforementioned issues by combining two recent state-of-the-art models in this research field such as JointBERT and JointIDSF for Vietnamese languages. The experiment on PhoATIS, the first public Vietnamese dataset, and our custom similarity for intent detection and slot filling, indicate the promising results of the proposed method.
- Qian Chen, Zhu Zhuo, and Wen Wang. 2019. BERT for Joint Intent Classification and Slot Filling. CoRR abs/1902.10909(2019). arXiv:1902.10909http://arxiv.org/abs/1902.10909Google Scholar
- Mai Hoang Dao, Thinh Hung Truong, and Dat Quoc Nguyen. 2021. Intent Detection and Slot Filling for Vietnamese. In Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021, Hynek Hermansky, Honza Cernocký, Lukás Burget, Lori Lamel, Odette Scharenborg, and Petr Motlícek (Eds.). ISCA, 4698–4702. https://doi.org/10.21437/Interspeech.2021-618Google Scholar
- Chih-Wen Goo, Guang Gao, Yun-Kai Hsu, Chih-Li Huo, Tsung-Chieh Chen, Keng-Wei Hsu, and Yun-Nung Chen. 2018. Slot-Gated Modeling for Joint Slot Filling and Intent Prediction. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT, New Orleans, Louisiana, USA, June 1-6, 2018, Volume 2 (Short Papers), Marilyn A. Walker, Heng Ji, and Amanda Stent (Eds.). Association for Computational Linguistics, 753–757. https://doi.org/10.18653/v1/n18-2118Google ScholarCross Ref
- Homa B Hashemi, Amir Asiaee, and Reiner Kraft. 2016. Query intent detection using convolutional neural networks. In International Conference on Web Search and Data Mining, Workshop on Query Understanding.Google Scholar
- Changliang Li, Liang Li, and Ji Qi. 2018. A Self-Attentive Model with Gate Mechanism for Spoken Language Understanding. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31 - November 4, 2018, Ellen Riloff, David Chiang, Julia Hockenmaier, and Jun’ichi Tsujii (Eds.). Association for Computational Linguistics, 3824–3833. https://doi.org/10.18653/v1/d18-1417Google ScholarCross Ref
- Samuel Louvan and Bernardo Magnini. 2020. Recent Neural Methods on Slot Filling and Intent Classification for Task-Oriented Dialogue Systems: A Survey. In Proceedings of the 28th International Conference on Computational Linguistics, COLING 2020, Barcelona, Spain (Online), December 8-13, 2020, Donia Scott, Núria Bel, and Chengqing Zong (Eds.). International Committee on Computational Linguistics, 480–496. https://doi.org/10.18653/v1/2020.coling-main.42Google ScholarCross Ref
- Patti J. Price. 1990. Evaluation of Spoken Language Systems: the ATIS Domain. In Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, Pennsylvania, USA, June 24-27, 1990. Morgan Kaufmann. https://aclanthology.org/H90-1020/Google Scholar
- Libo Qin, Tailu Liu, Wanxiang Che, Bingbing Kang, Sendong Zhao, and Ting Liu. 2021. A Co-Interactive Transformer for Joint Slot Filling and Intent Detection. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021. IEEE, 8193–8197. https://doi.org/10.1109/ICASSP39728.2021.9414110Google ScholarCross Ref
- Henry Weld, Xiaoqi Huang, Siqi Long, Josiah Poon, and Soyeon Caren Han. 2021. A survey of joint intent detection and slot-filling models in natural language understanding. CoRR abs/2101.08091(2021). arXiv:2101.08091https://arxiv.org/abs/2101.08091Google Scholar
- Congying Xia, Chenwei Zhang, Xiaohui Yan, Yi Chang, and Philip S. Yu. 2018. Zero-shot User Intent Detection via Capsule Neural Networks. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31 - November 4, 2018, Ellen Riloff, David Chiang, Julia Hockenmaier, and Jun’ichi Tsujii (Eds.). Association for Computational Linguistics, 3090–3099. https://doi.org/10.18653/v1/d18-1348Google ScholarCross Ref
- Xiaodong Zhang and Houfeng Wang. 2016. A Joint Model of Intent Determination and Slot Filling for Spoken Language Understanding. In Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, IJCAI 2016, New York, NY, USA, 9-15 July 2016, Subbarao Kambhampati (Ed.). IJCAI/AAAI Press, 2993–2999. http://www.ijcai.org/Abstract/16/425Google Scholar
- Zhen Zhang, Hao Huang, and Kai Wang. 2020. Using Deep Time Delay Neural Network for Slot Filling in Spoken Language Understanding. Symmetry 12, 6 (2020), 993. https://doi.org/10.3390/sym12060993Google ScholarCross Ref
- Zhichang Zhang, Zhenwen Zhang, Haoyuan Chen, and Zhiman Zhang. 2019. A Joint Learning Framework With BERT for Spoken Language Understanding. IEEE Access 7(2019), 168849–168858. https://doi.org/10.1109/ACCESS.2019.2954766Google ScholarCross Ref
- Sendong Zhao, Ting Liu, Sicheng Zhao, and Fei Wang. 2019. A Neural Multi-Task Learning Framework to Jointly Model Medical Named Entity Recognition and Normalization. In The Thirty-Third AAAI Conference on Artificial Intelligence, AAAI 2019, The Thirty-First Innovative Applications of Artificial Intelligence Conference, IAAI 2019, The Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019, Honolulu, Hawaii, USA, January 27 - February 1, 2019. AAAI Press, 817–824. https://doi.org/10.1609/aaai.v33i01.3301817Google ScholarDigital Library
- Su Zhu and Kai Yu. 2017. Encoder-decoder with focus-mechanism for sequence labelling based spoken language understanding. In 2017 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2017, New Orleans, LA, USA, March 5-9, 2017. IEEE, 5675–5679. https://doi.org/10.1109/ICASSP.2017.7953243Google ScholarDigital Library
Index Terms
- Intent Detection and Slot Filling with Low Resource and Domain
Recommendations
Intent-Slot Correlation Modeling for Joint Intent Prediction and Slot Filling
AbstractSlot filling and intent prediction are basic tasks in capturing semantic frame of human utterances. Slots and intent have strong correlation for semantic frame parsing. For each utterance, a specific intent type is generally determined with the ...
An Interactive Two-Pass Decoding Network for Joint Intent Detection and Slot Filling
Natural Language Processing and Chinese ComputingAbstractIntent detection and slot filling are two closely related tasks for building a spoken language understanding (SLU) system. The joint methods for the two tasks focus on modeling the semantic correlations between the intent and slots and applying ...
Multitask learning for multilingual intent detection and slot filling in dialogue systems
AbstractDialogue systems are becoming an ubiquitous presence in our everyday lives having a huge impact on business and society. Spoken language understanding (SLU) is the critical component of every goal-oriented dialogue system or any ...
Highlights- We propose a multilingual multitask approach to fuse the two primary SLU tasks.
Comments