Abstract
As our knowledge, there is no dialog system for mental health-care domain in Hindi. This may be due to unavailability of user utterances corpora in Hindi for this domain. In this paper, we propose a novel algorithmic approach for user utterance generation in Hindi by considering dialects, linguistic attributes, symptoms, frequency of symptoms, and intensity of symptoms and history of symptoms. We use nine symptoms (anger, emptiness, fear, irritation, restlessness, suicide, sadness, tension, worry) as given in DSM5, ICD-11, and WHO guideline. These symptoms were used for generation of utterances and validation of the generated utterances for different type of mental diseases. We collected utterances by interviewing patients in clinic and found that it closely match to the utterance generated by proposed algorithm. The generated utterance corpus is also validated using machine learning methods in the framework of CNN, Bi-LSTM and Dense.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Mesnil, G., et al.: Using recurrent neural networks for slot filling in spoken language understanding. In: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 23, pp. 530–539(2015)
Hanjung, L.: The Emergence of the Unmarked Order in Hindi. Northeast Linguistics Society: vol.30, article 6 (2000). https://scholarworks.umass.edu/nels/vol30/iss2/6. Accessed 14 Sept 2023
Chen, Q., Zhuo, Z., Wang, W.: BERT for Joint Intent Classification and Slot Filling. ArXiv (2019)
Malviya, S., Mishra, R., Barnwal, S.K., Tiwary, U.S.: HDRS: Hindi dialogue restaurant search corpus for dialogue state tracking in task-oriented environment. IEEE/ACM Trans. Audio, Speech, Lang. Process. 29, 2517–2528 (2021)
Lee, H., Lee, J., Kim, T.Y.: SUMBT: slot-utterance matching for universal and scalable belief tracking. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 5478–5483. Association for Computational Linguistics Florence, Italy (2019)
Ma, Z., Sun, B., Li, S.: A two-stage selective fusion framework for joint intent detection and slot filling. IEEE Trans. Neural Netw. Learn. Syst. (2022). https://doi.org/10.1109/TNNLS.2022.3202562
Wu, J., Harris, I. G., Zhao, H., Ling, G.: A graph-to-sequence model for joint intent detection and slot filling. In: 2023 IEEE 17th International Conference on Semantic Computing (ICSC), pp. 131–138. Laguna Hills, CA, USA (2023)
Ma, X., Hovy, E.: End-to-end sequence labeling via Bi-directional LSTM-CNNs-CRF. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1064–1074. Berlin, Germany. Association for Computational Linguistics (2016)
Mrkšić, N., Séaghdha, D.O., Wen, T.H., Thomson, B., Young, S.: Neural belief tracker: data-driven dialogue state tracking. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics vol. 1: Long Papers, pp. 1777–1788, Vancouver, Canada. Association for Computational Linguistics (2017)
Coucke, A., et al.: Snips voice platform: an embedded spoken language understanding system for private-by-design voice interfaces. arXiv:1805.10190 (2018)
Hemphill, C.T., Godfrey, J.J., Doddington, G.R.: The ATIS spoken language systems pilot corpus. Speech and natural language. In: Proceedings of a Workshop Held at Hidden Valley, Pennsylvania, June 24–27 (1990)
Ramaneswaran, S., Vijay, S., Srinivasan, K.: TamilATIS: dataset for task-oriented dialog in Tamil. In: Proceedings of the Second Workshop on Speech and Language Technologies for Dravidian Languages (2022)
Kane, B., Rossi, F., Guinaudeau, O., Chiesa, V., Quénel, I., Chau, S.: Joint intent detection and slot filling via CNN-LSTM-CRF. In: 2020 6th IEEE Congress on Information Science and Technology (CiSt), Agadir - Essaouira, Morocco, pp. 342–347. (2020)
Lafferty, J., McCallum, A., Pereira, F.C.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: ICML 2001: Proceedings of the Eighteenth International Conference on Machine Learning, pp. 282–289 (2001)
American Psychiatric Association.Diagnostic and statistical manual of mental disorders (5th ed., text rev.) (2022)
International Classification of Diseases, Eleventh Revision (ICD-11), World Health Organization (WHO) https://icd.who.int/browse1 International Classification of Diseases, Eleventh Revision (ICD-11), World Health Organization (WHO) (2019/2021)
World Health Organization: Doing What Matters in Time of Stress -An Illustrated Guide. World Health Organization (WHO) (2020)
Zhang, X., Zhao, J., LeCun, Y.: Character-level convolutional networks for text classification Courant Institute of Mathematical Sciences, New York University 719 Broadway, 12th Floor, New York, NY 10003 (2015)
Hochreiter, S., Schmidhuber, J.: Long short term memory. Neural Comput. 9(8), 1735–1780 (1997)
Kim, Y., Jernite, Y., Sontag, D., Rush, A.M.: arXiv:1508.06615 (2015)
National Institute of Mental Health and Neurosciences Bengaluru, National Mental Health Survey of India, 2015–16: Mental Health Systems. (2015–2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Prakash, S., Singh, M.K., Tiwary, U.S., Srivastava, M. (2024). HUCMD: Hindi Utterance Corpus for Mental Disorders. In: Choi, B.J., Singh, D., Tiwary, U.S., Chung, WY. (eds) Intelligent Human Computer Interaction. IHCI 2023. Lecture Notes in Computer Science, vol 14531. Springer, Cham. https://doi.org/10.1007/978-3-031-53827-8_5
Download citation
DOI: https://doi.org/10.1007/978-3-031-53827-8_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-53826-1
Online ISBN: 978-3-031-53827-8
eBook Packages: Computer ScienceComputer Science (R0)