HUCMD: Hindi Utterance Corpus for Mental Disorders

Prakash, Shaurya; Singh, Manoj Kumar; Tiwary, Uma Shanker; Srivastava, Mona

doi:10.1007/978-3-031-53827-8_5

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14531)

Included in the following conference series:

International Conference on Intelligent Human Computer Interaction

Abstract

To our knowledge, there is no dialog system for the mental health-care domain in Hindi. This may be due to the unavailability of user utterance corpora in Hindi for this domain. In this paper, we propose a novel algorithmic approach for generating user utterances in Hindi that considers dialects, linguistic attributes, symptoms, and the frequency, intensity, and history of symptoms. We use nine symptoms (anger, emptiness, fear, irritation, restlessness, suicide, sadness, tension, worry) as given in DSM-5, ICD-11, and the WHO guideline. These symptoms were used both to generate utterances and to validate the generated utterances for different types of mental diseases. We also collected utterances by interviewing patients in a clinic and found that they closely match the utterances generated by the proposed algorithm. The generated utterance corpus is further validated using machine learning models based on CNN, Bi-LSTM, and Dense layers.
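The abstract does not spell out the generation algorithm, so the following is only a minimal sketch of one way slot-based utterance generation could work, not the authors' method. The nine symptoms come from the abstract. The frequency, intensity, and history values, the English template, and the function name `generate_utterances` are all hypothetical placeholders; a real corpus would use Hindi templates and would also vary dialect and linguistic attributes.

```python
from itertools import product

# The nine symptoms listed in the abstract.
SYMPTOMS = ["anger", "emptiness", "fear", "irritation", "restlessness",
            "suicide", "sadness", "tension", "worry"]

# Hypothetical slot values (English glosses, not taken from the paper).
FREQUENCY = ["sometimes", "often", "every day"]
INTENSITY = ["mild", "strong"]
HISTORY = ["for a week", "for several months"]

# Hypothetical surface template; the paper's templates are in Hindi.
TEMPLATE = "I have had {intensity} feelings of {symptom} {frequency} {history}."


def generate_utterances(symptoms=SYMPTOMS):
    """Yield (utterance, slots) for every combination of slot values."""
    for symptom, frequency, intensity, history in product(
            symptoms, FREQUENCY, INTENSITY, HISTORY):
        slots = {
            "symptom": symptom,
            "frequency": frequency,
            "intensity": intensity,
            "history": history,
        }
        # The slot dictionary doubles as the annotation for the utterance.
        yield TEMPLATE.format(**slots), slots


corpus = list(generate_utterances())
print(len(corpus))   # 9 symptoms * 3 * 2 * 2 = 108 utterances
print(corpus[0][0])  # I have had mild feelings of anger sometimes for a week.
```

Keeping the slot values alongside each generated string means every utterance arrives already labeled, which is what a downstream classifier or slot-filling model would need for training.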

References

Mesnil, G., et al.: Using recurrent neural networks for slot filling in spoken language understanding. In: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 23, pp. 530–539 (2015)
Hanjung, L.: The Emergence of the Unmarked Order in Hindi. Northeast Linguistics Society: vol. 30, article 6 (2000). https://scholarworks.umass.edu/nels/vol30/iss2/6. Accessed 14 Sept 2023
Chen, Q., Zhuo, Z., Wang, W.: BERT for Joint Intent Classification and Slot Filling. ArXiv (2019)
Malviya, S., Mishra, R., Barnwal, S.K., Tiwary, U.S.: HDRS: Hindi dialogue restaurant search corpus for dialogue state tracking in task-oriented environment. IEEE/ACM Trans. Audio, Speech, Lang. Process. 29, 2517–2528 (2021)
Lee, H., Lee, J., Kim, T.Y.: SUMBT: slot-utterance matching for universal and scalable belief tracking. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 5478–5483. Association for Computational Linguistics, Florence, Italy (2019)
Ma, Z., Sun, B., Li, S.: A two-stage selective fusion framework for joint intent detection and slot filling. IEEE Trans. Neural Netw. Learn. Syst. (2022). https://doi.org/10.1109/TNNLS.2022.3202562
Wu, J., Harris, I.G., Zhao, H., Ling, G.: A graph-to-sequence model for joint intent detection and slot filling. In: 2023 IEEE 17th International Conference on Semantic Computing (ICSC), pp. 131–138. Laguna Hills, CA, USA (2023)
Ma, X., Hovy, E.: End-to-end sequence labeling via Bi-directional LSTM-CNNs-CRF. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1064–1074. Berlin, Germany. Association for Computational Linguistics (2016)
Mrkšić, N., Séaghdha, D.O., Wen, T.H., Thomson, B., Young, S.: Neural belief tracker: data-driven dialogue state tracking. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics vol. 1: Long Papers, pp. 1777–1788, Vancouver, Canada. Association for Computational Linguistics (2017)
Coucke, A., et al.: Snips voice platform: an embedded spoken language understanding system for private-by-design voice interfaces. arXiv:1805.10190 (2018)
Hemphill, C.T., Godfrey, J.J., Doddington, G.R.: The ATIS spoken language systems pilot corpus. Speech and natural language. In: Proceedings of a Workshop Held at Hidden Valley, Pennsylvania, June 24–27 (1990)
Ramaneswaran, S., Vijay, S., Srinivasan, K.: TamilATIS: dataset for task-oriented dialog in Tamil. In: Proceedings of the Second Workshop on Speech and Language Technologies for Dravidian Languages (2022)
Kane, B., Rossi, F., Guinaudeau, O., Chiesa, V., Quénel, I., Chau, S.: Joint intent detection and slot filling via CNN-LSTM-CRF. In: 2020 6th IEEE Congress on Information Science and Technology (CiSt), Agadir - Essaouira, Morocco, pp. 342–347 (2020)
Lafferty, J., McCallum, A., Pereira, F.C.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: ICML 2001: Proceedings of the Eighteenth International Conference on Machine Learning, pp. 282–289 (2001)
American Psychiatric Association: Diagnostic and statistical manual of mental disorders (5th ed., text rev.) (2022)
World Health Organization: International Classification of Diseases, Eleventh Revision (ICD-11) (2019/2021). https://icd.who.int/browse1
World Health Organization: Doing What Matters in Times of Stress: An Illustrated Guide. World Health Organization (WHO) (2020)
Zhang, X., Zhao, J., LeCun, Y.: Character-level convolutional networks for text classification. Courant Institute of Mathematical Sciences, New York University (2015)
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Kim, Y., Jernite, Y., Sontag, D., Rush, A.M.: arXiv:1508.06615 (2015)
National Institute of Mental Health and Neurosciences Bengaluru, National Mental Health Survey of India, 2015–16: Mental Health Systems. (2015–2016)

Author information

Authors and Affiliations

Banaras Hindu University, Varanasi, 221005, India
Shaurya Prakash & Manoj Kumar Singh
Indian Institute of Information Technology, Prayagraj, 211015, India
Uma Shanker Tiwary
Institute of Medical Sciences, Banaras Hindu University, Varanasi, 221005, India
Mona Srivastava

Corresponding author

Correspondence to Manoj Kumar Singh.

Editor information

Editors and Affiliations

Soongsil University, Seoul, Korea (Republic of)
Bong Jun Choi
Saint Louis University, St. Louis, MO, USA
Dhananjay Singh
Indian Institute of Information Technology, Allahabad, India
Uma Shanker Tiwary
Pukyong National University, Busan, Korea (Republic of)
Wan-Young Chung

About this paper

Cite this paper

Prakash, S., Singh, M.K., Tiwary, U.S., Srivastava, M. (2024). HUCMD: Hindi Utterance Corpus for Mental Disorders. In: Choi, B.J., Singh, D., Tiwary, U.S., Chung, WY. (eds) Intelligent Human Computer Interaction. IHCI 2023. Lecture Notes in Computer Science, vol 14531. Springer, Cham. https://doi.org/10.1007/978-3-031-53827-8_5

DOI: https://doi.org/10.1007/978-3-031-53827-8_5
Published: 29 February 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-53826-1
Online ISBN: 978-3-031-53827-8
eBook Packages: Computer Science, Computer Science (R0)
