Skip to main content

HUCMD: Hindi Utterance Corpus for Mental Disorders

  • Conference paper
  • First Online:
Intelligent Human Computer Interaction (IHCI 2023)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14531))

Included in the following conference series:

  • 87 Accesses

Abstract

As our knowledge, there is no dialog system for mental health-care domain in Hindi. This may be due to unavailability of user utterances corpora in Hindi for this domain. In this paper, we propose a novel algorithmic approach for user utterance generation in Hindi by considering dialects, linguistic attributes, symptoms, frequency of symptoms, and intensity of symptoms and history of symptoms. We use nine symptoms (anger, emptiness, fear, irritation, restlessness, suicide, sadness, tension, worry) as given in DSM5, ICD-11, and WHO guideline. These symptoms were used for generation of utterances and validation of the generated utterances for different type of mental diseases. We collected utterances by interviewing patients in clinic and found that it closely match to the utterance generated by proposed algorithm. The generated utterance corpus is also validated using machine learning methods in the framework of CNN, Bi-LSTM and Dense.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 59.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 79.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Mesnil, G., et al.: Using recurrent neural networks for slot filling in spoken language understanding. In: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 23, pp. 530–539(2015)

    Google Scholar 

  2. Hanjung, L.: The Emergence of the Unmarked Order in Hindi. Northeast Linguistics Society: vol.30, article 6 (2000). https://scholarworks.umass.edu/nels/vol30/iss2/6. Accessed 14 Sept 2023

  3. Chen, Q., Zhuo, Z., Wang, W.: BERT for Joint Intent Classification and Slot Filling. ArXiv (2019)

    Google Scholar 

  4. Malviya, S., Mishra, R., Barnwal, S.K., Tiwary, U.S.: HDRS: Hindi dialogue restaurant search corpus for dialogue state tracking in task-oriented environment. IEEE/ACM Trans. Audio, Speech, Lang. Process. 29, 2517–2528 (2021)

    Article  Google Scholar 

  5. Lee, H., Lee, J., Kim, T.Y.: SUMBT: slot-utterance matching for universal and scalable belief tracking. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 5478–5483. Association for Computational Linguistics Florence, Italy (2019)

    Google Scholar 

  6. Ma, Z., Sun, B., Li, S.: A two-stage selective fusion framework for joint intent detection and slot filling. IEEE Trans. Neural Netw. Learn. Syst. (2022). https://doi.org/10.1109/TNNLS.2022.3202562

  7. Wu, J., Harris, I. G., Zhao, H., Ling, G.: A graph-to-sequence model for joint intent detection and slot filling. In: 2023 IEEE 17th International Conference on Semantic Computing (ICSC), pp. 131–138. Laguna Hills, CA, USA (2023)

    Google Scholar 

  8. Ma, X., Hovy, E.: End-to-end sequence labeling via Bi-directional LSTM-CNNs-CRF. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1064–1074. Berlin, Germany. Association for Computational Linguistics (2016)

    Google Scholar 

  9. Mrkšić, N., Séaghdha, D.O., Wen, T.H., Thomson, B., Young, S.: Neural belief tracker: data-driven dialogue state tracking. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics vol. 1: Long Papers, pp. 1777–1788, Vancouver, Canada. Association for Computational Linguistics (2017)

    Google Scholar 

  10. Coucke, A., et al.: Snips voice platform: an embedded spoken language understanding system for private-by-design voice interfaces. arXiv:1805.10190 (2018)

  11. Hemphill, C.T., Godfrey, J.J., Doddington, G.R.: The ATIS spoken language systems pilot corpus. Speech and natural language. In: Proceedings of a Workshop Held at Hidden Valley, Pennsylvania, June 24–27 (1990)

    Google Scholar 

  12. Ramaneswaran, S., Vijay, S., Srinivasan, K.: TamilATIS: dataset for task-oriented dialog in Tamil. In: Proceedings of the Second Workshop on Speech and Language Technologies for Dravidian Languages (2022)

    Google Scholar 

  13. Kane, B., Rossi, F., Guinaudeau, O., Chiesa, V., Quénel, I., Chau, S.: Joint intent detection and slot filling via CNN-LSTM-CRF. In: 2020 6th IEEE Congress on Information Science and Technology (CiSt), Agadir - Essaouira, Morocco, pp. 342–347. (2020)

    Google Scholar 

  14. Lafferty, J., McCallum, A., Pereira, F.C.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: ICML 2001: Proceedings of the Eighteenth International Conference on Machine Learning, pp. 282–289 (2001)

    Google Scholar 

  15. American Psychiatric Association.Diagnostic and statistical manual of mental disorders (5th ed., text rev.) (2022)

    Google Scholar 

  16. International Classification of Diseases, Eleventh Revision (ICD-11), World Health Organization (WHO) https://icd.who.int/browse1 International Classification of Diseases, Eleventh Revision (ICD-11), World Health Organization (WHO) (2019/2021)

  17. World Health Organization: Doing What Matters in Time of Stress -An Illustrated Guide. World Health Organization (WHO) (2020)

    Google Scholar 

  18. Zhang, X., Zhao, J., LeCun, Y.: Character-level convolutional networks for text classification Courant Institute of Mathematical Sciences, New York University 719 Broadway, 12th Floor, New York, NY 10003 (2015)

    Google Scholar 

  19. Hochreiter, S., Schmidhuber, J.: Long short term memory. Neural Comput. 9(8), 1735–1780 (1997)

    Article  Google Scholar 

  20. Kim, Y., Jernite, Y., Sontag, D., Rush, A.M.: arXiv:1508.06615 (2015)

  21. National Institute of Mental Health and Neurosciences Bengaluru, National Mental Health Survey of India, 2015–16: Mental Health Systems. (2015–2016)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Manoj Kumar Singh .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Prakash, S., Singh, M.K., Tiwary, U.S., Srivastava, M. (2024). HUCMD: Hindi Utterance Corpus for Mental Disorders. In: Choi, B.J., Singh, D., Tiwary, U.S., Chung, WY. (eds) Intelligent Human Computer Interaction. IHCI 2023. Lecture Notes in Computer Science, vol 14531. Springer, Cham. https://doi.org/10.1007/978-3-031-53827-8_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-53827-8_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-53826-1

  • Online ISBN: 978-3-031-53827-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics