Abstract
Large Language Models (LLMs) have found extensive use across diverse applications thanks to their broad capabilities and proficiency at following instructions. When deployed as chatbots for emotional support, they are frequently expected to show empathy. To date, however, their performance remains unsatisfactory because they lack a deep understanding of the user's underlying issues. We therefore introduce Empathizing Before Generation (EBG), a two-stage learning framework that has an LLM perform chain-of-thought (CoT) analysis before generating a response. The framework infers which of 24 emotions is conveyed in the user's text and then produces empathetic, high-quality, and appropriate responses. We construct a CoT version of a publicly available sentiment dialogue dataset for sentiment inference and use it to train the two layers of EBG. Experiments show that models equipped with EBG outperform other models in demonstrating empathy, achieving 98.2% and 92.8% accuracy on emotional attributes and emotion labels, respectively. The framework also yields notable gains in following CoT instructions, inferring emotions, and generating responses that are more satisfactory than those of other models.
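To make the two-stage idea concrete, the following minimal Python sketch illustrates an "empathize before generation" pipeline: a first pass asks the model to reason step by step about the user's emotion and commit to a label, and a second pass conditions the reply on that analysis. This is an illustrative assumption of how such a pipeline could be wired, not the paper's implementation; `llm_generate`, `EMOTION_LABELS`, and the prompt wording are hypothetical placeholders, and only a subset of the 24-label emotion set is shown.

```python
# Minimal sketch of a two-stage "empathize before generation" pipeline.
# `llm_generate` is a hypothetical stand-in for any chat-completion call;
# the emotion list and prompt wording are illustrative, not the paper's exact ones.

from typing import Callable, Tuple

EMOTION_LABELS = [
    "joyful", "grateful", "hopeful", "content", "proud", "excited",
    "sad", "lonely", "anxious", "afraid", "angry", "disappointed",
    # ... the paper uses a 24-label set; only a subset is listed here.
]

def empathize_before_generation(
    user_text: str,
    llm_generate: Callable[[str], str],
) -> Tuple[str, str]:
    """Stage 1: chain-of-thought emotion inference. Stage 2: response generation."""
    # Stage 1: ask the model to reason step by step about the user's emotional
    # state and commit to one label from the candidate set.
    cot_prompt = (
        "Analyze the speaker's situation step by step, then name the single "
        f"emotion (one of {', '.join(EMOTION_LABELS)}) that best fits.\n\n"
        f"Speaker: {user_text}\nAnalysis:"
    )
    emotion_analysis = llm_generate(cot_prompt)

    # Stage 2: condition the reply on the inferred emotion analysis so the
    # response acknowledges the feeling before offering support.
    response_prompt = (
        f"Speaker: {user_text}\n"
        f"Emotional analysis: {emotion_analysis}\n"
        "Write a short, empathetic reply that first validates this emotion "
        "and then offers appropriate support.\nReply:"
    )
    reply = llm_generate(response_prompt)
    return emotion_analysis, reply
```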
Jiahao Zhu and Zijian Jiang contributed equally to this work.
Copyright information
© 2025 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Zhu, J., Jiang, Z., Zhou, B., Su, J., Zhang, J., Li, Z. (2025). Empathizing Before Generation: A Double-Layered Framework for Emotional Support LLM. In: Lin, Z., et al. Pattern Recognition and Computer Vision. PRCV 2024. Lecture Notes in Computer Science, vol 15032. Springer, Singapore. https://doi.org/10.1007/978-981-97-8490-5_35
DOI: https://doi.org/10.1007/978-981-97-8490-5_35
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-8489-9
Online ISBN: 978-981-97-8490-5