Dynamic Graph-Guided Transferable Regression for Cross-Domain Speech Emotion Recognition

Jiang, Shenjie; Song, Peng; Wang, Run; Li, Shaokai; Zheng, Wenming

doi:10.1007/978-981-99-8565-4_22

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14463)

Included in the following conference series: Chinese Conference on Biometric Recognition

Abstract

To address the problem of cross-domain speech emotion recognition (SER), we propose a novel dynamic graph-guided transferable regression (DGTR) method. Specifically, a retargeted discriminant linear regression is applied in the source domain to make the projection matrix discriminative. Meanwhile, an adaptive maximum entropy graph is designed to measure similarity across domains. Experiments on four popular datasets show that our method achieves better performance than several related state-of-the-art methods.
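
The abstract does not give the formulation of the adaptive maximum entropy graph, so the following is only a minimal sketch of one common entropy-regularized similarity graph, not the authors' DGTR objective. It assumes squared Euclidean distances between source and target samples and a hypothetical regularization weight `gamma`; the function name `max_entropy_graph` and the random features are likewise illustrative assumptions. Under these assumptions, each row of the similarity matrix has a closed-form softmax solution.

```python
import numpy as np

def max_entropy_graph(X_src, X_tgt, gamma=1.0):
    """Row-stochastic cross-domain similarity (illustrative assumption).

    For each source sample i, this solves
        min_s  sum_j s_ij * d_ij + gamma * sum_j s_ij * log(s_ij)
        s.t.   sum_j s_ij = 1,  s_ij >= 0,
    whose closed-form solution is s_ij proportional to exp(-d_ij / gamma).
    """
    # Squared Euclidean distance between every source/target pair.
    d = ((X_src[:, None, :] - X_tgt[None, :, :]) ** 2).sum(axis=2)
    logits = -d / gamma
    # Subtract the row maximum for numerical stability before exponentiating.
    logits -= logits.max(axis=1, keepdims=True)
    S = np.exp(logits)
    # Normalize each row so the similarities sum to one.
    return S / S.sum(axis=1, keepdims=True)

# Hypothetical stand-ins for source- and target-domain acoustic features.
rng = np.random.default_rng(0)
X_src = rng.normal(size=(5, 3))
X_tgt = rng.normal(size=(4, 3))
S = max_entropy_graph(X_src, X_tgt, gamma=0.5)
```

In this sketch a larger `gamma` pushes each row toward a uniform distribution, while a smaller `gamma` concentrates the weight on the nearest target sample. An adaptive variant would recompute the graph as the learned projection changes, but how the paper does this is not stated in the abstract.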


Acknowledgment

This research was supported by the Natural Science Foundation of Shandong Province under Grants ZR2023MF063 and ZR2022MF314, and by the National Natural Science Foundation of China under Grant 61703360.

Author information

Authors and Affiliations

School of Computer and Control Engineering, Yantai University, Yantai, 264005, China
Shenjie Jiang, Peng Song, Run Wang & Shaokai Li
The State Key Laboratory of Tibetan Intelligent Information Processing and Application, Xining, 810008, China
Shaokai Li
Tibetan Information Processing and Machine Translation Key Laboratory of Qinghai Province, Xining, 810008, China
Shaokai Li
Key Laboratory of Child Development and Learning Science of Ministry of Education, Southeast University, Nanjing, 210096, China
Wenming Zheng

Corresponding author

Correspondence to Peng Song.

Editor information

Editors and Affiliations

Hefei University of Technology, Hefei, China
Wei Jia
South China University of Technology, Guangzhou, China
Wenxiong Kang
China University of Mining and Technology, Xuzhou, China
Zaiyu Pan
Shandong University, Jinan, China
Xianye Ben
China University of Mining and Technology, Xuzhou, China
Zhengfu Bian
Southern University of Science and Technology, Shenzhen, China
Shiqi Yu
Chinese Academy of Sciences, Beijing, China
Zhaofeng He
China University of Mining and Technology, Xuzhou, China
Jun Wang

About this paper

Cite this paper

Jiang, S., Song, P., Wang, R., Li, S., Zheng, W. (2023). Dynamic Graph-Guided Transferable Regression for Cross-Domain Speech Emotion Recognition. In: Jia, W., et al. Biometric Recognition. CCBR 2023. Lecture Notes in Computer Science, vol 14463. Springer, Singapore. https://doi.org/10.1007/978-981-99-8565-4_22

DOI: https://doi.org/10.1007/978-981-99-8565-4_22
Published: 02 December 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8564-7
Online ISBN: 978-981-99-8565-4
eBook Packages: Computer Science; Computer Science (R0)
