References
Li Y F, Liang D M. Safe semi-supervised learning: a brief introduction. Frontiers of Computer Science, 2019, 13(4): 669–676
Ji Z, Ni J, Liu X, Pang Y. Teachers cooperation: team-knowledge distillation for multiple cross-domain few-shot learning. Frontiers of Computer Science, 2023, 17(2): 172312
Nam H, Kim S H, Ko B Y, Park Y H. Frequency dynamic convolution: frequency-adaptive pattern recognition for sound event detection. In: Proceedings of the 23rd Annual Conference of the International Speech Communication Association. 2022, 2763–2767
Xiao S, Zhang X, Zhang P. Multi-dimensional frequency dynamic convolution with confident mean teacher for sound event detection. In: Proceedings of ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2023, 1–5
Chen S, Wu Y, Wang C, Liu S, Tompkins D, Chen Z, Che W, Yu X, Wei F. BEATs: audio pre-training with acoustic tokenizers. In: Proceedings of the 40th International Conference on Machine Learning. 2023, 5178–5193
Chen Y, Dai X, Liu M, Chen D, Yuan L, Liu Z. Dynamic convolution: attention over convolution kernels. In: Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020, 11027–11036
Li K, Cai P, Song Y. Li USTC team’s submission for DCASE 2023 challenge task4a. Technical Report, DCASE2023 Challenge, 2023
Li K, Song Y, Dai L R, McLoughlin I, Fang X, Liu L. AST-SED: an effective sound event detection method based on audio spectrogram transformer. In: Proceedings of ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2023, 1–5
Acknowledgements
This work was supported by the Zhejiang Provincial Key R&D Program (Nos. 2024C01108, 2023C01030, 2023C01034), the Hangzhou Key R&D Program (Nos. 2023SZD0046, 2024SZD1A03), and the Ningbo Key R&D Program (No. 2024Z114).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests The authors declare that they have no competing interests or financial conflicts to disclose.
Electronic Supplementary Material
Rights and permissions
About this article
Cite this article
Zhang, D., Wu, S., Lu, Z. et al. Improving sound event detection through enhanced feature extraction and attention mechanisms. Front. Comput. Sci. 19, 1910707 (2025). https://doi.org/10.1007/s11704-025-41108-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11704-025-41108-7