Cited By
- Wu K, Luo W, Xie Z, Guo D, Zhang Z, Hong R (2025) Ensemble Prototype Network For Weakly Supervised Temporal Action Localization. IEEE Transactions on Neural Networks and Learning Systems 36:3 (4560-4574). DOI: 10.1109/TNNLS.2024.3377468. Online publication date: Mar-2025
- Song P, Zhou Y, Yang X, Liu D, Hu Z, Wang D, Wang M (2024) Efficiently Gluing Pre-Trained Language and Vision Models for Image Captioning. ACM Transactions on Intelligent Systems and Technology 15:6 (1-16). DOI: 10.1145/3682067. Online publication date: 29-Jul-2024
- Sun J, Song P, Zhang J, Guo D (2024) Syntax-Controllable Video Captioning with Tree-Structural Syntax Augmentation. Proceedings of the 2024 2nd Asia Conference on Computer Vision, Image Processing and Pattern Recognition (1-7). DOI: 10.1145/3663976.3664004. Online publication date: 26-Apr-2024