Cited By
View all- Zhang MLuo GMa YLi SQian ZZhang XEl Saddik AMei TCucchiara RBertini MTobon Vallejo DAtrey PHossain M(2023)VCMaster: Generating Diverse and Fluent Live Video Comments Based on Multimodal ContextsProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3612078(4688-4696)Online publication date: 26-Oct-2023
- Xin BXu NZhai YZhang TLu ZLiu JNie WLi XLiu A(2023)A comprehensive survey on deep-learning-based visual captioningMultimedia Systems10.1007/s00530-023-01175-x29:6(3781-3804)Online publication date: 21-Sep-2023
- Mei TZhang WYao T(2020)Vision and language: from visual perception to content creationAPSIPA Transactions on Signal and Information Processing10.1017/ATSIP.2020.109:1Online publication date: 2020
- Show More Cited By