Cited By
View all- Li YHou XDezhi ZShen LZhao ZCai JKankanhalli MPrabhakaran BBoll SSubramanian RZheng LSingh VCesar PXie LXu D(2024)FLIP-80M: 80 Million Visual-Linguistic Pairs for Facial Language-Image Pre-TrainingProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3681287(58-67)Online publication date: 28-Oct-2024
- Zhang RWang HDu MLiu HZhou YZeng QEl Saddik AMei TCucchiara RBertini MTobon Vallejo DAtrey PHossain M(2023)UMMAFormer: A Universal Multimodal-adaptive Transformer Framework for Temporal Forgery LocalizationProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3613767(8749-8759)Online publication date: 26-Oct-2023
- He HWang TYang HFu JYuan NYin JChao HZhang QEl Saddik AMei TCucchiara RBertini MTobon Vallejo DAtrey PHossain M(2023)Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement LearningProceedings of the 31st ACM International Conference on Multimedia10.1145/3581783.3612595(6831-6840)Online publication date: 26-Oct-2023