Cited By
- Yuan T, Zhang X, Liu B, Liu K, Jin J, Jiao Z (2025). Surveillance Video-and-Language Understanding: From Small to Large Multimodal Models. IEEE Transactions on Circuits and Systems for Video Technology 35:1 (300-314). doi:10.1109/TCSVT.2024.3462433. Online publication date: 1-Jan-2025.
- Wu J, Feng Y, Xu H, Zhu C, Zheng J, Wooldridge M, Dy J, Natarajan S (2024). SyFormer. Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial Intelligence (6021-6029). doi:10.1609/aaai.v38i6.28417. Online publication date: 20-Feb-2024.
- Lin J, Wang Y (2024). TSFormer: Tracking Structure Transformer for Image Inpainting. ACM Transactions on Multimedia Computing, Communications, and Applications 20:12 (1-23). doi:10.1145/3696452. Online publication date: 20-Sep-2024.