Ada-SwinBERT: Adaptive Token Selection for Efficient Video Captioning with Online Self-Distillation | IEEE Conference Publication | IEEE Xplore