Enhancing Video Summarization via Vision-Language Embedding | IEEE Conference Publication | IEEE Xplore