Video representation learning through prediction for online object detection | IEEE Conference Publication | IEEE Xplore