Temporally consistent caption detection in videos using a spatiotemporal 3D method | IEEE Conference Publication | IEEE Xplore