Multiple spatio-temporal scales neural network for contextual visual recognition of human actions | IEEE Conference Publication | IEEE Xplore