Multi-modal human action recognition using deep neural networks fusing image and inertial sensor data | IEEE Conference Publication | IEEE Xplore