Implicitly using Human Skeleton in Self-supervised Learning: Influence on Spatio-temporal Puzzle Solving and on Video Action Recognition
Topics: Deep Learning; Image and Video Processing; Robot Vision; Scene Analysis and Understanding
In Proceedings of the 2nd International Conference on Robotics, Computer Vision and Intelligent Systems ROBOVIS - Volume 1, 128-135, 2021