Suggestion of probabilistic reward-independent knowledge for dynamic environment in reinforcement learning | IEEE Conference Publication | IEEE Xplore