Profit sharing that can learn deterministic policy for POMDPs environments | IEEE Conference Publication | IEEE Xplore