Traffic Optimization in Satellites Communications: A Multi-agent Reinforcement Learning Approach | IEEE Conference Publication | IEEE Xplore