Joint Channel, Power and Bandwidth Optimization for Anti-jamming Communications: A Multi-agent Q-learning Approach | IEEE Conference Publication | IEEE Xplore