Learning Continuous 3-DoF Air-to-Air Close-in Combat Strategy using Proximal Policy Optimization | IEEE Conference Publication | IEEE Xplore