Discrete reactive power optimization considering safety margin by dimensional Q-learning | IEEE Conference Publication | IEEE Xplore