Extreme Trust Region Policy Optimization for Active Object Recognition | IEEE Journals & Magazine | IEEE Xplore