A Predictive Reward Function for Human-Like Driving Based on a Transition Model of Surrounding Environment | IEEE Conference Publication | IEEE Xplore