Multicar elevator group control: Average reward learning method for service completion time reduction and interference prevention | IEEE Conference Publication | IEEE Xplore