A new methodology for calculating distributions of reward accumulated during a finite interval | IEEE Conference Publication | IEEE Xplore