Learning compound multi-step controllers under unknown dynamics | IEEE Conference Publication | IEEE Xplore