An Approximately Optimal Relative Value Learning Algorithm for Averaged MDPs with Continuous States and Actions

An Approximately Optimal Relative Value Learning Algorithm for Averaged MDPs with Continuous States and Actions | IEEE Conference Publication | IEEE Xplore

IEEE Account

Purchase Details

Profile Information

Need Help?