Using model uncertainty for robust optimization in approximate inference control

Itoh, Hideaki; Sakai, Yoshitaka; Kadoya, Toru; Fukumoto, Hisao; Wakuya, Hiroshi; Furukawa, Tatsuya

doi:10.1007/s10015-017-0361-6

Using model uncertainty for robust optimization in approximate inference control

Original Article
Published: 27 March 2017

Volume 22, pages 327–335, (2017)
Cite this article

Artificial Life and Robotics Aims and scope Submit manuscript

Hideaki Itoh¹,
Yoshitaka Sakai²,
Toru Kadoya³,
Hisao Fukumoto¹,
Hiroshi Wakuya¹ &
…
Tatsuya Furukawa¹

299 Accesses
2 Citations
Explore all metrics

Abstract

Recently, the optimization-by-inference approach has been proposed as a new means for solving high-dimensional optimization problems quickly. Approximate Inference COntrol (AICO) is one of the most successful and promising methods that implement the optimization-by-inference approach. AICO is able to solve stochastic optimal control problems and has already been successfully used in many applications. However, it is known that the iterative inference of AICO sometimes fails to converge to the optimal solution. To make the optimization more robust, in this paper, we propose to take model uncertainty into account. In AICO, the cost function to be minimized is accurate around a particular state of a given stochastic system, but the accuracy is uncertain in regions far from that state. Because using such an uncertain function is harmful to the convergence, we modify AICO, so that it does not use the function in uncertain regions. Our method is easy to implement and does not add much computational time to the original AICO. Experiments using two different scenarios show that our method substantially improves AICO in terms of the rate at which the algorithm produces convergent results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Model-Free Optimal Control for Linear Systems with State and Control Inequality Constraints

OPTCON3: An Active Learning Control Algorithm for Nonlinear Quadratic Stochastic Problems

Article Open access 09 December 2019

Adaptive Control for Systems with Output Constraints Using an Online Optimization Method

Article 19 August 2014

References

Attias H (2003) Planning by probabilistic inference. In: Proceedings of the ninth international workshop on artificial intelligence and statistics
Verma D, Rao RP (2005) Goal-based imitation as probabilistic inference over graphical models. In: Advances in neural information processing systems, pp 1393–1400
Toussaint M, Storkey A (2006) Probabilistic inference for solving discrete and continuous state Markov decision processes. In: Proceedings of the 23rd international conference on machine learning, pp 945–952
Toussaint M (2009) Robot trajectory optimization using approximate inference. In: Proceedings of the 26th international conference on machine learning, pp 1049–1056
Kappen HJ, Gómez V, Opper M (2012) Optimal control as a graphical model inference problem. Mach Learn 87(2):159–182
Article MathSciNet MATH Google Scholar
Kumar A, Zilberstein S, Toussaint M (2015) Probabilistic inference techniques for scalable multiagent decision making. J Artif Intell Res 53(1):223–270
MathSciNet MATH Google Scholar
Minka TP (2001), Expectation propagation for approximate Bayesian inference. In: Proceedings of the 17th conference on uncertainty in artificial intelligence, pp 362–369
Rawlik K, Toussaint M, Vijayakumar S (2010), An approximate inference approach to temporal optimization in optimal control. In: Advances in neural information processing systems, pp 2011–2019
Jetchev N, Toussaint M (2013) Fast motion planning from experience: trajectory prediction for speeding up movement generation. Auton Robots 34(1–2):111–127
Article Google Scholar
Ivan V, Zarubin D, Toussaint M, Komura T, Vijayakumar S (2013) Topology-based representations for motion planning and generalization in dynamic environments with interactions. Int J Robot Res 32(9–10):1151–1163
Article Google Scholar
Zarubin D, Pokorny FT, Song D, Toussaint M, Kragic D (2013) Topological synergies for grasp transfer. In: Hand synergies-how to tame the complexity of grapsing, workshop, IEEE international conference on robotics and automation
Kadoya T, Itoh H, Fukumoto H, Wakuya H, Furukawa T (2014) Movement imitation in a humanoid robot with approximate inference control. In: Proceedings of the 19th international symposium on artificial life and robotics, pp 260–263
Watter M, Springenberg J, Boedecker J, Riedmiller M (2015) Embed to control: a locally linear latent dynamics model for control from raw images. In: Advances in neural information processing systems, pp 2728–2736
Toussaint M (2009) Pros and cons of truncated Gaussian EP in the context of approximate inference control. NIPS workshop on probabilistic approaches for robotics and control
Rüeckert E, Mindt M, Peters J, Neumann G (2014) Robust policy updates for stochastic optimal control. In: Proceedings of the 14th IEEE-RAS international conference on humanoid robots (humanoids), pp 388–393
Zarubin D, Ivan V, Toussaint M, Komura T, Vijayakumar S (2012) Hierarchical motion planning in topological representations. In: International conference on robotics science and systems
Rawlik K, Toussaint M, Vijayakumar S (2012) On stochastic optimal control and reinforcement learning by approximate inference. In: International conference on robotics science and systems

Download references

Acknowledgements

We would like to thank the reviewers for their valuable comments. This study was partially supported by the Ministry of Education, Culture, Sports, Science and Technology in Japan, Grant-in-Aid for Scientific Research (C) 15K00341.

Author information

Authors and Affiliations

Department of Electrical and Electronic Engineering, Graduate School of Science and Engineering, Saga University, 1 Honjo-machi, Saga, 840-8502, Japan
Hideaki Itoh, Hisao Fukumoto, Hiroshi Wakuya & Tatsuya Furukawa
Fujitsu Limited, 1-5-2 Higashi-Shimbashi, Minato-ku, Tokyo, 105-7123, Japan
Yoshitaka Sakai
Hitachi Systems, Ltd., 1-2-1 Osaki, Sinagawa-ku, Tokyo, 141-8672, Japan
Toru Kadoya

Authors

Hideaki Itoh
View author publications
You can also search for this author in PubMed Google Scholar
Yoshitaka Sakai
View author publications
You can also search for this author in PubMed Google Scholar
Toru Kadoya
View author publications
You can also search for this author in PubMed Google Scholar
Hisao Fukumoto
View author publications
You can also search for this author in PubMed Google Scholar
Hiroshi Wakuya
View author publications
You can also search for this author in PubMed Google Scholar
Tatsuya Furukawa
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hideaki Itoh.

Additional information

This work was presented in part at the 19th International Symposium on Artificial Life and Robotics, Beppu, Oita, January 22–24, 2014.

About this article

Cite this article

Itoh, H., Sakai, Y., Kadoya, T. et al. Using model uncertainty for robust optimization in approximate inference control. Artif Life Robotics 22, 327–335 (2017). https://doi.org/10.1007/s10015-017-0361-6

Download citation

Received: 04 May 2016
Accepted: 20 February 2017
Published: 27 March 2017
Issue Date: September 2017
DOI: https://doi.org/10.1007/s10015-017-0361-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Using model uncertainty for robust optimization in approximate inference control

Abstract

Access this article

Similar content being viewed by others

Model-Free Optimal Control for Linear Systems with State and Control Inequality Constraints

OPTCON3: An Active Learning Control Algorithm for Nonlinear Quadratic Stochastic Problems

Adaptive Control for Systems with Output Constraints Using an Online Optimization Method

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

About this article

Cite this article

Keywords

Navigation

Using model uncertainty for robust optimization in approximate inference control

Abstract

Access this article

Similar content being viewed by others

Model-Free Optimal Control for Linear Systems with State and Control Inequality Constraints

OPTCON3: An Active Learning Control Algorithm for Nonlinear Quadratic Stochastic Problems

Adaptive Control for Systems with Output Constraints Using an Online Optimization Method

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

About this article

Cite this article

Share this article

Keywords

Search

Navigation