Abstract
The autodidactic iteration algorithm was designed to find approximate solutions to combinatorial optimization puzzles such as the prediction of protein tertiary structures, which in turn gives a better understanding of a protein's function in an organism. The Rubik's cube is selected as a stand-in for such combinatorial optimization puzzles. Autodidactic iteration is suited to environments with large state spaces and sparse rewards, of which the Rubik's cube is a classic example. This research solves a classic Rubik's cube by implementing the autodidactic iteration algorithm, a reinforcement learning method that teaches itself to solve the cube without human assistance. A neural network model is trained to solve a scrambled Rubik's cube using Keras as the deep learning library. The cube is rendered in a graphical user interface (GUI) built with Magic Cube, where the user can turn the cube's faces and change the viewing angle. The interactive cube is then solved using the trained neural network model.
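The core idea of autodidactic iteration can be sketched in a few lines: training states are generated by scrambling backwards from the solved state, and each state's value target is computed by a one-step lookahead over its children. The sketch below is illustrative only and makes several simplifying assumptions: it uses a toy cyclic puzzle instead of a real Rubik's cube representation, and a tabular value function in place of the Keras neural network described in the paper; all names (`adi_targets`, `train_adi`, `greedy_solve`) are hypothetical, not from the source.

```python
import random

# Toy stand-in for the Rubik's cube: a cyclic puzzle with N positions,
# two moves (+1 / -1 mod N), and position 0 as the solved state.
N = 12
MOVES = (+1, -1)
SOLVED = 0

def apply_move(state, move):
    return (state + move) % N

def adi_targets(states, value):
    """Autodidactic-iteration targets: one-step lookahead from each state."""
    targets = {}
    for s in states:
        best = float("-inf")
        for m in MOVES:
            child = apply_move(s, m)
            reward = 1.0 if child == SOLVED else -1.0
            # Solved children contribute only their reward (terminal state).
            best = max(best, reward + (0.0 if child == SOLVED else value[child]))
        targets[s] = best
    return targets

def train_adi(iterations=200, scramble_len=2 * N, lr=0.5, seed=0):
    rng = random.Random(seed)
    value = [0.0] * N  # tabular surrogate for the neural network
    for _ in range(iterations):
        # Generate training states by scrambling backwards from solved.
        s, states = SOLVED, []
        for _ in range(scramble_len):
            s = apply_move(s, rng.choice(MOVES))
            states.append(s)
        # Move the value table toward the lookahead targets
        # (a stand-in for one gradient step of the network).
        for st, tgt in adi_targets(states, value).items():
            value[st] += lr * (tgt - value[st])
    return value

def greedy_solve(state, value, max_steps=50):
    """Follow the learned value function greedily back to the solved state."""
    for step in range(max_steps):
        if state == SOLVED:
            return step
        state = max((apply_move(state, m) for m in MOVES),
                    key=lambda c: 1.0 if c == SOLVED else value[c])
    return None
```

Because rewards are sparse (only the solved state pays off), scrambling backwards from the solved state is what lets the value signal propagate outward; this is the same reason the technique suits large, sparse-reward state spaces like the Rubik's cube.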
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
Cite this paper
Wen, K.Y.K., Wahab, M.N.A., Seng, Y.W., Chuan, W.C. (2022). Unraveling the Rubik’s Cube with Autodidactic Iteration Algorithm. In: Mahyuddin, N.M., Mat Noor, N.R., Mat Sakim, H.A. (eds) Proceedings of the 11th International Conference on Robotics, Vision, Signal Processing and Power Applications. Lecture Notes in Electrical Engineering, vol 829. Springer, Singapore. https://doi.org/10.1007/978-981-16-8129-5_54
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-8128-8
Online ISBN: 978-981-16-8129-5
eBook Packages: Intelligent Technologies and Robotics (R0)