Skip to main content

Abstract

The autodidactic iteration algorithm was designed in hopes of finding an approximate solution to combinatorial optimization puzzles such as the prediction of protein tertiary structures. The prediction of protein tertiary structures allows a better understanding of its function in an organism. The Rubik’s cube is selected as a placeholder for the combinatorial optimization puzzle to represent the protein tertiary structures. The autodidactic iteration algorithm is used in environments with large state spaces and sparse rewards such as combinatorial puzzles like the Rubik’s cube. This research to solve a classic Rubik’s cube using the implementation of the autodidactic iteration algorithm is a reinforcement learning algorithm that can teach itself to solve the Rubik’s cube without human assistance. A neural network model is then trained to solve a scrambled Rubik’s cube while implementing Keras as the deep learning library. The Rubik’s cube is generated in a graphical user interface, GUI using Magic Cube. The Rubik’s cube generated in the GUI can be interacted with by turning the faces of the Rubik’s cube and changing the viewing angle of the Rubik’s cube. The interactive Rubik’s cube then uses the neural network model trained based on the model trained to solve a scrambled Rubik’s cube.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 329.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Zhang Jingfen, H.Z.Q.W.: Functional Genomics. Methods Mol. Biol. 815(2), 3–13, 13 3 (2012)

    Google Scholar 

  2. Shmakov, S.M.A.A.: Solving the Rubik’s cube without human knowledge. University of California, Irvine, California (2018)

    Google Scholar 

  3. Williams, R.J.: Simple statistical gradient-following algorithms for connectionist reinforcement learning. Mach. Learn. 109(3), 5–32 (2020)

    Google Scholar 

  4. Cappel, L.M.: What is the difference between Deep Learning and Reinforcement Learning?. BIGDATA-MADESIMPLE, 1 March 2019. [Online]. Available: https://bigdata-madesimple.com/what-is-the-difference-between-deep-learning-and-reinforcement-learning/. Accessed 2020

  5. Choudhary, A.: Reinforcement Learning: Introduction to Monte Carlo Learning using the OpenAI Gym Toolkit. Analytics Vidhya, 26 12 2018. [Online]. Available: https://www.analyticsvidhya.com/blog/2018/11/reinforcement-learning-introduction-monte-carlo-learning-openai-gym/?utm_source=blog&utm_medium=introduction-deep-q-learningp-ython. Accessed 2020

  6. Li, Y.: Deep reinforcement learning: an overview. Introducing Deep Reinforcement Learning 16(10), 14–46 (2018)

    Google Scholar 

  7. Coulom, R.: Efficient selectivity and backup operators in monte-carlo tree search. In: International Conference on Computers and Games, Turin, Italy, 2006

    Google Scholar 

  8. Pierre Baldi, F.A.: Solving the Rubik’s cube with approximate policy iteration. In: International Conference on Learning Representations (ICLR), New Orleans, Louisiana, 2019

    Google Scholar 

  9. Agostinelli, F., McAleer, S., Shmakov, A., Baldi, P.: Solving the Rubik’s cube with deep reinforcement learning. Nat. Mach. Intell. 1, 356–362, 5 8 (2019)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wen, K.Y.K., Wahab, M.N.A., Seng, Y.W., Chuan, W.C. (2022). Unraveling the Rubik’s Cube with Autodidactic Iteration Algorithm. In: Mahyuddin, N.M., Mat Noor, N.R., Mat Sakim, H.A. (eds) Proceedings of the 11th International Conference on Robotics, Vision, Signal Processing and Power Applications. Lecture Notes in Electrical Engineering, vol 829. Springer, Singapore. https://doi.org/10.1007/978-981-16-8129-5_54

Download citation

Publish with us

Policies and ethics