Learning how to play tic-tac-toe with game theory, reinforcement learning... and bunch of matchboxes
-
Updated
Nov 28, 2023 - Python
Learning how to play tic-tac-toe with game theory, reinforcement learning... and bunch of matchboxes
Reinforcement Learning with tabular methods: TD-learning (Q-learning and SARSA) and MENACE-like approach applied to a Rubik's cube with a move set restricted to 180-degree turns.
Do you have too much time available in your life? Do you feel the need to play a childhood retro game? Here it is. :)
Add a description, image, and links to the menace-matchboxes topic page so that developers can more easily learn about it.
To associate your repository with the menace-matchboxes topic, visit your repo's landing page and select "manage topics."