policy
tic-tac-toe-reinforced
alakhsharma22 · PyTorch
or hover any field below to flag it
Overview
Name
tic-tac-toe-reinforced
Author
alakhsharma22
Framework
PyTorch
License
unknown
Skill type
manipulation
Evidence level
untested
Task description
This project implements an AI-powered Tic-Tac-Toe game using Deep Learning and Minimax Algorithm. The AI can either learn from data using Reinforcement Learning or play optimally with the Minimax algorithm. The model is first trained using the imitation learning and then fine tuned by reinforcement
Spaces
Action space
other · 0-dim · 0Hz
Observation space
- type: other
Links
HuggingFace repo
null
Paper (arXiv)
null
Compatible robots
20anybotics-anymal-cnot in seedalohanot in seedgoogle-barkour-vbnot in seedboston-dynamics-spotnot in seedfranka-fr3not in seedgoogle-barkour-v0not in seedagilex-pipernot in seedberkeley-humanoidnot in seedbitcraze-crazyflie-2not in seedanybotics-anymal-bnot in seedagility-cassienot in seedarx-l5not in seedbooster-t1not in seedfranka-emika-pandanot in seedfranka-fr3-v2not in seeddynamixel-2rnot in seedflexiv-rizon4not in seedassetsnot in seedapptronik-apollonot in seedfourier-n1not in seed
Compatible environments
0No environments list tic-tac-toe-reinforced yet.
Datasets that reference this policy
0No datasets reference tic-tac-toe-reinforced yet.