File size: 1,508 Bytes
43c0b9d 54dded2 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 |
---
tags:
- CartPole-v1
- reinforce
- reinforcement-learning
- custom-implementation
- deep-rl-class
model-index:
- name: CartPole-v1
results:
- task:
type: reinforcement-learning
name: reinforcement-learning
dataset:
name: CartPole-v1
type: CartPole-v1
metrics:
- type: mean_reward
value: 500.00 +/- 0.00
name: mean_reward
verified: false
---
# **Reinforce** Agent playing **CartPole-v1**
I have used Reinforcement learning in a game, Cart Pole. The aim is to keep the equilibrium by moving left/right. While training, the game uses its results/rewards to modify its parameters to get more rewards. Specifically, the model learns what kind of tactics let the cart pole balance, and as it fails, it learns and applies those tactics to balance the cart pole.
Some links I've found helpful include:
https://huggingface.co/learn/deep-rl-course/en/unit0/introduction#certification-process
https://colab.research.google.com/github/huggingface/deep-rl-class/blob/master/notebooks/unit4/unit4.ipynb#scrollTo=NCNvyElRStWG
https://spinningup.openai.com/en/latest/spinningup/rl_intro3.html#don-t-let-the-past-distract-you
https://stable-baselines3.readthedocs.io/en/master/guide/rl_tips.html
https://gymnasium.farama.org/content/migration-guide/
https://github.com/enerrio/CartPole-Reinforcement-Learning
https://www.ibm.com/topics/overfitting
https://learningds.org/ch/04/modeling_loss_functions.html
https://www.geeksforgeeks.org/reinforce-algorithm/
|