File size: 1,508 Bytes
43c0b9d
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
54dded2
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
---
tags:
- CartPole-v1
- reinforce
- reinforcement-learning
- custom-implementation
- deep-rl-class
model-index:
- name: CartPole-v1
  results:
  - task:
      type: reinforcement-learning
      name: reinforcement-learning
    dataset:
      name: CartPole-v1
      type: CartPole-v1
    metrics:
    - type: mean_reward
      value: 500.00 +/- 0.00
      name: mean_reward
      verified: false
---

  # **Reinforce** Agent playing **CartPole-v1**
  I have used Reinforcement learning in a game, Cart Pole. The aim is to keep the equilibrium by moving left/right. While training, the game uses its results/rewards to modify its parameters to get more rewards. Specifically, the model learns what kind of tactics let the cart pole balance, and as it fails, it learns and applies those tactics to balance the cart pole. 

Some links I've found helpful include:
https://huggingface.co/learn/deep-rl-course/en/unit0/introduction#certification-process
https://colab.research.google.com/github/huggingface/deep-rl-class/blob/master/notebooks/unit4/unit4.ipynb#scrollTo=NCNvyElRStWG
https://spinningup.openai.com/en/latest/spinningup/rl_intro3.html#don-t-let-the-past-distract-you
https://stable-baselines3.readthedocs.io/en/master/guide/rl_tips.html
https://gymnasium.farama.org/content/migration-guide/
https://github.com/enerrio/CartPole-Reinforcement-Learning
https://www.ibm.com/topics/overfitting
https://learningds.org/ch/04/modeling_loss_functions.html
https://www.geeksforgeeks.org/reinforce-algorithm/