hterrebrood committed on
Commit 54dded2 · verified · 1 Parent(s): 0b0d62e

Update README.md

Files changed (1)
  1. README.md +12 -3
README.md CHANGED
@@ -22,6 +22,15 @@ model-index:
  ---

  # **Reinforce** Agent playing **CartPole-v1**
- This is a trained model of a **Reinforce** agent playing **CartPole-v1** .
- To learn to use this model and train yours check Unit 4 of the Deep Reinforcement Learning Course: https://huggingface.co/deep-rl-course/unit4/introduction
-
+ I used reinforcement learning to train an agent on the CartPole game. The aim is to keep the pole balanced by moving the cart left or right. During training, the agent uses the rewards it receives to adjust its policy parameters so that it collects more reward. Specifically, the model learns which actions keep the pole balanced, and each time it fails it updates its policy and applies what it learned in later episodes.
+
+ Some links I've found helpful include:
+ https://huggingface.co/learn/deep-rl-course/en/unit0/introduction#certification-process
+ https://colab.research.google.com/github/huggingface/deep-rl-class/blob/master/notebooks/unit4/unit4.ipynb#scrollTo=NCNvyElRStWG
+ https://spinningup.openai.com/en/latest/spinningup/rl_intro3.html#don-t-let-the-past-distract-you
+ https://stable-baselines3.readthedocs.io/en/master/guide/rl_tips.html
+ https://gymnasium.farama.org/content/migration-guide/
+ https://github.com/enerrio/CartPole-Reinforcement-Learning
+ https://www.ibm.com/topics/overfitting
+ https://learningds.org/ch/04/modeling_loss_functions.html
+ https://www.geeksforgeeks.org/reinforce-algorithm/
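
The README text in this commit says the agent uses its rewards to adjust its parameters. In REINFORCE, that signal is usually the discounted return from each step to the end of the episode. The following is a minimal sketch in plain Python, not the code used to train this model. The helper names `discounted_returns` and `normalize_returns` and the default `gamma=0.99` are assumptions made for illustration.

```python
def discounted_returns(rewards, gamma=0.99):
    """Return the discounted reward-to-go G_t for every step of one episode.

    G_t = r_t + gamma * r_{t+1} + gamma^2 * r_{t+2} + ...
    """
    returns = []
    running = 0.0
    # Walk backwards so each step reuses the next one: G_t = r_t + gamma * G_{t+1}
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def normalize_returns(returns, eps=1e-8):
    """Standardize returns to zero mean and unit variance.

    This is a common variance-reduction step, assumed here rather than
    taken from this repository.
    """
    if not returns:
        return []
    mean = sum(returns) / len(returns)
    variance = sum((g - mean) ** 2 for g in returns) / len(returns)
    std = variance ** 0.5
    return [(g - mean) / (std + eps) for g in returns]


if __name__ == "__main__":
    # CartPole-v1 gives a reward of 1 for every step the pole stays up.
    episode_rewards = [1.0, 1.0, 1.0]
    print(discounted_returns(episode_rewards, gamma=0.5))
```

In a full training loop, each return would weight the log-probability of the action taken at that step, so actions followed by long balanced stretches are made more likely.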