Connor202020
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -24,4 +24,50 @@ model-index:
|
|
24 |
# **Reinforce** Agent playing **CartPole-v1**
|
25 |
This is a trained model of a **Reinforce** agent playing **CartPole-v1** .
|
26 |
To learn to use this model and train yours check Unit 4 of the Deep Reinforcement Learning Course: https://huggingface.co/deep-rl-course/unit4/introduction
|
27 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
24 |
# **Reinforce** Agent playing **CartPole-v1**
|
25 |
This is a trained model of a **Reinforce** agent playing **CartPole-v1** .
|
26 |
To learn to use this model and train yours check Unit 4 of the Deep Reinforcement Learning Course: https://huggingface.co/deep-rl-course/unit4/introduction
|
27 |
+
|
28 |
+
to train a great model, you need to modify the hyperparameters
|
29 |
+
|
30 |
+
cartpole_hyperparameters = {
|
31 |
+
"h_size": 64,
|
32 |
+
"n_training_episodes": 2000,
|
33 |
+
"n_evaluation_episodes": 20,
|
34 |
+
"max_t": 1000,
|
35 |
+
"gamma": 0.99,
|
36 |
+
"lr": 1e-3,
|
37 |
+
"env_id": env_id,
|
38 |
+
"state_space": s_size,
|
39 |
+
"action_space": a_size,
|
40 |
+
}
|
41 |
+
|
42 |
+
```
|
43 |
+
Episode 100 Average Score: 29.39
|
44 |
+
Episode 200 Average Score: 40.43
|
45 |
+
Episode 300 Average Score: 62.50
|
46 |
+
Episode 400 Average Score: 140.69
|
47 |
+
Episode 500 Average Score: 257.97
|
48 |
+
Episode 600 Average Score: 385.96
|
49 |
+
Episode 700 Average Score: 444.55
|
50 |
+
Episode 800 Average Score: 471.07
|
51 |
+
Episode 900 Average Score: 425.36
|
52 |
+
Episode 1000 Average Score: 469.43
|
53 |
+
Episode 1100 Average Score: 482.73
|
54 |
+
Episode 1200 Average Score: 479.17
|
55 |
+
Episode 1300 Average Score: 492.68
|
56 |
+
Episode 1400 Average Score: 487.52
|
57 |
+
Episode 1500 Average Score: 485.91
|
58 |
+
Episode 1600 Average Score: 487.56
|
59 |
+
Episode 1700 Average Score: 485.40
|
60 |
+
Episode 1800 Average Score: 494.59
|
61 |
+
Episode 1900 Average Score: 488.71
|
62 |
+
Episode 2000 Average Score: 493.33
|
63 |
+
Episode 2100 Average Score: 496.70
|
64 |
+
Episode 2200 Average Score: 498.07
|
65 |
+
Episode 2300 Average Score: 498.38
|
66 |
+
Episode 2400 Average Score: 476.29
|
67 |
+
Episode 2500 Average Score: 485.02
|
68 |
+
Episode 2600 Average Score: 481.23
|
69 |
+
Episode 2700 Average Score: 498.21
|
70 |
+
Episode 2800 Average Score: 500.00
|
71 |
+
Episode 2900 Average Score: 496.20
|
72 |
+
Episode 3000 Average Score: 494.15
|
73 |
+
```
|