Update README.md
Browse files
README.md
CHANGED
@@ -8,4 +8,6 @@ PPO agents trained in a selfplay settings. The agent were trained on observation
|
|
8 |
4 experiments:
|
9 |
- Shared weights for actor and critic
|
10 |
- No shared weights
|
11 |
-
- Resume training for extra steps for both shared and no shared setup
|
|
|
|
|
|
8 |
4 experiments:
|
9 |
- Shared weights for actor and critic
|
10 |
- No shared weights
|
11 |
+
- Resume training for extra steps for both shared and no shared setup
|
12 |
+
|
13 |
+
Please check our [wandb report](https://wandb.ai/dumas/SPAR_RL_ELK/) for more details
|