Butanium/selfplay_ppo_pong_v3_pettingzoo_cleanRL

Reinforcement Learning


Butanium committed on Dec 4, 2023

Commit 9e0babc (1 parent: 22947cb)

Update README.md

Files changed (1)

README.md +3 -1


@@ -8,4 +8,6 @@ PPO agents trained in a selfplay settings. The agent were trained on observation
 4 experiments:
 - Shared weights for actor and critic
 - No shared weights
-- Resume training for extra steps for both shared and no shared setup

+- Resume training for extra steps for both shared and no shared setup
+Please check our [wandb report](https://wandb.ai/dumas/SPAR_RL_ELK/) for more details