Butanium
/

selfplay_ppo_pong_v3_pettingzoo_cleanRL

Reinforcement Learning

Model card Files Files and versions Metrics Training metrics Community

Butanium commited on Dec 4, 2023

Commit

960c752

·

1 Parent(s): b764de6

Update README.md

Files changed (1) hide show

README.md +9 -1

README.md CHANGED Viewed

@@ -1,3 +1,11 @@
 ---
-license: apache-2.0
 ---

 ---
+pipeline_tag: reinforcement-learning
+tags:
+- ppo
 ---
+PPO agents trained in a selfplay settings. The agent were trained on observation as left player only. This repo include checkpoints collected during training for
+4 experiments:
+- Shared weights for actor and critic
+- No shared weights
+- Resume training for extra steps for both shared and no shared setup