Butanium committed on
Commit
960c752
1 Parent(s): b764de6

Update README.md

Files changed (1)
  1. README.md +9 -1
README.md CHANGED
@@ -1,3 +1,11 @@
  ---
- license: apache-2.0
+ pipeline_tag: reinforcement-learning
+ tags:
+ - ppo
  ---
+
+ PPO agents trained in a self-play setting. The agents were trained on observations as the left player only. This repo includes checkpoints collected during training for
+ 4 experiments:
+ - Shared weights for actor and critic
+ - No shared weights
+ - Resumed training for extra steps, for both the shared and non-shared setups
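
The difference between the shared and non-shared setups listed above can be illustrated with a minimal sketch. It is not taken from this repository: it uses plain Python rather than a deep-learning framework, and the class name `ActorCritic`, the `shared` flag, the helper functions, and the layer sizes are all hypothetical choices made for illustration.

```python
import random


def make_layer(n_in, n_out, rng):
    """Return an n_out x n_in weight matrix with small random entries."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]


def forward_layer(weights, x, relu=False):
    """Apply a bias-free linear layer, optionally followed by ReLU."""
    out = [sum(w * xi for w, xi in zip(row, x)) for row in weights]
    return [max(0.0, o) for o in out] if relu else out


class ActorCritic:
    """Toy actor-critic; `shared` selects one trunk or two (hypothetical API)."""

    def __init__(self, obs_dim, hidden_dim, n_actions, shared, seed=0):
        rng = random.Random(seed)
        self.shared = shared
        self.actor_trunk = make_layer(obs_dim, hidden_dim, rng)
        # Shared setup: the critic reuses the very same trunk weights.
        # Non-shared setup: the critic gets its own independent trunk.
        self.critic_trunk = (
            self.actor_trunk if shared else make_layer(obs_dim, hidden_dim, rng)
        )
        self.policy_head = make_layer(hidden_dim, n_actions, rng)
        self.value_head = make_layer(hidden_dim, 1, rng)

    def forward(self, obs):
        """Return (action logits, state-value estimate) for one observation."""
        actor_features = forward_layer(self.actor_trunk, obs, relu=True)
        critic_features = forward_layer(self.critic_trunk, obs, relu=True)
        logits = forward_layer(self.policy_head, actor_features)
        value = forward_layer(self.value_head, critic_features)[0]
        return logits, value

    def num_parameters(self):
        """Count distinct weights, so a shared trunk is counted only once."""
        layers = [self.actor_trunk, self.policy_head, self.value_head]
        if not self.shared:
            layers.append(self.critic_trunk)
        return sum(len(row) for layer in layers for row in layer)


# Build one network of each kind with the same (arbitrary) sizes.
shared_net = ActorCritic(obs_dim=4, hidden_dim=8, n_actions=3, shared=True)
separate_net = ActorCritic(obs_dim=4, hidden_dim=8, n_actions=3, shared=False)
logits, value = shared_net.forward([0.1, -0.2, 0.3, 0.0])
```

In the shared variant, gradients from both the policy loss and the value loss would update the same trunk, whereas the non-shared variant keeps the two objectives from interfering at the cost of extra parameters. Which trade-off the checkpoints in this repo reflect is not stated in the README.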