Butanium committed on
Commit 42497bd
1 Parent(s): 68b09a5

Update README.md

Files changed (1)
  1. README.md +9 -8
README.md CHANGED
@@ -3,6 +3,15 @@ pipeline_tag: reinforcement-learning
  tags:
  - ppo
  ---
+ # Experiment
+ PPO agents trained in a self-play setting. This repo includes checkpoints collected during training for
+ 4 experiments:
+ - Shared weights for actor and critic
+ - No shared weights
+ - Resumed training for extra steps, for both the shared and non-shared setups
+ Please check our [wandb report](https://wandb.ai/dumas/SPAR_RL_ELK/) for more details, and see the training code on [our GitHub](https://github.com/Butanium/cleanrl/blob/master/multiplayer_pong/ppo_pettingzoo_ma_atari.py)
+
+
  # Environment
  Multiplayer pong_v3 from PettingZoo with:
  - 4 stacked frames
@@ -46,14 +55,6 @@ def get_env(args, run_name):
      return envs
  ```
 
- # Experiment
- PPO agents trained in a self-play setting. This repo includes checkpoints collected during training for
- 4 experiments:
- - Shared weights for actor and critic
- - No shared weights
- - Resumed training for extra steps, for both the shared and non-shared setups
- Please check our [wandb report](https://wandb.ai/dumas/SPAR_RL_ELK/) for more details, and see the training code on [our GitHub](https://github.com/Butanium/cleanrl/blob/master/multiplayer_pong/ppo_pettingzoo_ma_atari.py)
-
  # Model architecture
  ```py
  def atari_network(orth_init=False):
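
For context on the environment described in the README, here is a minimal, hypothetical sketch of how a multiplayer pong_v3 environment with 4 stacked frames is commonly assembled from PettingZoo and SuperSuit wrappers. It is not the repo's actual `get_env` (see the linked training script for that); the helper name `make_pong_env` and the exact wrapper settings other than the 4-frame stack are assumptions.

```py
# Hypothetical sketch, not the repo's get_env: a 2-player pong_v3 parallel env
# preprocessed with SuperSuit wrappers, ending with a stack of the last 4 frames.
import supersuit as ss
from pettingzoo.atari import pong_v3  # requires Atari ROMs (e.g. installed via AutoROM)


def make_pong_env():
    env = pong_v3.parallel_env(num_players=2)
    env = ss.max_observation_v0(env, 2)          # pixel-wise max over consecutive frames
    env = ss.frame_skip_v0(env, 4)               # repeat each action for 4 frames
    env = ss.color_reduction_v0(env, mode="B")   # reduce observations to a single channel
    env = ss.resize_v1(env, 84, 84)              # downscale observations to 84x84
    env = ss.frame_stack_v1(env, 4)              # the "4 stacked frames" from the README
    return env
```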