Tunamelon commited on
Commit
6f7cf4b
1 Parent(s): d6494b5

PPO LunarLander-v2 trained agent

Browse files
Files changed (4) hide show
  1. PPO_89.zip +1 -1
  2. README.md +1 -1
  3. replay.mp4 +0 -0
  4. results.json +1 -1
PPO_89.zip CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:06dc131d55ad5b43bf099c1808092f7c7a3e965f3665199161f1f20887c19dd5
3
  size 152858
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b6f063f3ae4eb3a375852616274c6285b389f11c27cf77f1514c1de2e0ff6e74
3
  size 152858
README.md CHANGED
@@ -16,7 +16,7 @@ model-index:
16
  type: LunarLander-v2
17
  metrics:
18
  - type: mean_reward
19
- value: 283.54 +/- 17.81
20
  name: mean_reward
21
  verified: false
22
  ---
 
16
  type: LunarLander-v2
17
  metrics:
18
  - type: mean_reward
19
+ value: 280.14 +/- 17.02
20
  name: mean_reward
21
  verified: false
22
  ---
replay.mp4 CHANGED
Binary files a/replay.mp4 and b/replay.mp4 differ
 
results.json CHANGED
@@ -1 +1 @@
1
- {"mean_reward": 283.53779849999995, "std_reward": 17.8117678525987, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2024-08-19T15:01:10.168030"}
 
1
+ {"mean_reward": 280.1437801, "std_reward": 17.01612686757744, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2024-08-19T15:01:24.981869"}