wowthecoder commited on
Commit
5e1ed3a
·
verified ·
1 Parent(s): 200290d

Push agent to the Hub

Browse files
README.md CHANGED
@@ -1,30 +1,30 @@
1
- ---
2
- tags:
3
- - LunarLander-v2
4
- - ppo
5
- - deep-reinforcement-learning
6
- - reinforcement-learning
7
- - custom-implementation
8
- - deep-rl-course
9
- model-index:
10
- - name: PPO
11
- results:
12
- - task:
13
- type: reinforcement-learning
14
- name: reinforcement-learning
15
- dataset:
16
- name: LunarLander-v2
17
- type: LunarLander-v2
18
- metrics:
19
- - type: mean_reward
20
- value: 61.64 +/- 134.03
21
- name: mean_reward
22
- verified: false
23
- ---
24
-
25
- # PPO Agent Playing LunarLander-v2
26
-
27
- This is a trained model of a PPO agent playing LunarLander-v2.
28
-
29
- # Hyperparameters
30
 
 
1
+ ---
2
+ tags:
3
+ - CartPole-v1
4
+ - ppo
5
+ - deep-reinforcement-learning
6
+ - reinforcement-learning
7
+ - custom-implementation
8
+ - deep-rl-course
9
+ model-index:
10
+ - name: PPO
11
+ results:
12
+ - task:
13
+ type: reinforcement-learning
14
+ name: reinforcement-learning
15
+ dataset:
16
+ name: CartPole-v1
17
+ type: CartPole-v1
18
+ metrics:
19
+ - type: mean_reward
20
+ value: 499.20 +/- 2.40
21
+ name: mean_reward
22
+ verified: false
23
+ ---
24
+
25
+ # PPO Agent Playing CartPole-v1
26
+
27
+ This is a trained model of a PPO agent playing CartPole-v1.
28
+
29
+ # Hyperparameters
30
 
logs/events.out.tfevents.1748165048.ff3a9e2058a7.1231.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3efe6b2388cd95b73e4cdb20bd9a3be082fc0efac533f806a70f4e1383cafe72
3
+ size 666048
model.pt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b640ff05bf252f4784c029f789a11c025b8e424763346f6718cb1a3e1d1c9a36
3
- size 43291
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d0d90448daca90d9e08942ca75d769d8e2545721b1f1ebd1e593348197fb6ba6
3
+ size 40859
replay.mp4 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:106d4243a63ea5b4c4c05f666639ef786b92a11f400298dbe9dd2e73e67759a4
3
- size 177771
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:18770d704d189e2699894ae6869c3a54f00dd0915f857cae55d4c2bab98d7782
3
+ size 50945
results.json CHANGED
@@ -1 +1 @@
1
- {"env_id": "LunarLander-v2", "mean_reward": 61.63650514566075, "std_reward": 134.03258442274785, "n_evaluation_episodes": 10, "eval_datetime": "2025-05-24T22:13:02.383407"}
 
1
+ {"env_id": "CartPole-v1", "mean_reward": 499.2, "std_reward": 2.4, "n_evaluation_episodes": 10, "eval_datetime": "2025-05-25T09:28:23.609401"}