Initial commit

Files changed (10) hide show

README.md ADDED Viewed

+---
+library_name: stable-baselines3
+tags:
+- udem1
+- deep-reinforcement-learning
+- reinforcement-learning
+- stable-baselines3
+model-index:
+- name: PPO
+  results:
+  - task:
+      type: reinforcement-learning
+      name: reinforcement-learning
+    dataset:
+      name: udem1
+      type: udem1
+    metrics:
+    - type: mean_reward
+      value: -1081.15 +/- 86.12
+      name: mean_reward
+      verified: false
+---
+# **PPO** Agent playing **udem1**
+This is a trained model of a **PPO** agent playing **udem1**
+using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
+## Usage (with Stable-baselines3)
+TODO: Add your code
+```python
+from stable_baselines3 import ...
+from huggingface_sb3 import load_from_hub
+...
+```

a2c-udem1.zip ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:5e66c18479993242b52f380bbdeffc73d22320556a417a58c937014e83fd8491
+size 1680641732

a2c-udem1/_stable_baselines3_version ADDED Viewed

	@@ -0,0 +1 @@


1	+ 2.1.0

a2c-udem1/data ADDED Viewed

The diff for this file is too large to render. See raw diff

a2c-udem1/policy.optimizer.pth ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:56cc6b8566e83dd48b7aad8a264354cdfa6e182ae026b513162db4c62f0b53ae
+size 1116319344

a2c-udem1/policy.pth ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:624cebfa2773175b166fd4734c1ed5b7c5a058b2e1b0bf782e3e195b6bfa8fdf
+size 558161726

a2c-udem1/pytorch_variables.pth ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:d030ad8db708280fcae77d87e973102039acd23a11bdecc3db8eb6c0ac940ee1
+size 431

a2c-udem1/system_info.txt ADDED Viewed

+- OS: Linux-5.15.90.1-microsoft-standard-WSL2-x86_64-with-glibc2.35 # 1 SMP Fri Jan 27 02:56:13 UTC 2023
+- Python: 3.11.5
+- Stable-Baselines3: 2.1.0
+- PyTorch: 2.0.1+cu117
+- GPU Enabled: True
+- Numpy: 1.26.0
+- Cloudpickle: 2.2.1
+- Gymnasium: 0.29.1
+- OpenAI Gym: 0.26.2

config.json ADDED Viewed

The diff for this file is too large to render. See raw diff

results.json ADDED Viewed

	@@ -0,0 +1 @@


1	+ {"mean_reward": -1081.1514985211193, "std_reward": 86.1153949026763, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-11-03T01:18:22.251769"}