Upload of a new agent trained with Remote-DQN on Taxi-v3
- README.md +60 -0
- algorithm.zip +3 -0
- replay.mp4 +0 -0
- results.json +1 -0
- system.json +1 -0
README.md
ADDED
@@ -0,0 +1,60 @@
---
tags:
- Taxi-v3
- reinforcement-learning
- rl-framework
model-index:
- name: Remote-DQN-Taxi-v3
  results:
  - task:
      type: reinforcement-learning
      name: reinforcement-learning
    dataset:
      name: Taxi-v3
      type: Taxi-v3
    metrics:
    - type: mean_reward
      value: -200.00 +/- 0.00
      name: mean_reward
      verified: false
---

# Custom-implemented Remote-DQN agent playing *Taxi-v3*

This is a trained model of an agent playing in the environment *Taxi-v3*.
The agent was trained with a Remote-DQN algorithm and evaluated for 50 episodes.
See the corresponding README section for further agent and evaluation metadata.
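
As a rough illustration of how a metric of the form `mean +/- std` can be derived from per-episode returns, the sketch below summarizes 50 hypothetical episodes. The helper `summarize`, the use of the population standard deviation, and the episode returns themselves are assumptions for illustration, not the framework's actual evaluation code. The constants reflect Gymnasium's defaults for Taxi-v3 (a reward of -1 per step and a 200-step time limit).

```python
from statistics import fmean, pstdev

# Gymnasium defaults for Taxi-v3: -1 reward per step, 200-step time limit.
STEP_PENALTY = -1
MAX_EPISODE_STEPS = 200


def summarize(episode_returns):
    """Format returns as 'mean +/- std' (population std is an assumption)."""
    mean = fmean(episode_returns)
    std = pstdev(episode_returns)
    return f"{mean:.2f} +/- {std:.2f}"


# Hypothetical data: every episode runs into the time limit without
# delivering the passenger and without any illegal pickup or drop-off.
timed_out_return = float(STEP_PENALTY * MAX_EPISODE_STEPS)
episode_returns = [timed_out_return] * 50

metric = summarize(episode_returns)
print(metric)
```

Under these assumptions, a reported value of -200.00 with zero spread is consistent with an agent that reaches the time limit in every evaluation episode.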

## Import
The Python module used for training and uploading/downloading is [rl-framework](https://github.com/alexander-zap/rl-framework).
It is an easy-to-read, plug-and-use Reinforcement Learning framework that provides standardized interfaces
and implementations for various Reinforcement Learning methods and environments.

It also provides connectors for uploading to and downloading from popular model version control systems,
including the HuggingFace Hub.

## Usage
```python

# The project is named "rl-framework", but a hyphen is not valid in a Python
# module name; the import name `rl_framework` is an assumption.
from rl_framework import StableBaselinesAgent, StableBaselinesAlgorithm

# Create new agent instance
agent = StableBaselinesAgent(
    # "Remote-DQN" is not a valid Python identifier. `DQN` is an assumed
    # member name; check StableBaselinesAlgorithm for the exact spelling.
    algorithm=StableBaselinesAlgorithm.DQN,
    algorithm_parameters={
        ...
    },
)

# Download existing agent from HF Hub
repository_id = "zap-thamm/Remote-DQN-Taxi-v3"
file_name = "algorithm.zip"
agent.download(repository_id=repository_id, filename=file_name)

```

Further examples can be found in the [exploration section of the rl-framework repository](https://github.com/alexander-zap/rl-framework/tree/main/exploration).
algorithm.zip
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:1f3c2bfa5151a43dad3d4e3daadcea481a7d48ece46992e245b7d99443e30459
size 612148
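
The three lines above are a Git LFS pointer: the repository stores this small text file, while the actual `algorithm.zip` lives in LFS storage. The following is a minimal sketch of how such a pointer could be parsed and used to check a downloaded file. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any library.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    hash_algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data, pointer):
    """Return True if the bytes have the size and digest named by the pointer."""
    if len(data) != pointer["size"]:
        return False
    actual = hashlib.new(pointer["hash_algo"], data).hexdigest()
    return actual == pointer["digest"]


POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:1f3c2bfa5151a43dad3d4e3daadcea481a7d48ece46992e245b7d99443e30459
size 612148
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["hash_algo"], pointer["size"])
```

After downloading `algorithm.zip`, reading it in binary mode and passing the bytes to `matches_pointer` would confirm that the file matches what this commit recorded.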
replay.mp4
ADDED
Binary file (385 kB).
results.json
ADDED
@@ -0,0 +1 @@
{"env_id": "Taxi-v3", "mean_reward": -200.0, "n_eval_episodes": 50, "eval_datetime": "2023-12-28T01:06:36.190507"}
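
The record in `results.json` can be cross-checked against the `mean_reward` value in the README front matter. The sketch below does this with the standard library only. The helper `card_mean` and the rounding tolerance are assumptions for illustration; the JSON record and the metric string are copied from this commit.

```python
import json
from datetime import datetime

# Copied verbatim from results.json and from the README front matter.
RESULTS_JSON = (
    '{"env_id": "Taxi-v3", "mean_reward": -200.0, '
    '"n_eval_episodes": 50, "eval_datetime": "2023-12-28T01:06:36.190507"}'
)
CARD_METRIC = "-200.00 +/- 0.00"


def card_mean(metric):
    """Extract the mean from a 'mean +/- std' metric string."""
    return float(metric.split("+/-")[0])


record = json.loads(RESULTS_JSON)
evaluated_at = datetime.fromisoformat(record["eval_datetime"])

# The card rounds to two decimals, so allow half a unit in the last place.
consistent = abs(record["mean_reward"] - card_mean(CARD_METRIC)) < 0.005
print(record["env_id"], record["n_eval_episodes"], evaluated_at.date(), consistent)
```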
system.json
ADDED
@@ -0,0 +1 @@
{"OS": "Windows-10-10.0.19045-SP0 10.0.19045", "Python": "3.10.8", "Stable-Baselines3": "2.2.1", "PyTorch": "2.1.2+cpu", "GPU Enabled": "False", "Numpy": "1.26.2", "Cloudpickle": "3.0.0", "Gymnasium": "0.29.1"}