Commit
•
3c4d2cc
1
Parent(s):
3cd0bfa
Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,21 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
tags:
|
3 |
+
- deep-reinforcement-learning
|
4 |
+
- reinforcement-learning
|
5 |
+
- decision-transformer
|
6 |
+
- gym-continous-control
|
7 |
+
|
8 |
+
---
|
9 |
+
# Decision Transformer model trained on expert trajectories sampled from the Gym HalfCheetah environment
|
10 |
+
This is a trained [Decision Transformer](https://arxiv.org/abs/2106.01345) model trained on expert trajectories sampled from the Gym Hopper environment
|
11 |
+
|
12 |
+
<video src="https://huggingface.co/edbeeching/decision-transformer-gym-halfcheetah-expert/resolve/main/halfcheetah-expert.mp4" controls autoplay loop></video>
|
13 |
+
|
14 |
+
The following normlization coeficients are required to use this model:
|
15 |
+
|
16 |
+
mean = [ -0.04489148, 0.03232588, 0.06034835, -0.17081226, -0.19480659, -0.05751596, 0.09701628, 0.03239211, 11.047426, -0.07997331, -0.32363534, 0.36297753, 0.42322603, 0.40836546, 1.1085187, -0.4874403, -0.0737481 ]
|
17 |
+
|
18 |
+
std = [0.04002118, 0.4107858, 0.54217845, 0.41522816, 0.23796624, 0.62036866, 0.30100912, 0.21737163, 2.2105937, 0.572586, 1.7255033, 11.844218, 12.06324, 7.0495934, 13.499867, 7.195647, 5.0264325]
|
19 |
+
|
20 |
+
See our [Blog Post](https://colab.research.google.com/drive/1K3UuajwoPY1MzRKNkONNRS3gS5DxZ-qF?usp=sharing), [Colab notebook](https://colab.research.google.com/drive/1K3UuajwoPY1MzRKNkONNRS3gS5DxZ-qF?usp=sharing) or [Example Script](https://github.com/huggingface/transformers/tree/main/examples/research_projects/decision_transformer) for usage.
|
21 |
+
|