Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
robotman0
/
ppo-LunarLander-v2
like
0
Reinforcement Learning
Transformers
TensorBoard
LunarLander-v2
ppo
deep-reinforcement-learning
custom-implementation
deep-rl-course
Eval Results
Inference Endpoints
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
main
ppo-LunarLander-v2
/
mike2
/
policy.optimizer.pth
Commit History
default params for 1 million timesteps
d0f75d7
robotman0
commited on
Dec 13, 2022