neaven77 commited on
Commit
f2b0548
1 Parent(s): e53e3cf

Upload folder using huggingface_hub

Browse files
README.md CHANGED
@@ -1,11 +1,10 @@
1
  ---
 
2
  tags:
3
  - LunarLander-v2
4
- - ppo
5
  - deep-reinforcement-learning
6
  - reinforcement-learning
7
- - custom-implementation
8
- - deep-rl-course
9
  model-index:
10
  - name: PPO
11
  results:
@@ -17,14 +16,22 @@ model-index:
17
  type: LunarLander-v2
18
  metrics:
19
  - type: mean_reward
20
- value: -114.24 +/- 78.22
21
  name: mean_reward
22
  verified: false
23
  ---
24
 
25
- # PPO Agent Playing LunarLander-v2
 
 
26
 
27
- This is a trained model of a PPO agent playing LunarLander-v2.
 
28
 
29
- # Hyperparameters
30
-
 
 
 
 
 
 
1
  ---
2
+ library_name: stable-baselines3
3
  tags:
4
  - LunarLander-v2
 
5
  - deep-reinforcement-learning
6
  - reinforcement-learning
7
+ - stable-baselines3
 
8
  model-index:
9
  - name: PPO
10
  results:
 
16
  type: LunarLander-v2
17
  metrics:
18
  - type: mean_reward
19
+ value: 273.91 +/- 19.44
20
  name: mean_reward
21
  verified: false
22
  ---
23
 
24
+ # **PPO** Agent playing **LunarLander-v2**
25
+ This is a trained model of a **PPO** agent playing **LunarLander-v2**
26
+ using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
27
 
28
+ ## Usage (with Stable-baselines3)
29
+ TODO: Add your code
30
 
31
+
32
+ ```python
33
+ from stable_baselines3 import ...
34
+ from huggingface_sb3 import load_from_hub
35
+
36
+ ...
37
+ ```
config.json CHANGED
@@ -1 +1 @@
1
- {"policy_class": {":type:": "<class 'abc.ABCMeta'>", ":serialized:": "gAWVOwAAAAAAAACMIXN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi5wb2xpY2llc5SMEUFjdG9yQ3JpdGljUG9saWN5lJOULg==", "__module__": "stable_baselines3.common.policies", "__doc__": "\n Policy class for actor-critic algorithms (has both policy and value prediction).\n Used by A2C, PPO and the likes.\n\n :param observation_space: Observation space\n :param action_space: Action space\n :param lr_schedule: Learning rate schedule (could be constant)\n :param net_arch: The specification of the policy and value networks.\n :param activation_fn: Activation function\n :param ortho_init: Whether to use or not orthogonal initialization\n :param use_sde: Whether to use State Dependent Exploration or not\n :param log_std_init: Initial value for the log standard deviation\n :param full_std: Whether to use (n_features x n_actions) parameters\n for the std instead of only (n_features,) when using gSDE\n :param use_expln: Use ``expln()`` function instead of ``exp()`` to ensure\n a positive standard deviation (cf paper). It allows to keep variance\n above zero and prevent it from growing too fast. In practice, ``exp()`` is usually enough.\n :param squash_output: Whether to squash the output using a tanh function,\n this allows to ensure boundaries when using gSDE.\n :param features_extractor_class: Features extractor to use.\n :param features_extractor_kwargs: Keyword arguments\n to pass to the features extractor.\n :param share_features_extractor: If True, the features extractor is shared between the policy and value networks.\n :param normalize_images: Whether to normalize images or not,\n dividing by 255.0 (True by default)\n :param optimizer_class: The optimizer to use,\n ``th.optim.Adam`` by default\n :param optimizer_kwargs: Additional keyword arguments,\n excluding the learning rate, to pass to the optimizer\n ", "__init__": "<function ActorCriticPolicy.__init__ at 0x7842672c4700>", "_get_constructor_parameters": "<function ActorCriticPolicy._get_constructor_parameters at 0x7842672c4790>", "reset_noise": "<function ActorCriticPolicy.reset_noise at 0x7842672c4820>", "_build_mlp_extractor": "<function ActorCriticPolicy._build_mlp_extractor at 0x7842672c48b0>", "_build": "<function ActorCriticPolicy._build at 0x7842672c4940>", "forward": "<function ActorCriticPolicy.forward at 0x7842672c49d0>", "extract_features": "<function ActorCriticPolicy.extract_features at 0x7842672c4a60>", "_get_action_dist_from_latent": "<function ActorCriticPolicy._get_action_dist_from_latent at 0x7842672c4af0>", "_predict": "<function ActorCriticPolicy._predict at 0x7842672c4b80>", "evaluate_actions": "<function ActorCriticPolicy.evaluate_actions at 0x7842672c4c10>", "get_distribution": "<function ActorCriticPolicy.get_distribution at 0x7842672c4ca0>", "predict_values": "<function ActorCriticPolicy.predict_values at 0x7842672c4d30>", "__abstractmethods__": "frozenset()", "_abc_impl": "<_abc._abc_data object at 0x784267265f40>"}, "verbose": 1, "policy_kwargs": {}, "num_timesteps": 1048576, "_total_timesteps": 1000000, "_num_timesteps_at_start": 0, "seed": null, "action_noise": null, "start_time": 1729035310669125360, "learning_rate": 0.0003, "tensorboard_log": null, "_last_obs": {":type:": "<class 'numpy.ndarray'>", ":serialized:": "gAWVdQgAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYACAAAAAAAAM0Ucrt7RJK6CYWOue9shTK2GBe7AU6iOAAAgD8AAIA/Mzflu0ilgbqzeh84M5iysr6hb7tVVTW3AACAPwAAgD9NYhy9XIsIumEftTdAXGAyvBWaOgAb0LYAAIA/AACAPzN9g73DKUi64r90OhomDbZ7Pom752eMuQAAgD8AAIA/AKbsPEi/jrr6dsc60a+pNaV+O7t9Zee5AACAPwAAgD9m1LW8CrdEuWLOP7nJf8GzJZMEu5LDYDgAAIA/AACAP+aUAz32LGq6IGaQOpcIZjXvPNa69fCluQAAgD8AAIA/zQ6ePMroCT51UdW9y+WyvqdGCz2nXTS9AAAAAAAAAAAzpqC8w9UxOdFhEDm8BK4zp48VO5SeK7gAAIA/AACAPzYceb4AMqk+Ll22PqH1vb5IAXm940/GPQAAAAAAAAAAgGGKvRRAjrrHYJ+4gI2PsoB4O7uyv7Y3AACAPwAAgD+aRvW9H7eLu5jdsD4TUQS+gYOUvfiZpD4AAIA/AAAAAACAZblci326OsBJuBt7i7MZWiK68pJoNwAAgD8AAIA/5uNfPZQuwj8GeoE+j9CIvLeqhD115wc+AAAAAAAAAAAgJAC+PXY4u8FZPLtxSKa4qwx/PBo4cDoAAIA/AACAP81497zDVSi6ksO5uZEBJbVUACY7LvfXOAAAgD8AAIA/mnmaOo8eW7osKJy7VZZaOIjo1TraH5E4AACAPwAAgD+zABO9e26JuvZyj7s+H5U2iWTmud6OBrYAAIA/AACAPwAAOrwfBc63K3XLOsaXBTYwkZk5QKzwuQAAgD8AAIA/AGn+vFLA7LmlXTE4niR/MwpKJTpzjU63AACAPwAAgD+aImE95quVPw14Dz57khC/JenFPWU3yTwAAAAAAAAAAACltrz2+EO6J+4oOMlQSzOHFBe7QqNDtwAAgD8AAIA/mrOXva4thrq7PzY40R4eM30CljkNAU+3AACAPwAAgD+aJ/C89hw4ulZWJbjyRYyzW5WcOlzGPDcAAIA/AACAP1pY5L0freO5Pl7fuuM4UraMhUE6AWkBOgAAgD8AAIA/M1AcvVyDGbqdpcw22mmFMSHrvTplLOy1AACAPwAAgD8AkJG69jRTuktij7l3CiWz6PcROvPDpjgAAIA/AACAP81ok7xIt5a6brEVt9MCRrJ3c6g5Kq8qNgAAgD8AAIA/Zj5LPFyvR7riBqc6d91CNaNJnLtMh8S5AACAPwAAgD+aSpa9hcPeucyzp7KTy/IuQ4VJu9LDKDMAAIA/AACAP7Oyfr2P3h+6EjK/t2KksLKulSg6IP7jNgAAgD8AAIA/sxh+PcP1frog+GU6PEhuNqTfGzveaoO5AACAPwAAgD+a5b67w4lmutv42TpZiCs26Z+ROM2G+7kAAIA/AACAP5rrarxcAzO682O0Npp00DAlF6A5Bp7VtQAAgD8AAIA/5ozFvezJyLndOXW43DZNs6paPrlltow3AACAPwAAgD+aJbs8+4+ZvFr38L1QiS08Mkj4PexaozoAAIA/AACAP00MRL17Oo26CqeaOVUjUbNzV8q6ryuwuAAAgD8AAIA/s5aJPlm08D4/NJa+4Ikvv63Ybz4DaFK+AAAAAAAAAABAXrK9d281PlIqmL0LoL++i5W8vRjXLbwAAAAAAAAAADPb+Tv21Ae6QKpCu7mIAbdBDgW6O51kOgAAgD8AAIA/ZrBLPFzbCbrSkq26TPoHtkbVgTrrDc05AACAPwAAgD8zI/Y6SJOEur2QoraJf3UwwjJEu0pmuzUAAIA/AACAPwAjkTyPFgC6TWFhOgA4zrOZjKM6sZyEuQAAgD8AAIA/5oJuvVy7HbqRIwk7UUAuOMZCr7pMrqS5AACAPwAAgD+aHu88w7E4urCvJjp9lZC1z0E6O0xoRLkAAIA/AACAPwAGyr0paGy6yAOXvD0sszzRU4+6KjWbvQAAgD8AAIA/rbUCPqQtDT5G9re+niCOvm1YJL0VBTa+AAAAAAAAAAAAoLu7e9qGuvIIq7uHZUE4DONwOirQErYAAIA/AACAP5qmZT32rF+6+uKKu8RUlbaXmaA6ZREINgAAgD8AAIA/TZWOvfZkR7qsbYM5BuufNDu/WDou3pm4AACAPwAAgD8ATME7pNxCOtGFvbp0am28l3ZQOiGxlTsAAAAAAAAAALOuc71IO6O6pSUlOZOKFzS3tp+6IBg+uAAAgD8AAIA/es83PvSvXz92N8Y9fYskv9vSlT5U5K+8AAAAAAAAAABmUoe8SPGMuulkmDrAvAk2REX9OgxHrrkAAIA/AACAP02iWb3h4oC6WALlOprD1zVdLhU77rUFugAAgD8AAIA/TYVzvY9eTrqmlHs5nb2tNPupvrp+vpC4AACAPwAAgD8zlgm9KehDuhLzlbSpIy2wzH3AuSKhfDMAAIA/AACAP4C4lL1cbV+8IwP2PX3/fD0O6pS97t28OgAAgD8AAIA/NeWYvgj7/T56/pA+2ioTv8z4er5gxE4+AAAAAAAAAAA6vFQ+p4tkPuJ7xL5cJ96+TkRQPr4JXb4AAAAAAAAAAGbcdryPThC6ZmiUNTFUkTAx2ME4Gj+vtAAAgD8AAIA/M3/mPCnweroim7k74V+2N4quSroOd282AACAPwAAgD+zpRK9w6l/urq/XjiPb3kzIMY3uOjggbcAAIA/AACAP02vdr0fDfy51iVbOV3pnzTp4ck6s6N+uAAAgD8AAIA/lIwFbnVtcHmUjAVkdHlwZZSTlIwCZjSUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYktASwiGlIwBQ5R0lFKULg=="}, "_last_episode_starts": {":type:": "<class 'numpy.ndarray'>", ":serialized:": "gAWVswAAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJZAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACUjAVudW1weZSMBWR0eXBllJOUjAJiMZSJiIeUUpQoSwOMAXyUTk5OSv////9K/////0sAdJRiS0CFlIwBQ5R0lFKULg=="}, "_last_original_obs": null, "_episode_num": 0, "use_sde": false, "sde_sample_freq": -1, "_current_progress_remaining": -0.04857599999999995, "_stats_window_size": 100, "ep_info_buffer": {":type:": "<class 'collections.deque'>", ":serialized:": "gAWVNgwAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKUKH2UKIwBcpRHQGTCuYhMajyMAWyUTegDjAF0lEdAtO/QS6DoQnV9lChoBkdAcdKnEETxomgHTXkBaAhHQLTwz32mHgx1fZQoaAZHQHOA59E1EVpoB01eAmgIR0C08M/GIbfhdX2UKGgGR0Bk9o4n4O+aaAdN6ANoCEdAtPFHPQfIS3V9lChoBkdAcBrntv4ub2gHTbkBaAhHQLTxaXyRSxZ1fZQoaAZHQGPd90A93bFoB03oA2gIR0C08bJg9eQddX2UKGgGR0Bwu86vJRwZaAdNlAJoCEdAtPM4xFiKBXV9lChoBkdAS4FrsSkCWGgHS5doCEdAtPOrDgqEvnV9lChoBkdAbiY4bS7XhGgHTTwBaAhHQLT0KGACnxd1fZQoaAZHQHN0HLFGXoloB021AWgIR0C09FUknkT6dX2UKGgGR0BlH12aDwpfaAdN6ANoCEdAtPRh34bjtHV9lChoBkdAccS3bmEGq2gHTUwCaAhHQLT0kNZeRgZ1fZQoaAZHQHH7/lMh5gRoB02gA2gIR0C09L4oqkM1dX2UKGgGR0BfmAHE/B3zaAdN6ANoCEdAtPT1B8hLXnV9lChoBkdAaJ4PRzBAOmgHTegDaAhHQLT3KFuvUz91fZQoaAZHQEoofvnbItFoB0uDaAhHQLT3eNgBtDV1fZQoaAZHQHF6k1EVnEloB03EAWgIR0C0+wbIxQBQdX2UKGgGR0Bg7M3XI2fkaAdN6ANoCEdAtPsmC2+fy3V9lChoBkdAanIDQJHAh2gHTegDaAhHQLT7Z94/u9h1fZQoaAZHQGkawHqu8sdoB03oA2gIR0C0+5E6HTJAdX2UKGgGR0BmBBwbVBldaAdN6ANoCEdAtPyJo+Ofd3V9lChoBkdAaGykj5bhWGgHTegDaAhHQLT80lFMIu51fZQoaAZHQGS0U5lvqC9oB03oA2gIR0C0/RhHTZxrdX2UKGgGR0Bk+0n5SFXaaAdN6ANoCEdAtP1/juKGcnV9lChoBkdAcPGGxUvPC2gHTbsBaAhHQLT9lwZflZJ1fZQoaAZHQHE6AWnCO3loB02lA2gIR0C0/aKhUR4AdX2UKGgGR0BBDizLOiWWaAdLe2gIR0C0/dvTCtRvdX2UKGgGR0BznE0cfeUIaAdNTwNoCEdAtP4gtXgccXV9lChoBkdAcIJm0mdAgWgHTRIBaAhHQLT+mZ/kNnZ1fZQoaAZHQGdtZEMLF4toB03oA2gIR0C0/xKtDD0ldX2UKGgGR0BT2kQf6oETaAdLpWgIR0C0//yzcAR1dX2UKGgGR0BozDQTmGM5aAdN6ANoCEdAtQLskzGgjHV9lChoBkdAUYdjoZAIIGgHS4BoCEdAtQPl7F85S3V9lChoBkdAcqjzp5eJHmgHS8ZoCEdAtQQ80YTCcnV9lChoBkdAcICmKIi1RmgHTYACaAhHQLUFvhNucc51fZQoaAZHQHD/D0cwQDpoB03hAWgIR0C1BlPN3W4FdX2UKGgGR0BxBRhy8zyjaAdN3wNoCEdAtQfCymhufnV9lChoBkdAZ6mFeOXE62gHTegDaAhHQLUJgUFSsKd1fZQoaAZHQGrxYuTRplBoB03oA2gIR0C1CqePeYUndX2UKGgGR0Bl1Qzi0fHQaAdN6ANoCEdAtQqphnanJnV9lChoBkdAcVo0uUUwjGgHTQsDaAhHQLULiWCVbA11fZQoaAZHQHEcAb6xgRdoB00yAmgIR0C1DJlQMx46dX2UKGgGR0BxdItWdVebaAdNCANoCEdAtQ1B23azvHV9lChoBkdAYsubDuSfUWgHTegDaAhHQLUOJcPOIIp1fZQoaAZHQHLhS2UjcEhoB01MA2gIR0C1DwHVwxWUdX2UKGgGR0BNvkvTPSlWaAdLkmgIR0C1D0Ih+vyLdX2UKGgGR0BjiTEP1+RYaAdN6ANoCEdAtRDPcnE2pHV9lChoBkdAaLursByS3mgHTegDaAhHQLURJ2zv7WN1fZQoaAZHQGNLsWweNkxoB03oA2gIR0C1EdUfgaWHdX2UKGgGR0BiQDW3BpHqaAdN6ANoCEdAtRIiL876pHV9lChoBkdAYbuQtBfKIWgHTegDaAhHQLUSI6By0a91fZQoaAZHQGU6HeizsyBoB03oA2gIR0C1Ejxf8dgfdX2UKGgGR0Bm+PYvnKW+aAdN6ANoCEdAtRRsyvcJt3V9lChoBkdAcw6qHXVbzWgHTSkDaAhHQLUVC0p3HJd1fZQoaAZHQHFnCuuA7PpoB0vkaAhHQLUXUnogV451fZQoaAZHQGR0ylFc6eZoB03oA2gIR0C1F1MkY4yXdX2UKGgGR0Bx7A9zOopAaAdNDQJoCEdAtRgcj2SMcnV9lChoBkdAcsAgte2NN2gHTRsCaAhHQLUYWCOWBz51fZQoaAZHQGZFpblijL1oB03oA2gIR0C1GMdDpkf+dX2UKGgGR0BpIqPZIxxlaAdN6ANoCEdAtRjG+0w8GXV9lChoBkdAZbzxlQMx5GgHTegDaAhHQLUYxy5Zr591fZQoaAZHQGhqyGrS3LFoB03oA2gIR0C1GMbTx5LRdX2UKGgGR0BnKIqwyIpIaAdN6ANoCEdAtRjHSYw7DHV9lChoBkdAZjII5YHPeGgHTegDaAhHQLUYx6Q/5cl1fZQoaAZHQGIyUqpcX3xoB03oA2gIR0C1GMiGahHtdX2UKGgGR0Bl1evbGm1qaAdN6ANoCEdAtRjJGWldknV9lChoBkdAYwYr92ovSWgHTegDaAhHQLUYyVeruIB1fZQoaAZHQHBOq8QI2O1oB004A2gIR0C1GcnDBMzudX2UKGgGR0BmHWzfJmulaAdN6ANoCEdAtRnpkK/mDHV9lChoBkdAahNHZsbedmgHTegDaAhHQLUbJNlyzX11fZQoaAZHQHDnCq2jO9poB01yA2gIR0C1Gy9ELH+7dX2UKGgGR0Bxog2XLNfPaAdNVgJoCEdAtRuEW69TP3V9lChoBkdAckqRoAXEZWgHTTADaAhHQLUc0iy6cy51fZQoaAZHQHGWfiYLLIRoB00lA2gIR0C1HPVdPci4dX2UKGgGR0BpGYGSpzcRaAdN6ANoCEdAtR0BKAavR3V9lChoBkdARSVZA6dUbWgHS3xoCEdAtR3uenQ6ZHV9lChoBkdAZjJyZrpJPWgHTegDaAhHQLUd7s/6frd1fZQoaAZHQGkdYZVGTcJoB03oA2gIR0C1Hg4bn5i3dX2UKGgGR0BymJ9d/rjYaAdNCwJoCEdAtR5ODCgsb3V9lChoBkdARJjQqqfe12gHS4VoCEdAtR5OdYnv2HV9lChoBkdAcfkp2ll9SmgHTZYCaAhHQLUe6YlpoK51fZQoaAZHQGf7dl/YraxoB03oA2gIR0C1HzxgZ0jkdX2UKGgGR0Bpd+fNA1NyaAdN6ANoCEdAtR9bRZ2ZA3V9lChoBkdAaHYN4qwyI2gHTegDaAhHQLUfhAprk811fZQoaAZHQHI2DkMkQf9oB03oAWgIR0C1IIcGgSOBdX2UKGgGR0Bk4skrwvxpaAdN6ANoCEdAtSIbP3SKFnV9lChoBkdAbu6j3225QWgHTZkBaAhHQLUicpXIU8F1fZQoaAZHQHO9QOBlMAZoB00uA2gIR0C1Iy4A4n4PdX2UKGgGR0Bv80aqCHymaAdLwWgIR0C1JAazu4PPdX2UKGgGR0BkLM1fmcOLaAdN6ANoCEdAtSRMDB/I83V9lChoBkdAYyBlq8DjimgHTegDaAhHQLUku4R28qZ1fZQoaAZHQGVRbVrhzeZoB03oA2gIR0C1JTgEZBLPdX2UKGgGR0Bqupf0Eov0aAdN6ANoCEdAtSVwipvP1XV9lChoBkdAR5A8p1A7gmgHS2BoCEdAtSVw7p3X7XV9lChoBkdAcQ3/tY0VJ2gHTTcDaAhHQLUlkcu8K5V1fZQoaAZHQGZPIHTqjahoB03oA2gIR0C1JZywwCbMdX2UKGgGR0Byhpr0rbxmaAdN5wNoCEdAtSW8/oq0+nV9lChoBkdAcCiYmb9ZR2gHS8ZoCEdAtSdAcWCVbHV9lChoBkdAcJkkJ8fFJmgHTbkCaAhHQLUnxhakhzN1fZQoaAZHQFAkgZ0jkdVoB0uQaAhHQLUpBhXbM5h1ZS4="}, "ep_success_buffer": {":type:": "<class 'collections.deque'>", ":serialized:": "gAWVIAAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKULg=="}, "_n_updates": 240, "observation_space": {":type:": "<class 'gymnasium.spaces.box.Box'>", ":serialized:": "gAWVdgIAAAAAAACMFGd5bW5hc2l1bS5zcGFjZXMuYm94lIwDQm94lJOUKYGUfZQojAVkdHlwZZSMBW51bXB5lIwFZHR5cGWUk5SMAmY0lImIh5RSlChLA4wBPJROTk5K/////0r/////SwB0lGKMDWJvdW5kZWRfYmVsb3eUjBJudW1weS5jb3JlLm51bWVyaWOUjAtfZnJvbWJ1ZmZlcpSTlCiWCAAAAAAAAAABAQEBAQEBAZRoCIwCYjGUiYiHlFKUKEsDjAF8lE5OTkr/////Sv////9LAHSUYksIhZSMAUOUdJRSlIwNYm91bmRlZF9hYm92ZZRoESiWCAAAAAAAAAABAQEBAQEBAZRoFUsIhZRoGXSUUpSMBl9zaGFwZZRLCIWUjANsb3eUaBEoliAAAAAAAAAAAAC0wgAAtMIAAKDAAACgwNsPScAAAKDAAAAAgAAAAICUaAtLCIWUaBl0lFKUjARoaWdolGgRKJYgAAAAAAAAAAAAtEIAALRCAACgQAAAoEDbD0lAAACgQAAAgD8AAIA/lGgLSwiFlGgZdJRSlIwIbG93X3JlcHKUjFtbLTkwLiAgICAgICAgLTkwLiAgICAgICAgIC01LiAgICAgICAgIC01LiAgICAgICAgIC0zLjE0MTU5MjcgIC01LgogIC0wLiAgICAgICAgIC0wLiAgICAgICBdlIwJaGlnaF9yZXBylIxTWzkwLiAgICAgICAgOTAuICAgICAgICAgNS4gICAgICAgICA1LiAgICAgICAgIDMuMTQxNTkyNyAgNS4KICAxLiAgICAgICAgIDEuICAgICAgIF2UjApfbnBfcmFuZG9tlE51Yi4=", "dtype": "float32", "bounded_below": "[ True True True True True True True True]", "bounded_above": "[ True True True True True True True True]", "_shape": [8], "low": "[-90. -90. -5. -5. -3.1415927 -5.\n -0. -0. ]", "high": "[90. 90. 5. 5. 3.1415927 5.\n 1. 1. ]", "low_repr": "[-90. -90. -5. -5. -3.1415927 -5.\n -0. -0. ]", "high_repr": "[90. 90. 5. 5. 3.1415927 5.\n 1. 1. ]", "_np_random": null}, "action_space": {":type:": "<class 'gymnasium.spaces.discrete.Discrete'>", ":serialized:": "gAWV2wAAAAAAAACMGWd5bW5hc2l1bS5zcGFjZXMuZGlzY3JldGWUjAhEaXNjcmV0ZZSTlCmBlH2UKIwBbpSMFW51bXB5LmNvcmUubXVsdGlhcnJheZSMBnNjYWxhcpSTlIwFbnVtcHmUjAVkdHlwZZSTlIwCaTiUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYkMIBAAAAAAAAACUhpRSlIwFc3RhcnSUaAhoDkMIAAAAAAAAAACUhpRSlIwGX3NoYXBllCmMBWR0eXBllGgOjApfbnBfcmFuZG9tlE51Yi4=", "n": "4", "start": "0", "_shape": [], "dtype": "int64", "_np_random": null}, "n_envs": 64, "n_steps": 2048, "gamma": 0.999, "gae_lambda": 0.98, "ent_coef": 0.01, "vf_coef": 0.5, "max_grad_norm": 0.5, "batch_size": 64, "n_epochs": 10, "clip_range": {":type:": "<class 'function'>", ":serialized:": "gAWVxQIAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwOX21ha2VfZnVuY3Rpb26Uk5QoaACMDV9idWlsdGluX3R5cGWUk5SMCENvZGVUeXBllIWUUpQoSwFLAEsASwFLAUsTQwSIAFMAlE6FlCmMAV+UhZSMSS91c3IvbG9jYWwvbGliL3B5dGhvbjMuMTAvZGlzdC1wYWNrYWdlcy9zdGFibGVfYmFzZWxpbmVzMy9jb21tb24vdXRpbHMucHmUjARmdW5jlEuEQwIEAZSMA3ZhbJSFlCl0lFKUfZQojAtfX3BhY2thZ2VfX5SMGHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbpSMCF9fbmFtZV9flIwec3RhYmxlX2Jhc2VsaW5lczMuY29tbW9uLnV0aWxzlIwIX19maWxlX1+UjEkvdXNyL2xvY2FsL2xpYi9weXRob24zLjEwL2Rpc3QtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lHVOTmgAjBBfbWFrZV9lbXB0eV9jZWxslJOUKVKUhZR0lFKUjBxjbG91ZHBpY2tsZS5jbG91ZHBpY2tsZV9mYXN0lIwSX2Z1bmN0aW9uX3NldHN0YXRllJOUaB99lH2UKGgWaA2MDF9fcXVhbG5hbWVfX5SMGWNvbnN0YW50X2ZuLjxsb2NhbHM+LmZ1bmOUjA9fX2Fubm90YXRpb25zX1+UfZSMDl9fa3dkZWZhdWx0c19flE6MDF9fZGVmYXVsdHNfX5ROjApfX21vZHVsZV9flGgXjAdfX2RvY19flE6MC19fY2xvc3VyZV9flGgAjApfbWFrZV9jZWxslJOURz/JmZmZmZmahZRSlIWUjBdfY2xvdWRwaWNrbGVfc3VibW9kdWxlc5RdlIwLX19nbG9iYWxzX1+UfZR1hpSGUjAu"}, "clip_range_vf": null, "normalize_advantage": true, "target_kl": null, "lr_schedule": {":type:": "<class 'function'>", ":serialized:": "gAWVxQIAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwOX21ha2VfZnVuY3Rpb26Uk5QoaACMDV9idWlsdGluX3R5cGWUk5SMCENvZGVUeXBllIWUUpQoSwFLAEsASwFLAUsTQwSIAFMAlE6FlCmMAV+UhZSMSS91c3IvbG9jYWwvbGliL3B5dGhvbjMuMTAvZGlzdC1wYWNrYWdlcy9zdGFibGVfYmFzZWxpbmVzMy9jb21tb24vdXRpbHMucHmUjARmdW5jlEuEQwIEAZSMA3ZhbJSFlCl0lFKUfZQojAtfX3BhY2thZ2VfX5SMGHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbpSMCF9fbmFtZV9flIwec3RhYmxlX2Jhc2VsaW5lczMuY29tbW9uLnV0aWxzlIwIX19maWxlX1+UjEkvdXNyL2xvY2FsL2xpYi9weXRob24zLjEwL2Rpc3QtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lHVOTmgAjBBfbWFrZV9lbXB0eV9jZWxslJOUKVKUhZR0lFKUjBxjbG91ZHBpY2tsZS5jbG91ZHBpY2tsZV9mYXN0lIwSX2Z1bmN0aW9uX3NldHN0YXRllJOUaB99lH2UKGgWaA2MDF9fcXVhbG5hbWVfX5SMGWNvbnN0YW50X2ZuLjxsb2NhbHM+LmZ1bmOUjA9fX2Fubm90YXRpb25zX1+UfZSMDl9fa3dkZWZhdWx0c19flE6MDF9fZGVmYXVsdHNfX5ROjApfX21vZHVsZV9flGgXjAdfX2RvY19flE6MC19fY2xvc3VyZV9flGgAjApfbWFrZV9jZWxslJOURz8zqSowVTJhhZRSlIWUjBdfY2xvdWRwaWNrbGVfc3VibW9kdWxlc5RdlIwLX19nbG9iYWxzX1+UfZR1hpSGUjAu"}, "system_info": {"OS": "Linux-6.1.85+-x86_64-with-glibc2.35 # 1 SMP PREEMPT_DYNAMIC Thu Jun 27 21:05:47 UTC 2024", "Python": "3.10.12", "Stable-Baselines3": "2.0.0a5", "PyTorch": "2.4.1+cu121", "GPU Enabled": "False", "Numpy": "1.26.4", "Cloudpickle": "2.2.1", "Gymnasium": "0.28.1", "OpenAI Gym": "0.25.2"}}
 
1
+ {"policy_class": {":type:": "<class 'abc.ABCMeta'>", ":serialized:": "gAWVOwAAAAAAAACMIXN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi5wb2xpY2llc5SMEUFjdG9yQ3JpdGljUG9saWN5lJOULg==", "__module__": "stable_baselines3.common.policies", "__doc__": "\n Policy class for actor-critic algorithms (has both policy and value prediction).\n Used by A2C, PPO and the likes.\n\n :param observation_space: Observation space\n :param action_space: Action space\n :param lr_schedule: Learning rate schedule (could be constant)\n :param net_arch: The specification of the policy and value networks.\n :param activation_fn: Activation function\n :param ortho_init: Whether to use or not orthogonal initialization\n :param use_sde: Whether to use State Dependent Exploration or not\n :param log_std_init: Initial value for the log standard deviation\n :param full_std: Whether to use (n_features x n_actions) parameters\n for the std instead of only (n_features,) when using gSDE\n :param use_expln: Use ``expln()`` function instead of ``exp()`` to ensure\n a positive standard deviation (cf paper). It allows to keep variance\n above zero and prevent it from growing too fast. In practice, ``exp()`` is usually enough.\n :param squash_output: Whether to squash the output using a tanh function,\n this allows to ensure boundaries when using gSDE.\n :param features_extractor_class: Features extractor to use.\n :param features_extractor_kwargs: Keyword arguments\n to pass to the features extractor.\n :param share_features_extractor: If True, the features extractor is shared between the policy and value networks.\n :param normalize_images: Whether to normalize images or not,\n dividing by 255.0 (True by default)\n :param optimizer_class: The optimizer to use,\n ``th.optim.Adam`` by default\n :param optimizer_kwargs: Additional keyword arguments,\n excluding the learning rate, to pass to the optimizer\n ", "__init__": "<function ActorCriticPolicy.__init__ at 0x7958585969e0>", "_get_constructor_parameters": "<function ActorCriticPolicy._get_constructor_parameters at 0x795858596a70>", "reset_noise": "<function ActorCriticPolicy.reset_noise at 0x795858596b00>", "_build_mlp_extractor": "<function ActorCriticPolicy._build_mlp_extractor at 0x795858596b90>", "_build": "<function ActorCriticPolicy._build at 0x795858596c20>", "forward": "<function ActorCriticPolicy.forward at 0x795858596cb0>", "extract_features": "<function ActorCriticPolicy.extract_features at 0x795858596d40>", "_get_action_dist_from_latent": "<function ActorCriticPolicy._get_action_dist_from_latent at 0x795858596dd0>", "_predict": "<function ActorCriticPolicy._predict at 0x795858596e60>", "evaluate_actions": "<function ActorCriticPolicy.evaluate_actions at 0x795858596ef0>", "get_distribution": "<function ActorCriticPolicy.get_distribution at 0x795858596f80>", "predict_values": "<function ActorCriticPolicy.predict_values at 0x795858597010>", "__abstractmethods__": "frozenset()", "_abc_impl": "<_abc._abc_data object at 0x79585853ac80>"}, "verbose": 1, "policy_kwargs": {}, "num_timesteps": 19376, "_total_timesteps": 1000000, "_num_timesteps_at_start": 0, "seed": null, "action_noise": null, "start_time": 1730001373119574991, "learning_rate": 0.0003, "tensorboard_log": null, "_last_obs": {":type:": "<class 'numpy.ndarray'>", ":serialized:": "gAWVdQIAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYAAgAAAAAAAGam6rnhkIq6NvPevJSgnzblAqA5210PtgAAgD8AAIA/c8jivew++bv4FiA+ccRMvMdBVT1T5yo9AACAPwAAgD/mFdS9e/yOusPSyDiUvAc0pOAaO6pG6bcAAIA/AAAAAE0/Bb2pK2W8NgiWve7qZD39jKI9lcPdOwAAgD8AAIA/ZiIUvVwrR7ob5Kg6EaMwNNo80rqOyMS5AACAPwAAgD9mLWi9rquGuhLUdrxcZog83FDYO8IKbb0AAIA/AACAPwD/kbwpCHK6IqJ2vcEM1zLppQe4/j7XswAAgD8AAIA/AIKnvLiTprvnBjW8X16TPItk7zwQT3q9AACAPwAAgD9ApUQ+0+hOPyrAHj6zR/S+PJpcPolrnTwAAAAAAAAAAECw673DwUa6Bh6VOpPM/7XDjjg78Fm2uQAAgD8AAAAAAMZ2PLgA4zquFQy+3hc6vgdbC712PYE7AAAAAAAAAACaTmO9w9EtusUdUTrVoyU0dahCugpJdbkAAIA/AACAP5pNhTsUlpW6JmCNO53ZpbZRdgg7hZWjugAAgD8AAIA/Tf1avRT+mbqlGcC8dBqcPCKI7jpSGYc9AACAPwAAgD+NOSe+CoclPE4ExTom9be4Lh21vd1w47kAAIA/AACAP5qAi709IVe7sv+ZvRKxGz1hDas8I2cCvgAAgD8AAIA/lIwFbnVtcHmUjAVkdHlwZZSTlIwCZjSUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYksQSwiGlIwBQ5R0lFKULg=="}, "_last_episode_starts": {":type:": "<class 'numpy.ndarray'>", ":serialized:": "gAWVgwAAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYQAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACUjAVudW1weZSMBWR0eXBllJOUjAJiMZSJiIeUUpQoSwOMAXyUTk5OSv////9K/////0sAdJRiSxCFlIwBQ5R0lFKULg=="}, "_last_original_obs": null, "_episode_num": 0, "use_sde": false, "sde_sample_freq": -1, "_current_progress_remaining": 0.983616, "_stats_window_size": 100, "ep_info_buffer": {":type:": "<class 'collections.deque'>", ":serialized:": "gAWVNgIAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKUKH2UKIwBcpRHQHCGBzV+Zw6MAWyUTY4CjAF0lEdAlLxLxI8QqnV9lChoBkdAZINztkWhy2gHTegDaAhHQJTSPLkjopx1fZQoaAZHQGZEwnhKlHloB03oA2gIR0CU0j6tT1kEdX2UKGgGR0BpUJaTwDvFaAdN6ANoCEdAlNJAcT8HfXV9lChoBkdAZ67n6Eal12gHTegDaAhHQJTSQlzEJjV1fZQoaAZHQGOubHAAQxxoB03oA2gIR0CU0kS4OMESdX2UKGgGR0BhYJJd0JWvaAdN6ANoCEdAlNJGViWmg3V9lChoBkdAZMMp97Wuo2gHTegDaAhHQJTSSI68xsV1fZQoaAZHQGFkxWT5ftxoB03oA2gIR0CU0knnuAqedX2UKGgGR0BieOGGmDUWaAdN6ANoCEdAlNJL4vexfXV9lChoBkdAYhzmEoOQQ2gHTegDaAhHQJTSTgYP5Hp1fZQoaAZHQGCPIJiRW91oB03oA2gIR0CU0lCYCyQgdX2UKGgGR0BmAvO8kD6naAdN6ANoCEdAlNJSeVcD83V9lChoBkdAZBS49X9zfmgHTegDaAhHQJTSVGqgh8p1fZQoaAZHQGUhHDBMzuZoB03oA2gIR0CU0lXpnpSrdX2UKGgGR0BlVB6t1ZDBaAdN6ANoCEdAlNJXctXgcnV9lChoBkdAQPZ9Cu2ZzGgHS5loCEdAlOxdeUpuuXVlLg=="}, "ep_success_buffer": {":type:": "<class 'collections.deque'>", ":serialized:": "gAWVIAAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKULg=="}, "_n_updates": 252, "observation_space": {":type:": "<class 'gymnasium.spaces.box.Box'>", ":serialized:": "gAWVdgIAAAAAAACMFGd5bW5hc2l1bS5zcGFjZXMuYm94lIwDQm94lJOUKYGUfZQojAVkdHlwZZSMBW51bXB5lIwFZHR5cGWUk5SMAmY0lImIh5RSlChLA4wBPJROTk5K/////0r/////SwB0lGKMDWJvdW5kZWRfYmVsb3eUjBJudW1weS5jb3JlLm51bWVyaWOUjAtfZnJvbWJ1ZmZlcpSTlCiWCAAAAAAAAAABAQEBAQEBAZRoCIwCYjGUiYiHlFKUKEsDjAF8lE5OTkr/////Sv////9LAHSUYksIhZSMAUOUdJRSlIwNYm91bmRlZF9hYm92ZZRoESiWCAAAAAAAAAABAQEBAQEBAZRoFUsIhZRoGXSUUpSMBl9zaGFwZZRLCIWUjANsb3eUaBEoliAAAAAAAAAAAAC0wgAAtMIAAKDAAACgwNsPScAAAKDAAAAAgAAAAICUaAtLCIWUaBl0lFKUjARoaWdolGgRKJYgAAAAAAAAAAAAtEIAALRCAACgQAAAoEDbD0lAAACgQAAAgD8AAIA/lGgLSwiFlGgZdJRSlIwIbG93X3JlcHKUjFtbLTkwLiAgICAgICAgLTkwLiAgICAgICAgIC01LiAgICAgICAgIC01LiAgICAgICAgIC0zLjE0MTU5MjcgIC01LgogIC0wLiAgICAgICAgIC0wLiAgICAgICBdlIwJaGlnaF9yZXBylIxTWzkwLiAgICAgICAgOTAuICAgICAgICAgNS4gICAgICAgICA1LiAgICAgICAgIDMuMTQxNTkyNyAgNS4KICAxLiAgICAgICAgIDEuICAgICAgIF2UjApfbnBfcmFuZG9tlE51Yi4=", "dtype": "float32", "bounded_below": "[ True True True True True True True True]", "bounded_above": "[ True True True True True True True True]", "_shape": [8], "low": "[-90. -90. -5. -5. -3.1415927 -5.\n -0. -0. ]", "high": "[90. 90. 5. 5. 3.1415927 5.\n 1. 1. ]", "low_repr": "[-90. -90. -5. -5. -3.1415927 -5.\n -0. -0. ]", "high_repr": "[90. 90. 5. 5. 3.1415927 5.\n 1. 1. ]", "_np_random": null}, "action_space": {":type:": "<class 'gymnasium.spaces.discrete.Discrete'>", ":serialized:": "gAWV2wAAAAAAAACMGWd5bW5hc2l1bS5zcGFjZXMuZGlzY3JldGWUjAhEaXNjcmV0ZZSTlCmBlH2UKIwBbpSMFW51bXB5LmNvcmUubXVsdGlhcnJheZSMBnNjYWxhcpSTlIwFbnVtcHmUjAVkdHlwZZSTlIwCaTiUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYkMIBAAAAAAAAACUhpRSlIwFc3RhcnSUaAhoDkMIAAAAAAAAAACUhpRSlIwGX3NoYXBllCmMBWR0eXBllGgOjApfbnBfcmFuZG9tlE51Yi4=", "n": "4", "start": "0", "_shape": [], "dtype": "int64", "_np_random": null}, "n_envs": 16, "n_steps": 1024, "gamma": 0.999, "gae_lambda": 0.98, "ent_coef": 0.01, "vf_coef": 0.5, "max_grad_norm": 0.5, "batch_size": 64, "n_epochs": 4, "clip_range": {":type:": "<class 'function'>", ":serialized:": "gAWVxQIAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwOX21ha2VfZnVuY3Rpb26Uk5QoaACMDV9idWlsdGluX3R5cGWUk5SMCENvZGVUeXBllIWUUpQoSwFLAEsASwFLAUsTQwSIAFMAlE6FlCmMAV+UhZSMSS91c3IvbG9jYWwvbGliL3B5dGhvbjMuMTAvZGlzdC1wYWNrYWdlcy9zdGFibGVfYmFzZWxpbmVzMy9jb21tb24vdXRpbHMucHmUjARmdW5jlEuEQwIEAZSMA3ZhbJSFlCl0lFKUfZQojAtfX3BhY2thZ2VfX5SMGHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbpSMCF9fbmFtZV9flIwec3RhYmxlX2Jhc2VsaW5lczMuY29tbW9uLnV0aWxzlIwIX19maWxlX1+UjEkvdXNyL2xvY2FsL2xpYi9weXRob24zLjEwL2Rpc3QtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lHVOTmgAjBBfbWFrZV9lbXB0eV9jZWxslJOUKVKUhZR0lFKUjBxjbG91ZHBpY2tsZS5jbG91ZHBpY2tsZV9mYXN0lIwSX2Z1bmN0aW9uX3NldHN0YXRllJOUaB99lH2UKGgWaA2MDF9fcXVhbG5hbWVfX5SMGWNvbnN0YW50X2ZuLjxsb2NhbHM+LmZ1bmOUjA9fX2Fubm90YXRpb25zX1+UfZSMDl9fa3dkZWZhdWx0c19flE6MDF9fZGVmYXVsdHNfX5ROjApfX21vZHVsZV9flGgXjAdfX2RvY19flE6MC19fY2xvc3VyZV9flGgAjApfbWFrZV9jZWxslJOURz/JmZmZmZmahZRSlIWUjBdfY2xvdWRwaWNrbGVfc3VibW9kdWxlc5RdlIwLX19nbG9iYWxzX1+UfZR1hpSGUjAu"}, "clip_range_vf": null, "normalize_advantage": true, "target_kl": null, "lr_schedule": {":type:": "<class 'function'>", ":serialized:": "gAWVxQIAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwOX21ha2VfZnVuY3Rpb26Uk5QoaACMDV9idWlsdGluX3R5cGWUk5SMCENvZGVUeXBllIWUUpQoSwFLAEsASwFLAUsTQwSIAFMAlE6FlCmMAV+UhZSMSS91c3IvbG9jYWwvbGliL3B5dGhvbjMuMTAvZGlzdC1wYWNrYWdlcy9zdGFibGVfYmFzZWxpbmVzMy9jb21tb24vdXRpbHMucHmUjARmdW5jlEuEQwIEAZSMA3ZhbJSFlCl0lFKUfZQojAtfX3BhY2thZ2VfX5SMGHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbpSMCF9fbmFtZV9flIwec3RhYmxlX2Jhc2VsaW5lczMuY29tbW9uLnV0aWxzlIwIX19maWxlX1+UjEkvdXNyL2xvY2FsL2xpYi9weXRob24zLjEwL2Rpc3QtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lHVOTmgAjBBfbWFrZV9lbXB0eV9jZWxslJOUKVKUhZR0lFKUjBxjbG91ZHBpY2tsZS5jbG91ZHBpY2tsZV9mYXN0lIwSX2Z1bmN0aW9uX3NldHN0YXRllJOUaB99lH2UKGgWaA2MDF9fcXVhbG5hbWVfX5SMGWNvbnN0YW50X2ZuLjxsb2NhbHM+LmZ1bmOUjA9fX2Fubm90YXRpb25zX1+UfZSMDl9fa3dkZWZhdWx0c19flE6MDF9fZGVmYXVsdHNfX5ROjApfX21vZHVsZV9flGgXjAdfX2RvY19flE6MC19fY2xvc3VyZV9flGgAjApfbWFrZV9jZWxslJOURz8zqSowVTJhhZRSlIWUjBdfY2xvdWRwaWNrbGVfc3VibW9kdWxlc5RdlIwLX19nbG9iYWxzX1+UfZR1hpSGUjAu"}, "system_info": {"OS": "Linux-6.1.85+-x86_64-with-glibc2.35 # 1 SMP PREEMPT_DYNAMIC Thu Jun 27 21:05:47 UTC 2024", "Python": "3.10.12", "Stable-Baselines3": "2.0.0a5", "PyTorch": "2.4.1+cu121", "GPU Enabled": "True", "Numpy": "1.26.4", "Cloudpickle": "2.2.1", "Gymnasium": "0.28.1", "OpenAI Gym": "0.25.2"}}
ppo-LunarLander-v2.zip CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:78bf08f25e7770711bfaa2dbe444329ea6228262af6508b6c642007c4aafb83e
3
- size 149669
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5fd7d530f33f43b8a844b7f287650db423627a964d284bed112001d1453883ac
3
+ size 144641
ppo-LunarLander-v2/data CHANGED
@@ -4,54 +4,54 @@
4
  ":serialized:": "gAWVOwAAAAAAAACMIXN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi5wb2xpY2llc5SMEUFjdG9yQ3JpdGljUG9saWN5lJOULg==",
5
  "__module__": "stable_baselines3.common.policies",
6
  "__doc__": "\n Policy class for actor-critic algorithms (has both policy and value prediction).\n Used by A2C, PPO and the likes.\n\n :param observation_space: Observation space\n :param action_space: Action space\n :param lr_schedule: Learning rate schedule (could be constant)\n :param net_arch: The specification of the policy and value networks.\n :param activation_fn: Activation function\n :param ortho_init: Whether to use or not orthogonal initialization\n :param use_sde: Whether to use State Dependent Exploration or not\n :param log_std_init: Initial value for the log standard deviation\n :param full_std: Whether to use (n_features x n_actions) parameters\n for the std instead of only (n_features,) when using gSDE\n :param use_expln: Use ``expln()`` function instead of ``exp()`` to ensure\n a positive standard deviation (cf paper). It allows to keep variance\n above zero and prevent it from growing too fast. In practice, ``exp()`` is usually enough.\n :param squash_output: Whether to squash the output using a tanh function,\n this allows to ensure boundaries when using gSDE.\n :param features_extractor_class: Features extractor to use.\n :param features_extractor_kwargs: Keyword arguments\n to pass to the features extractor.\n :param share_features_extractor: If True, the features extractor is shared between the policy and value networks.\n :param normalize_images: Whether to normalize images or not,\n dividing by 255.0 (True by default)\n :param optimizer_class: The optimizer to use,\n ``th.optim.Adam`` by default\n :param optimizer_kwargs: Additional keyword arguments,\n excluding the learning rate, to pass to the optimizer\n ",
7
- "__init__": "<function ActorCriticPolicy.__init__ at 0x7842672c4700>",
8
- "_get_constructor_parameters": "<function ActorCriticPolicy._get_constructor_parameters at 0x7842672c4790>",
9
- "reset_noise": "<function ActorCriticPolicy.reset_noise at 0x7842672c4820>",
10
- "_build_mlp_extractor": "<function ActorCriticPolicy._build_mlp_extractor at 0x7842672c48b0>",
11
- "_build": "<function ActorCriticPolicy._build at 0x7842672c4940>",
12
- "forward": "<function ActorCriticPolicy.forward at 0x7842672c49d0>",
13
- "extract_features": "<function ActorCriticPolicy.extract_features at 0x7842672c4a60>",
14
- "_get_action_dist_from_latent": "<function ActorCriticPolicy._get_action_dist_from_latent at 0x7842672c4af0>",
15
- "_predict": "<function ActorCriticPolicy._predict at 0x7842672c4b80>",
16
- "evaluate_actions": "<function ActorCriticPolicy.evaluate_actions at 0x7842672c4c10>",
17
- "get_distribution": "<function ActorCriticPolicy.get_distribution at 0x7842672c4ca0>",
18
- "predict_values": "<function ActorCriticPolicy.predict_values at 0x7842672c4d30>",
19
  "__abstractmethods__": "frozenset()",
20
- "_abc_impl": "<_abc._abc_data object at 0x784267265f40>"
21
  },
22
  "verbose": 1,
23
  "policy_kwargs": {},
24
- "num_timesteps": 1048576,
25
  "_total_timesteps": 1000000,
26
  "_num_timesteps_at_start": 0,
27
  "seed": null,
28
  "action_noise": null,
29
- "start_time": 1729035310669125360,
30
  "learning_rate": 0.0003,
31
  "tensorboard_log": null,
32
  "_last_obs": {
33
  ":type:": "<class 'numpy.ndarray'>",
34
- ":serialized:": "gAWVdQgAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYACAAAAAAAAM0Ucrt7RJK6CYWOue9shTK2GBe7AU6iOAAAgD8AAIA/Mzflu0ilgbqzeh84M5iysr6hb7tVVTW3AACAPwAAgD9NYhy9XIsIumEftTdAXGAyvBWaOgAb0LYAAIA/AACAPzN9g73DKUi64r90OhomDbZ7Pom752eMuQAAgD8AAIA/AKbsPEi/jrr6dsc60a+pNaV+O7t9Zee5AACAPwAAgD9m1LW8CrdEuWLOP7nJf8GzJZMEu5LDYDgAAIA/AACAP+aUAz32LGq6IGaQOpcIZjXvPNa69fCluQAAgD8AAIA/zQ6ePMroCT51UdW9y+WyvqdGCz2nXTS9AAAAAAAAAAAzpqC8w9UxOdFhEDm8BK4zp48VO5SeK7gAAIA/AACAPzYceb4AMqk+Ll22PqH1vb5IAXm940/GPQAAAAAAAAAAgGGKvRRAjrrHYJ+4gI2PsoB4O7uyv7Y3AACAPwAAgD+aRvW9H7eLu5jdsD4TUQS+gYOUvfiZpD4AAIA/AAAAAACAZblci326OsBJuBt7i7MZWiK68pJoNwAAgD8AAIA/5uNfPZQuwj8GeoE+j9CIvLeqhD115wc+AAAAAAAAAAAgJAC+PXY4u8FZPLtxSKa4qwx/PBo4cDoAAIA/AACAP81497zDVSi6ksO5uZEBJbVUACY7LvfXOAAAgD8AAIA/mnmaOo8eW7osKJy7VZZaOIjo1TraH5E4AACAPwAAgD+zABO9e26JuvZyj7s+H5U2iWTmud6OBrYAAIA/AACAPwAAOrwfBc63K3XLOsaXBTYwkZk5QKzwuQAAgD8AAIA/AGn+vFLA7LmlXTE4niR/MwpKJTpzjU63AACAPwAAgD+aImE95quVPw14Dz57khC/JenFPWU3yTwAAAAAAAAAAACltrz2+EO6J+4oOMlQSzOHFBe7QqNDtwAAgD8AAIA/mrOXva4thrq7PzY40R4eM30CljkNAU+3AACAPwAAgD+aJ/C89hw4ulZWJbjyRYyzW5WcOlzGPDcAAIA/AACAP1pY5L0freO5Pl7fuuM4UraMhUE6AWkBOgAAgD8AAIA/M1AcvVyDGbqdpcw22mmFMSHrvTplLOy1AACAPwAAgD8AkJG69jRTuktij7l3CiWz6PcROvPDpjgAAIA/AACAP81ok7xIt5a6brEVt9MCRrJ3c6g5Kq8qNgAAgD8AAIA/Zj5LPFyvR7riBqc6d91CNaNJnLtMh8S5AACAPwAAgD+aSpa9hcPeucyzp7KTy/IuQ4VJu9LDKDMAAIA/AACAP7Oyfr2P3h+6EjK/t2KksLKulSg6IP7jNgAAgD8AAIA/sxh+PcP1frog+GU6PEhuNqTfGzveaoO5AACAPwAAgD+a5b67w4lmutv42TpZiCs26Z+ROM2G+7kAAIA/AACAP5rrarxcAzO682O0Npp00DAlF6A5Bp7VtQAAgD8AAIA/5ozFvezJyLndOXW43DZNs6paPrlltow3AACAPwAAgD+aJbs8+4+ZvFr38L1QiS08Mkj4PexaozoAAIA/AACAP00MRL17Oo26CqeaOVUjUbNzV8q6ryuwuAAAgD8AAIA/s5aJPlm08D4/NJa+4Ikvv63Ybz4DaFK+AAAAAAAAAABAXrK9d281PlIqmL0LoL++i5W8vRjXLbwAAAAAAAAAADPb+Tv21Ae6QKpCu7mIAbdBDgW6O51kOgAAgD8AAIA/ZrBLPFzbCbrSkq26TPoHtkbVgTrrDc05AACAPwAAgD8zI/Y6SJOEur2QoraJf3UwwjJEu0pmuzUAAIA/AACAPwAjkTyPFgC6TWFhOgA4zrOZjKM6sZyEuQAAgD8AAIA/5oJuvVy7HbqRIwk7UUAuOMZCr7pMrqS5AACAPwAAgD+aHu88w7E4urCvJjp9lZC1z0E6O0xoRLkAAIA/AACAPwAGyr0paGy6yAOXvD0sszzRU4+6KjWbvQAAgD8AAIA/rbUCPqQtDT5G9re+niCOvm1YJL0VBTa+AAAAAAAAAAAAoLu7e9qGuvIIq7uHZUE4DONwOirQErYAAIA/AACAP5qmZT32rF+6+uKKu8RUlbaXmaA6ZREINgAAgD8AAIA/TZWOvfZkR7qsbYM5BuufNDu/WDou3pm4AACAPwAAgD8ATME7pNxCOtGFvbp0am28l3ZQOiGxlTsAAAAAAAAAALOuc71IO6O6pSUlOZOKFzS3tp+6IBg+uAAAgD8AAIA/es83PvSvXz92N8Y9fYskv9vSlT5U5K+8AAAAAAAAAABmUoe8SPGMuulkmDrAvAk2REX9OgxHrrkAAIA/AACAP02iWb3h4oC6WALlOprD1zVdLhU77rUFugAAgD8AAIA/TYVzvY9eTrqmlHs5nb2tNPupvrp+vpC4AACAPwAAgD8zlgm9KehDuhLzlbSpIy2wzH3AuSKhfDMAAIA/AACAP4C4lL1cbV+8IwP2PX3/fD0O6pS97t28OgAAgD8AAIA/NeWYvgj7/T56/pA+2ioTv8z4er5gxE4+AAAAAAAAAAA6vFQ+p4tkPuJ7xL5cJ96+TkRQPr4JXb4AAAAAAAAAAGbcdryPThC6ZmiUNTFUkTAx2ME4Gj+vtAAAgD8AAIA/M3/mPCnweroim7k74V+2N4quSroOd282AACAPwAAgD+zpRK9w6l/urq/XjiPb3kzIMY3uOjggbcAAIA/AACAP02vdr0fDfy51iVbOV3pnzTp4ck6s6N+uAAAgD8AAIA/lIwFbnVtcHmUjAVkdHlwZZSTlIwCZjSUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYktASwiGlIwBQ5R0lFKULg=="
35
  },
36
  "_last_episode_starts": {
37
  ":type:": "<class 'numpy.ndarray'>",
38
- ":serialized:": "gAWVswAAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJZAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACUjAVudW1weZSMBWR0eXBllJOUjAJiMZSJiIeUUpQoSwOMAXyUTk5OSv////9K/////0sAdJRiS0CFlIwBQ5R0lFKULg=="
39
  },
40
  "_last_original_obs": null,
41
  "_episode_num": 0,
42
  "use_sde": false,
43
  "sde_sample_freq": -1,
44
- "_current_progress_remaining": -0.04857599999999995,
45
  "_stats_window_size": 100,
46
  "ep_info_buffer": {
47
  ":type:": "<class 'collections.deque'>",
48
- ":serialized:": "gAWVNgwAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKUKH2UKIwBcpRHQGTCuYhMajyMAWyUTegDjAF0lEdAtO/QS6DoQnV9lChoBkdAcdKnEETxomgHTXkBaAhHQLTwz32mHgx1fZQoaAZHQHOA59E1EVpoB01eAmgIR0C08M/GIbfhdX2UKGgGR0Bk9o4n4O+aaAdN6ANoCEdAtPFHPQfIS3V9lChoBkdAcBrntv4ub2gHTbkBaAhHQLTxaXyRSxZ1fZQoaAZHQGPd90A93bFoB03oA2gIR0C08bJg9eQddX2UKGgGR0Bwu86vJRwZaAdNlAJoCEdAtPM4xFiKBXV9lChoBkdAS4FrsSkCWGgHS5doCEdAtPOrDgqEvnV9lChoBkdAbiY4bS7XhGgHTTwBaAhHQLT0KGACnxd1fZQoaAZHQHN0HLFGXoloB021AWgIR0C09FUknkT6dX2UKGgGR0BlH12aDwpfaAdN6ANoCEdAtPRh34bjtHV9lChoBkdAccS3bmEGq2gHTUwCaAhHQLT0kNZeRgZ1fZQoaAZHQHH7/lMh5gRoB02gA2gIR0C09L4oqkM1dX2UKGgGR0BfmAHE/B3zaAdN6ANoCEdAtPT1B8hLXnV9lChoBkdAaJ4PRzBAOmgHTegDaAhHQLT3KFuvUz91fZQoaAZHQEoofvnbItFoB0uDaAhHQLT3eNgBtDV1fZQoaAZHQHF6k1EVnEloB03EAWgIR0C0+wbIxQBQdX2UKGgGR0Bg7M3XI2fkaAdN6ANoCEdAtPsmC2+fy3V9lChoBkdAanIDQJHAh2gHTegDaAhHQLT7Z94/u9h1fZQoaAZHQGkawHqu8sdoB03oA2gIR0C0+5E6HTJAdX2UKGgGR0BmBBwbVBldaAdN6ANoCEdAtPyJo+Ofd3V9lChoBkdAaGykj5bhWGgHTegDaAhHQLT80lFMIu51fZQoaAZHQGS0U5lvqC9oB03oA2gIR0C0/RhHTZxrdX2UKGgGR0Bk+0n5SFXaaAdN6ANoCEdAtP1/juKGcnV9lChoBkdAcPGGxUvPC2gHTbsBaAhHQLT9lwZflZJ1fZQoaAZHQHE6AWnCO3loB02lA2gIR0C0/aKhUR4AdX2UKGgGR0BBDizLOiWWaAdLe2gIR0C0/dvTCtRvdX2UKGgGR0BznE0cfeUIaAdNTwNoCEdAtP4gtXgccXV9lChoBkdAcIJm0mdAgWgHTRIBaAhHQLT+mZ/kNnZ1fZQoaAZHQGdtZEMLF4toB03oA2gIR0C0/xKtDD0ldX2UKGgGR0BT2kQf6oETaAdLpWgIR0C0//yzcAR1dX2UKGgGR0BozDQTmGM5aAdN6ANoCEdAtQLskzGgjHV9lChoBkdAUYdjoZAIIGgHS4BoCEdAtQPl7F85S3V9lChoBkdAcqjzp5eJHmgHS8ZoCEdAtQQ80YTCcnV9lChoBkdAcICmKIi1RmgHTYACaAhHQLUFvhNucc51fZQoaAZHQHD/D0cwQDpoB03hAWgIR0C1BlPN3W4FdX2UKGgGR0BxBRhy8zyjaAdN3wNoCEdAtQfCymhufnV9lChoBkdAZ6mFeOXE62gHTegDaAhHQLUJgUFSsKd1fZQoaAZHQGrxYuTRplBoB03oA2gIR0C1CqePeYUndX2UKGgGR0Bl1Qzi0fHQaAdN6ANoCEdAtQqphnanJnV9lChoBkdAcVo0uUUwjGgHTQsDaAhHQLULiWCVbA11fZQoaAZHQHEcAb6xgRdoB00yAmgIR0C1DJlQMx46dX2UKGgGR0BxdItWdVebaAdNCANoCEdAtQ1B23azvHV9lChoBkdAYsubDuSfUWgHTegDaAhHQLUOJcPOIIp1fZQoaAZHQHLhS2UjcEhoB01MA2gIR0C1DwHVwxWUdX2UKGgGR0BNvkvTPSlWaAdLkmgIR0C1D0Ih+vyLdX2UKGgGR0BjiTEP1+RYaAdN6ANoCEdAtRDPcnE2pHV9lChoBkdAaLursByS3mgHTegDaAhHQLURJ2zv7WN1fZQoaAZHQGNLsWweNkxoB03oA2gIR0C1EdUfgaWHdX2UKGgGR0BiQDW3BpHqaAdN6ANoCEdAtRIiL876pHV9lChoBkdAYbuQtBfKIWgHTegDaAhHQLUSI6By0a91fZQoaAZHQGU6HeizsyBoB03oA2gIR0C1Ejxf8dgfdX2UKGgGR0Bm+PYvnKW+aAdN6ANoCEdAtRRsyvcJt3V9lChoBkdAcw6qHXVbzWgHTSkDaAhHQLUVC0p3HJd1fZQoaAZHQHFnCuuA7PpoB0vkaAhHQLUXUnogV451fZQoaAZHQGR0ylFc6eZoB03oA2gIR0C1F1MkY4yXdX2UKGgGR0Bx7A9zOopAaAdNDQJoCEdAtRgcj2SMcnV9lChoBkdAcsAgte2NN2gHTRsCaAhHQLUYWCOWBz51fZQoaAZHQGZFpblijL1oB03oA2gIR0C1GMdDpkf+dX2UKGgGR0BpIqPZIxxlaAdN6ANoCEdAtRjG+0w8GXV9lChoBkdAZbzxlQMx5GgHTegDaAhHQLUYxy5Zr591fZQoaAZHQGhqyGrS3LFoB03oA2gIR0C1GMbTx5LRdX2UKGgGR0BnKIqwyIpIaAdN6ANoCEdAtRjHSYw7DHV9lChoBkdAZjII5YHPeGgHTegDaAhHQLUYx6Q/5cl1fZQoaAZHQGIyUqpcX3xoB03oA2gIR0C1GMiGahHtdX2UKGgGR0Bl1evbGm1qaAdN6ANoCEdAtRjJGWldknV9lChoBkdAYwYr92ovSWgHTegDaAhHQLUYyVeruIB1fZQoaAZHQHBOq8QI2O1oB004A2gIR0C1GcnDBMzudX2UKGgGR0BmHWzfJmulaAdN6ANoCEdAtRnpkK/mDHV9lChoBkdAahNHZsbedmgHTegDaAhHQLUbJNlyzX11fZQoaAZHQHDnCq2jO9poB01yA2gIR0C1Gy9ELH+7dX2UKGgGR0Bxog2XLNfPaAdNVgJoCEdAtRuEW69TP3V9lChoBkdAckqRoAXEZWgHTTADaAhHQLUc0iy6cy51fZQoaAZHQHGWfiYLLIRoB00lA2gIR0C1HPVdPci4dX2UKGgGR0BpGYGSpzcRaAdN6ANoCEdAtR0BKAavR3V9lChoBkdARSVZA6dUbWgHS3xoCEdAtR3uenQ6ZHV9lChoBkdAZjJyZrpJPWgHTegDaAhHQLUd7s/6frd1fZQoaAZHQGkdYZVGTcJoB03oA2gIR0C1Hg4bn5i3dX2UKGgGR0BymJ9d/rjYaAdNCwJoCEdAtR5ODCgsb3V9lChoBkdARJjQqqfe12gHS4VoCEdAtR5OdYnv2HV9lChoBkdAcfkp2ll9SmgHTZYCaAhHQLUe6YlpoK51fZQoaAZHQGf7dl/YraxoB03oA2gIR0C1HzxgZ0jkdX2UKGgGR0Bpd+fNA1NyaAdN6ANoCEdAtR9bRZ2ZA3V9lChoBkdAaHYN4qwyI2gHTegDaAhHQLUfhAprk811fZQoaAZHQHI2DkMkQf9oB03oAWgIR0C1IIcGgSOBdX2UKGgGR0Bk4skrwvxpaAdN6ANoCEdAtSIbP3SKFnV9lChoBkdAbu6j3225QWgHTZkBaAhHQLUicpXIU8F1fZQoaAZHQHO9QOBlMAZoB00uA2gIR0C1Iy4A4n4PdX2UKGgGR0Bv80aqCHymaAdLwWgIR0C1JAazu4PPdX2UKGgGR0BkLM1fmcOLaAdN6ANoCEdAtSRMDB/I83V9lChoBkdAYyBlq8DjimgHTegDaAhHQLUku4R28qZ1fZQoaAZHQGVRbVrhzeZoB03oA2gIR0C1JTgEZBLPdX2UKGgGR0Bqupf0Eov0aAdN6ANoCEdAtSVwipvP1XV9lChoBkdAR5A8p1A7gmgHS2BoCEdAtSVw7p3X7XV9lChoBkdAcQ3/tY0VJ2gHTTcDaAhHQLUlkcu8K5V1fZQoaAZHQGZPIHTqjahoB03oA2gIR0C1JZywwCbMdX2UKGgGR0Byhpr0rbxmaAdN5wNoCEdAtSW8/oq0+nV9lChoBkdAcCiYmb9ZR2gHS8ZoCEdAtSdAcWCVbHV9lChoBkdAcJkkJ8fFJmgHTbkCaAhHQLUnxhakhzN1fZQoaAZHQFAkgZ0jkdVoB0uQaAhHQLUpBhXbM5h1ZS4="
49
  },
50
  "ep_success_buffer": {
51
  ":type:": "<class 'collections.deque'>",
52
  ":serialized:": "gAWVIAAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKULg=="
53
  },
54
- "_n_updates": 240,
55
  "observation_space": {
56
  ":type:": "<class 'gymnasium.spaces.box.Box'>",
57
  ":serialized:": "gAWVdgIAAAAAAACMFGd5bW5hc2l1bS5zcGFjZXMuYm94lIwDQm94lJOUKYGUfZQojAVkdHlwZZSMBW51bXB5lIwFZHR5cGWUk5SMAmY0lImIh5RSlChLA4wBPJROTk5K/////0r/////SwB0lGKMDWJvdW5kZWRfYmVsb3eUjBJudW1weS5jb3JlLm51bWVyaWOUjAtfZnJvbWJ1ZmZlcpSTlCiWCAAAAAAAAAABAQEBAQEBAZRoCIwCYjGUiYiHlFKUKEsDjAF8lE5OTkr/////Sv////9LAHSUYksIhZSMAUOUdJRSlIwNYm91bmRlZF9hYm92ZZRoESiWCAAAAAAAAAABAQEBAQEBAZRoFUsIhZRoGXSUUpSMBl9zaGFwZZRLCIWUjANsb3eUaBEoliAAAAAAAAAAAAC0wgAAtMIAAKDAAACgwNsPScAAAKDAAAAAgAAAAICUaAtLCIWUaBl0lFKUjARoaWdolGgRKJYgAAAAAAAAAAAAtEIAALRCAACgQAAAoEDbD0lAAACgQAAAgD8AAIA/lGgLSwiFlGgZdJRSlIwIbG93X3JlcHKUjFtbLTkwLiAgICAgICAgLTkwLiAgICAgICAgIC01LiAgICAgICAgIC01LiAgICAgICAgIC0zLjE0MTU5MjcgIC01LgogIC0wLiAgICAgICAgIC0wLiAgICAgICBdlIwJaGlnaF9yZXBylIxTWzkwLiAgICAgICAgOTAuICAgICAgICAgNS4gICAgICAgICA1LiAgICAgICAgIDMuMTQxNTkyNyAgNS4KICAxLiAgICAgICAgIDEuICAgICAgIF2UjApfbnBfcmFuZG9tlE51Yi4=",
@@ -76,15 +76,15 @@
76
  "dtype": "int64",
77
  "_np_random": null
78
  },
79
- "n_envs": 64,
80
- "n_steps": 2048,
81
  "gamma": 0.999,
82
  "gae_lambda": 0.98,
83
  "ent_coef": 0.01,
84
  "vf_coef": 0.5,
85
  "max_grad_norm": 0.5,
86
  "batch_size": 64,
87
- "n_epochs": 10,
88
  "clip_range": {
89
  ":type:": "<class 'function'>",
90
  ":serialized:": "gAWVxQIAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwOX21ha2VfZnVuY3Rpb26Uk5QoaACMDV9idWlsdGluX3R5cGWUk5SMCENvZGVUeXBllIWUUpQoSwFLAEsASwFLAUsTQwSIAFMAlE6FlCmMAV+UhZSMSS91c3IvbG9jYWwvbGliL3B5dGhvbjMuMTAvZGlzdC1wYWNrYWdlcy9zdGFibGVfYmFzZWxpbmVzMy9jb21tb24vdXRpbHMucHmUjARmdW5jlEuEQwIEAZSMA3ZhbJSFlCl0lFKUfZQojAtfX3BhY2thZ2VfX5SMGHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbpSMCF9fbmFtZV9flIwec3RhYmxlX2Jhc2VsaW5lczMuY29tbW9uLnV0aWxzlIwIX19maWxlX1+UjEkvdXNyL2xvY2FsL2xpYi9weXRob24zLjEwL2Rpc3QtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lHVOTmgAjBBfbWFrZV9lbXB0eV9jZWxslJOUKVKUhZR0lFKUjBxjbG91ZHBpY2tsZS5jbG91ZHBpY2tsZV9mYXN0lIwSX2Z1bmN0aW9uX3NldHN0YXRllJOUaB99lH2UKGgWaA2MDF9fcXVhbG5hbWVfX5SMGWNvbnN0YW50X2ZuLjxsb2NhbHM+LmZ1bmOUjA9fX2Fubm90YXRpb25zX1+UfZSMDl9fa3dkZWZhdWx0c19flE6MDF9fZGVmYXVsdHNfX5ROjApfX21vZHVsZV9flGgXjAdfX2RvY19flE6MC19fY2xvc3VyZV9flGgAjApfbWFrZV9jZWxslJOURz/JmZmZmZmahZRSlIWUjBdfY2xvdWRwaWNrbGVfc3VibW9kdWxlc5RdlIwLX19nbG9iYWxzX1+UfZR1hpSGUjAu"
 
4
  ":serialized:": "gAWVOwAAAAAAAACMIXN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi5wb2xpY2llc5SMEUFjdG9yQ3JpdGljUG9saWN5lJOULg==",
5
  "__module__": "stable_baselines3.common.policies",
6
  "__doc__": "\n Policy class for actor-critic algorithms (has both policy and value prediction).\n Used by A2C, PPO and the likes.\n\n :param observation_space: Observation space\n :param action_space: Action space\n :param lr_schedule: Learning rate schedule (could be constant)\n :param net_arch: The specification of the policy and value networks.\n :param activation_fn: Activation function\n :param ortho_init: Whether to use or not orthogonal initialization\n :param use_sde: Whether to use State Dependent Exploration or not\n :param log_std_init: Initial value for the log standard deviation\n :param full_std: Whether to use (n_features x n_actions) parameters\n for the std instead of only (n_features,) when using gSDE\n :param use_expln: Use ``expln()`` function instead of ``exp()`` to ensure\n a positive standard deviation (cf paper). It allows to keep variance\n above zero and prevent it from growing too fast. In practice, ``exp()`` is usually enough.\n :param squash_output: Whether to squash the output using a tanh function,\n this allows to ensure boundaries when using gSDE.\n :param features_extractor_class: Features extractor to use.\n :param features_extractor_kwargs: Keyword arguments\n to pass to the features extractor.\n :param share_features_extractor: If True, the features extractor is shared between the policy and value networks.\n :param normalize_images: Whether to normalize images or not,\n dividing by 255.0 (True by default)\n :param optimizer_class: The optimizer to use,\n ``th.optim.Adam`` by default\n :param optimizer_kwargs: Additional keyword arguments,\n excluding the learning rate, to pass to the optimizer\n ",
7
+ "__init__": "<function ActorCriticPolicy.__init__ at 0x7958585969e0>",
8
+ "_get_constructor_parameters": "<function ActorCriticPolicy._get_constructor_parameters at 0x795858596a70>",
9
+ "reset_noise": "<function ActorCriticPolicy.reset_noise at 0x795858596b00>",
10
+ "_build_mlp_extractor": "<function ActorCriticPolicy._build_mlp_extractor at 0x795858596b90>",
11
+ "_build": "<function ActorCriticPolicy._build at 0x795858596c20>",
12
+ "forward": "<function ActorCriticPolicy.forward at 0x795858596cb0>",
13
+ "extract_features": "<function ActorCriticPolicy.extract_features at 0x795858596d40>",
14
+ "_get_action_dist_from_latent": "<function ActorCriticPolicy._get_action_dist_from_latent at 0x795858596dd0>",
15
+ "_predict": "<function ActorCriticPolicy._predict at 0x795858596e60>",
16
+ "evaluate_actions": "<function ActorCriticPolicy.evaluate_actions at 0x795858596ef0>",
17
+ "get_distribution": "<function ActorCriticPolicy.get_distribution at 0x795858596f80>",
18
+ "predict_values": "<function ActorCriticPolicy.predict_values at 0x795858597010>",
19
  "__abstractmethods__": "frozenset()",
20
+ "_abc_impl": "<_abc._abc_data object at 0x79585853ac80>"
21
  },
22
  "verbose": 1,
23
  "policy_kwargs": {},
24
+ "num_timesteps": 19376,
25
  "_total_timesteps": 1000000,
26
  "_num_timesteps_at_start": 0,
27
  "seed": null,
28
  "action_noise": null,
29
+ "start_time": 1730001373119574991,
30
  "learning_rate": 0.0003,
31
  "tensorboard_log": null,
32
  "_last_obs": {
33
  ":type:": "<class 'numpy.ndarray'>",
34
+ ":serialized:": "gAWVdQIAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYAAgAAAAAAAGam6rnhkIq6NvPevJSgnzblAqA5210PtgAAgD8AAIA/c8jivew++bv4FiA+ccRMvMdBVT1T5yo9AACAPwAAgD/mFdS9e/yOusPSyDiUvAc0pOAaO6pG6bcAAIA/AAAAAE0/Bb2pK2W8NgiWve7qZD39jKI9lcPdOwAAgD8AAIA/ZiIUvVwrR7ob5Kg6EaMwNNo80rqOyMS5AACAPwAAgD9mLWi9rquGuhLUdrxcZog83FDYO8IKbb0AAIA/AACAPwD/kbwpCHK6IqJ2vcEM1zLppQe4/j7XswAAgD8AAIA/AIKnvLiTprvnBjW8X16TPItk7zwQT3q9AACAPwAAgD9ApUQ+0+hOPyrAHj6zR/S+PJpcPolrnTwAAAAAAAAAAECw673DwUa6Bh6VOpPM/7XDjjg78Fm2uQAAgD8AAAAAAMZ2PLgA4zquFQy+3hc6vgdbC712PYE7AAAAAAAAAACaTmO9w9EtusUdUTrVoyU0dahCugpJdbkAAIA/AACAP5pNhTsUlpW6JmCNO53ZpbZRdgg7hZWjugAAgD8AAIA/Tf1avRT+mbqlGcC8dBqcPCKI7jpSGYc9AACAPwAAgD+NOSe+CoclPE4ExTom9be4Lh21vd1w47kAAIA/AACAP5qAi709IVe7sv+ZvRKxGz1hDas8I2cCvgAAgD8AAIA/lIwFbnVtcHmUjAVkdHlwZZSTlIwCZjSUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYksQSwiGlIwBQ5R0lFKULg=="
35
  },
36
  "_last_episode_starts": {
37
  ":type:": "<class 'numpy.ndarray'>",
38
+ ":serialized:": "gAWVgwAAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYQAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACUjAVudW1weZSMBWR0eXBllJOUjAJiMZSJiIeUUpQoSwOMAXyUTk5OSv////9K/////0sAdJRiSxCFlIwBQ5R0lFKULg=="
39
  },
40
  "_last_original_obs": null,
41
  "_episode_num": 0,
42
  "use_sde": false,
43
  "sde_sample_freq": -1,
44
+ "_current_progress_remaining": 0.983616,
45
  "_stats_window_size": 100,
46
  "ep_info_buffer": {
47
  ":type:": "<class 'collections.deque'>",
48
+ ":serialized:": "gAWVNgIAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKUKH2UKIwBcpRHQHCGBzV+Zw6MAWyUTY4CjAF0lEdAlLxLxI8QqnV9lChoBkdAZINztkWhy2gHTegDaAhHQJTSPLkjopx1fZQoaAZHQGZEwnhKlHloB03oA2gIR0CU0j6tT1kEdX2UKGgGR0BpUJaTwDvFaAdN6ANoCEdAlNJAcT8HfXV9lChoBkdAZ67n6Eal12gHTegDaAhHQJTSQlzEJjV1fZQoaAZHQGOubHAAQxxoB03oA2gIR0CU0kS4OMESdX2UKGgGR0BhYJJd0JWvaAdN6ANoCEdAlNJGViWmg3V9lChoBkdAZMMp97Wuo2gHTegDaAhHQJTSSI68xsV1fZQoaAZHQGFkxWT5ftxoB03oA2gIR0CU0knnuAqedX2UKGgGR0BieOGGmDUWaAdN6ANoCEdAlNJL4vexfXV9lChoBkdAYhzmEoOQQ2gHTegDaAhHQJTSTgYP5Hp1fZQoaAZHQGCPIJiRW91oB03oA2gIR0CU0lCYCyQgdX2UKGgGR0BmAvO8kD6naAdN6ANoCEdAlNJSeVcD83V9lChoBkdAZBS49X9zfmgHTegDaAhHQJTSVGqgh8p1fZQoaAZHQGUhHDBMzuZoB03oA2gIR0CU0lXpnpSrdX2UKGgGR0BlVB6t1ZDBaAdN6ANoCEdAlNJXctXgcnV9lChoBkdAQPZ9Cu2ZzGgHS5loCEdAlOxdeUpuuXVlLg=="
49
  },
50
  "ep_success_buffer": {
51
  ":type:": "<class 'collections.deque'>",
52
  ":serialized:": "gAWVIAAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKULg=="
53
  },
54
+ "_n_updates": 252,
55
  "observation_space": {
56
  ":type:": "<class 'gymnasium.spaces.box.Box'>",
57
  ":serialized:": "gAWVdgIAAAAAAACMFGd5bW5hc2l1bS5zcGFjZXMuYm94lIwDQm94lJOUKYGUfZQojAVkdHlwZZSMBW51bXB5lIwFZHR5cGWUk5SMAmY0lImIh5RSlChLA4wBPJROTk5K/////0r/////SwB0lGKMDWJvdW5kZWRfYmVsb3eUjBJudW1weS5jb3JlLm51bWVyaWOUjAtfZnJvbWJ1ZmZlcpSTlCiWCAAAAAAAAAABAQEBAQEBAZRoCIwCYjGUiYiHlFKUKEsDjAF8lE5OTkr/////Sv////9LAHSUYksIhZSMAUOUdJRSlIwNYm91bmRlZF9hYm92ZZRoESiWCAAAAAAAAAABAQEBAQEBAZRoFUsIhZRoGXSUUpSMBl9zaGFwZZRLCIWUjANsb3eUaBEoliAAAAAAAAAAAAC0wgAAtMIAAKDAAACgwNsPScAAAKDAAAAAgAAAAICUaAtLCIWUaBl0lFKUjARoaWdolGgRKJYgAAAAAAAAAAAAtEIAALRCAACgQAAAoEDbD0lAAACgQAAAgD8AAIA/lGgLSwiFlGgZdJRSlIwIbG93X3JlcHKUjFtbLTkwLiAgICAgICAgLTkwLiAgICAgICAgIC01LiAgICAgICAgIC01LiAgICAgICAgIC0zLjE0MTU5MjcgIC01LgogIC0wLiAgICAgICAgIC0wLiAgICAgICBdlIwJaGlnaF9yZXBylIxTWzkwLiAgICAgICAgOTAuICAgICAgICAgNS4gICAgICAgICA1LiAgICAgICAgIDMuMTQxNTkyNyAgNS4KICAxLiAgICAgICAgIDEuICAgICAgIF2UjApfbnBfcmFuZG9tlE51Yi4=",
 
76
  "dtype": "int64",
77
  "_np_random": null
78
  },
79
+ "n_envs": 16,
80
+ "n_steps": 1024,
81
  "gamma": 0.999,
82
  "gae_lambda": 0.98,
83
  "ent_coef": 0.01,
84
  "vf_coef": 0.5,
85
  "max_grad_norm": 0.5,
86
  "batch_size": 64,
87
+ "n_epochs": 4,
88
  "clip_range": {
89
  ":type:": "<class 'function'>",
90
  ":serialized:": "gAWVxQIAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwOX21ha2VfZnVuY3Rpb26Uk5QoaACMDV9idWlsdGluX3R5cGWUk5SMCENvZGVUeXBllIWUUpQoSwFLAEsASwFLAUsTQwSIAFMAlE6FlCmMAV+UhZSMSS91c3IvbG9jYWwvbGliL3B5dGhvbjMuMTAvZGlzdC1wYWNrYWdlcy9zdGFibGVfYmFzZWxpbmVzMy9jb21tb24vdXRpbHMucHmUjARmdW5jlEuEQwIEAZSMA3ZhbJSFlCl0lFKUfZQojAtfX3BhY2thZ2VfX5SMGHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbpSMCF9fbmFtZV9flIwec3RhYmxlX2Jhc2VsaW5lczMuY29tbW9uLnV0aWxzlIwIX19maWxlX1+UjEkvdXNyL2xvY2FsL2xpYi9weXRob24zLjEwL2Rpc3QtcGFja2FnZXMvc3RhYmxlX2Jhc2VsaW5lczMvY29tbW9uL3V0aWxzLnB5lHVOTmgAjBBfbWFrZV9lbXB0eV9jZWxslJOUKVKUhZR0lFKUjBxjbG91ZHBpY2tsZS5jbG91ZHBpY2tsZV9mYXN0lIwSX2Z1bmN0aW9uX3NldHN0YXRllJOUaB99lH2UKGgWaA2MDF9fcXVhbG5hbWVfX5SMGWNvbnN0YW50X2ZuLjxsb2NhbHM+LmZ1bmOUjA9fX2Fubm90YXRpb25zX1+UfZSMDl9fa3dkZWZhdWx0c19flE6MDF9fZGVmYXVsdHNfX5ROjApfX21vZHVsZV9flGgXjAdfX2RvY19flE6MC19fY2xvc3VyZV9flGgAjApfbWFrZV9jZWxslJOURz/JmZmZmZmahZRSlIWUjBdfY2xvdWRwaWNrbGVfc3VibW9kdWxlc5RdlIwLX19nbG9iYWxzX1+UfZR1hpSGUjAu"
ppo-LunarLander-v2/policy.optimizer.pth CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:58905a68df75bc45df78fd12e6cca31d558facf2cb35250e5f2f1afa3281ecaf
3
- size 87978
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:64821ba9a485fab59f72ecb9a9162b0b4f9941ba36e5463c7415dbcd246f3ca6
3
+ size 88362
ppo-LunarLander-v2/policy.pth CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1dd1326266a642389d0f73457df159d7cab163612923c1b5c33e62b51831c36e
3
- size 43634
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6b2bdb0f66096826d5766b84bc6b7a74dfce15a36406e4e9f347cd5daf4cd046
3
+ size 43762
ppo-LunarLander-v2/system_info.txt CHANGED
@@ -2,7 +2,7 @@
2
  - Python: 3.10.12
3
  - Stable-Baselines3: 2.0.0a5
4
  - PyTorch: 2.4.1+cu121
5
- - GPU Enabled: False
6
  - Numpy: 1.26.4
7
  - Cloudpickle: 2.2.1
8
  - Gymnasium: 0.28.1
 
2
  - Python: 3.10.12
3
  - Stable-Baselines3: 2.0.0a5
4
  - PyTorch: 2.4.1+cu121
5
+ - GPU Enabled: True
6
  - Numpy: 1.26.4
7
  - Cloudpickle: 2.2.1
8
  - Gymnasium: 0.28.1
replay.mp4 CHANGED
Binary files a/replay.mp4 and b/replay.mp4 differ
 
results.json CHANGED
@@ -1 +1 @@
1
- {"env_id": "LunarLander-v2", "mean_reward": -114.23968531066376, "std_reward": 78.22341453939087, "n_evaluation_episodes": 10, "eval_datetime": "2024-10-27T03:27:56.780446"}
 
1
+ {"mean_reward": 273.9132449, "std_reward": 19.436598091694247, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2024-10-27T03:57:13.251368"}