This repo contains a SAC checkpoint trained with Stable-Baselines3 on the
Gymnasium Ant-v4 environment, using the hyperparameters recommended by RLZoo:
model = SAC("MlpPolicy", env, learning_starts=10000)
The versions are:
python==3.12.3
gymnasium==0.29.1
stable_baselines3==2.3.2
torch==2.3.1
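
Checkpoints saved by one library version do not always load cleanly under another, so it can help to compare the local environment against the versions listed above before loading. The following is a minimal sketch using only the standard library; the helper name find_mismatches and the EXPECTED dictionary are illustrative and not part of this repo, and stripping a local build tag such as "+cu121" before comparing is an assumption about how torch may be installed.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions listed above (the Python version itself is not checked here).
EXPECTED = {
    "gymnasium": "0.29.1",
    "stable_baselines3": "2.3.2",
    "torch": "2.3.1",
}


def find_mismatches(expected, get_version=version):
    """Return {package: (expected, installed_or_None)} for packages that differ."""
    mismatches = {}
    for name, wanted in expected.items():
        try:
            installed = get_version(name)
        except PackageNotFoundError:
            installed = None  # package is not installed at all
        # Ignore a local build tag such as "2.3.1+cu121" when comparing.
        if installed is None or installed.split("+")[0] != wanted:
            mismatches[name] = (wanted, installed)
    return mismatches


if __name__ == "__main__":
    for name, (wanted, installed) in find_mismatches(EXPECTED).items():
        print(f"{name}: expected {wanted}, found {installed}")
```

Running the script prints one line per package whose installed version differs from the list, and nothing if everything matches.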