hterrebrood
/

CartPole-v1

Reinforcement Learning

custom-implementation

Model card Files Files and versions Community

CartPole-v1 / README.md

hterrebrood's picture

Update README.md

54dded2 verified 9 months ago

|

history blame contribute delete

1.51 kB

	---
	tags:
	- CartPole-v1
	- reinforce
	- reinforcement-learning
	- custom-implementation
	- deep-rl-class
	model-index:
	- name: CartPole-v1
	results:
	- task:
	type: reinforcement-learning
	name: reinforcement-learning
	dataset:
	name: CartPole-v1
	type: CartPole-v1
	metrics:
	- type: mean_reward
	value: 500.00 +/- 0.00
	name: mean_reward
	verified: false
	---

	# Reinforce Agent playing CartPole-v1
	I have used Reinforcement learning in a game, Cart Pole. The aim is to keep the equilibrium by moving left/right. While training, the game uses its results/rewards to modify its parameters to get more rewards. Specifically, the model learns what kind of tactics let the cart pole balance, and as it fails, it learns and applies those tactics to balance the cart pole.

	Some links I've found helpful include:
	https://huggingface.co/learn/deep-rl-course/en/unit0/introduction#certification-process
	https://colab.research.google.com/github/huggingface/deep-rl-class/blob/master/notebooks/unit4/unit4.ipynb#scrollTo=NCNvyElRStWG
	https://spinningup.openai.com/en/latest/spinningup/rl_intro3.html#don-t-let-the-past-distract-you
	https://stable-baselines3.readthedocs.io/en/master/guide/rl_tips.html
	https://gymnasium.farama.org/content/migration-guide/
	https://github.com/enerrio/CartPole-Reinforcement-Learning
	https://www.ibm.com/topics/overfitting
	https://learningds.org/ch/04/modeling_loss_functions.html
	https://www.geeksforgeeks.org/reinforce-algorithm/