Reinforcement Learning related models
Davide Buoso
lambdavi
AI & ML interests
PhD Student @ VANDAL (Polytechnic University of Turin).
Interested in the intersection of Robotics and Generative AI.
Organizations
None yet
Collections
2
Papers
1
models
18

lambdavi/span-marker-luke-legal
Token Classification
•
Updated
•
3
•
3

lambdavi/legal-luke-base-ner
Token Classification
•
Updated
•
10
•
1

lambdavi/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated

lambdavi/ppo-Pyramids
Reinforcement Learning
•
Updated
•
17

lambdavi/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated

lambdavi/ddpg-PandaReach-v3
Reinforcement Learning
•
Updated

lambdavi/ppo-SnowballTarget
Reinforcement Learning
•
Updated
•
120

lambdavi/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated

lambdavi/span-marker-luke-base-conll2003
Token Classification
•
Updated
•
2
•
2

lambdavi/luke-base_finetuned_conll2003
Token Classification
•
Updated
•
2
datasets
None public yet