Derry Pratama
ibndias
AI & ML interests
None yet
Recent Activity
liked
a model
4 days ago
Qwen/Qwen2.5-Omni-7B
updated
a model
6 days ago
ibndias/gemma-3-1b-reasoning-grpo
published
a model
6 days ago
ibndias/gemma-3-1b-reasoning-grpo
Organizations
Collections
2
Papers
2
models
16

ibndias/gemma-3-1b-reasoning-grpo
Text Generation
•
Updated

ibndias/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Text Generation
•
Updated
•
9

ibndias/Qwen-2.5-7B-Simple-RL
Updated

ibndias/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
•
4

ibndias/Qwen-2.5-7B_Base_Math_smalllr
Updated

ibndias/Qwen2.5-1.5B-Open-R1-GRPO1st
Text Generation
•
Updated
•
4

ibndias/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
•
8

ibndias/taxi-v3
Reinforcement Learning
•
Updated

ibndias/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated

ibndias/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
•
5