GRPO RL model
SunJack
Recent Activity
Updated a collection about 1 month ago: GRPO
Updated a model about 1 month ago: SunJack/Qwen2.5-3B-R1-GGUF
Updated a model about 1 month ago: SunJack/Qwen2.5-3B-R1
Collections: 1
Models: 14

SunJack/Qwen2.5-3B-R1-GGUF • 43
SunJack/Qwen2.5-3B-R1 • 12
SunJack/Phi-4-R1
SunJack/Phi-4-R1-GGUF
SunJack/Qwen2.5-7b-sft • 5
SunJack/phi4-o1 • 106
SunJack/Qwen2.5-3B-GRPO_lora
SunJack/qwen2.5-7b-o1 • 27 • 1
SunJack/qwen2.5-7b-cve • 58 • 1
SunJack/qwen2-7b-ruozhiba-finetuning • 43 • 2