AmberYifan/llama2-7b-sft-ultrachat-safeRLHF
Text Generation • Updated • 57
AmberYifan/mistral-v0.1-7b-sft-ultrachat-safeRLHF
Text Generation • Updated • 14
AmberYifan/Mistral-7B-v0.3-sft-ultrachat-safeRLHF
Text Generation • Updated • 40
AmberYifan/Gemma-2-9B-sft-ultrachat-safeRLHF
Text Generation • Updated • 6
Yifan Wang
AmberYifan
Recent Activity
Updated a model 12 days ago: AmberYifan/Qwen2.5-1.5B-Open-R1-SFT-Math-GRPO-Code
Published a model 13 days ago: AmberYifan/Qwen2.5-1.5B-Open-R1-SFT-Math-GRPO-Code
Updated a model 13 days ago: AmberYifan/Qwen2.5-1.5B-Open-R1-SFT-Math
Collections
2
This collection contains the safetyQA dataset for safe SPIN training, along with the trained models.
models
324
AmberYifan/Qwen2.5-1.5B-Open-R1-SFT-Math-GRPO-Code
Updated • 4
AmberYifan/Qwen2.5-1.5B-Open-R1-SFT-Math
Text Generation • Updated • 67
AmberYifan/Qwen2.5-1.5B-Open-R1-Distill
Text Generation • Updated • 18
AmberYifan/Mistral-7B-v0.3-sft-ultrachat-SPIN-Mistral-8x7B-Instruct-v0.1
Text Generation • Updated • 9
AmberYifan/Qwen2-7B-sft-ultrachat-SPIN-gpt4o
Text Generation • Updated • 10
AmberYifan/Llama-3.1-8B-sft-ultrachat-SPIN-Llama-3.1-70B-Instruct
Text Generation • Updated • 7
AmberYifan/Llama-3.1-8B-sft-ultrachat-peers-pool
Text Generation • Updated • 16 • 1
AmberYifan/Qwen2.5-7B-sft-ultrachat-SPIN-gpt4o
Text Generation • Updated • 18 • 1
AmberYifan/Llama-3.1-8B-sft-ultrachat-SPIN-gpt4o
Text Generation • Updated • 7
AmberYifan/Qwen2-7B-sft-ultrachat-SPIN-Qwen2.5-72B-Instruct
Text Generation • Updated • 14
datasets
25
AmberYifan/mistral-v0.1-spin-hhrlhf
Viewer • Updated • 5.5k • 22
AmberYifan/sft-spin-filter
Updated • 6
AmberYifan/sft-spin-kcenter-5k
Viewer • Updated • 5.5k • 23
AmberYifan/gsm8k-sft
Viewer • Updated • 8.79k • 21
AmberYifan/sft-spin-v
Viewer • Updated • 50.5k • 9
AmberYifan/safeRLHF-SFT
Viewer • Updated • 83.4k • 16
AmberYifan/SPIN-trans-DPOformat
Viewer • Updated • 55k • 21
AmberYifan/spin-v-diverse
Viewer • Updated • 55k • 5
AmberYifan/dpo-v
Viewer • Updated • 55k • 21
AmberYifan/spin-v
Viewer • Updated • 55k • 13