Active filters: dpo
CultriX/Lama-DPOlphin-8B-Q3_K_M-GGUF • Text Generation • 4 • 1
CultriX/Lama-DPOlphin-8B-Q4_K_S-GGUF • Text Generation • 1
CultriX/Lama-DPOlphin-8B-Q5_K_S-GGUF • Text Generation • 1 • 1
CultriX/Lama-DPOlphin-8B-Q5_K_M-GGUF • Text Generation • 1 • 1
CultriX/Lama-DPOlphin-8B-Q6_K-GGUF • Text Generation • 6 • 2
CultriX/Lama-DPOlphin-8B-Q8_0-GGUF • Text Generation • 3 • 1
QuantFactory/Fireball-3.1-8B-ORPO-GGUF • Text Generation • 10 • 2
CultriX/Lama-DPOlphin-8B-Q4_K_M-GGUF • Text Generation • 13 • 1
mradermacher/Lama-DPOlphin-8B-GGUF • 11 • 1
tsavage68/Na_L3_1000steps_1e7rate_01beta_cSFTDPO • Text Generation • 13
tsavage68/Na_L3_1000steps_1e7rate_03beta_cSFTDPO • Text Generation • 12
tsavage68/Na_L3_350steps_1e7rate_01beta_cSFTDPO • Text Generation • 10
tsavage68/Na_L3_250steps_1e7rate_03beta_cSFTDPO • Text Generation • 11
tsavage68/Na_L3_1000steps_1e7rate_05beta_cSFTDPO • Text Generation • 14
tsavage68/Na_L3_350steps_1e7rate_05beta_cSFTDPO • Text Generation • 9
mradermacher/Lama-DPOlphin-8B-i1-GGUF • 164 • 1
tsavage68/Na_M2_1000steps_1e7rate_01beta_cSFTDPO • Text Generation • 12
tsavage68/Na_M2_1000steps_1e7rate_03beta_cSFTDPO • Text Generation • 11
tsavage68/Na_M2_1000steps_1e7rate_05beta_cSFTDPO • Text Generation • 13
tsavage68/Na_M2_200steps_1e6rate_01beta_cSFTDPO • Text Generation • 12
tsavage68/Na_M2_100steps_1e7rate_03beta_cSFTDPO • Text Generation • 10
SongTonyLi/SFT_D1chosenThenDPO_D2a_Instruct_argilla_math_results • Text Generation • 10
Jatin313/tiny-chatbot-dpo
NicholasCorrado/zephyr-7b-uf-dpo-2e • Text Generation • 9
bartowski/TwinLlama-3.1-8B-DPO3-GGUF • Text Generation • 47
nomadrp/tq-aya101-6langs
NicholasCorrado/rlced-conifer-zephyr-7b-dpo-2e • Text Generation • 38
tsavage68/Na_M2_1000steps_1e8rate_03beta_cSFTDPO • Text Generation • 11
tsavage68/Na_M2_1000steps_1e6rate_05beta_cSFTDPO • Text Generation • 13
tsavage68/Na_M2_1000steps_1e8rate_01beta_cSFTDPO • Text Generation • 13