Hugging Face
Models: 999
Sort: Trending
Active filters: HuggingFaceH4/ultrafeedback_binarized
DUAL-GPO/zephyr-7b-gpo-v1-i1 • Updated May 6
DUAL-GPO/zephyr-7b-gpo-log-v3-i1 • Updated May 6
DUAL-GPO/zephyr-7b-gpo-v2-i1 • Updated May 6
ShenaoZ/0.0001_gemmait_withdpo_4iters_bs256_555lr_iter_1 • Text Generation • Updated May 6
AmberYifan/zephyr-7b-sft-safeDPO • Text Generation • Updated May 6 • 2
ShenaoZ/0.0001_zephyrgemmasft_withdpo_3iters_bs256_555lr_iter_1 • Text Generation • Updated May 6 • 2
DUAL-GPO/phi-2-gpo-renew2-b0.001-0.5ultrafeedback-rank256-i1 • Updated May 6 • 2
ShenaoZ/0.0001_zephyrgemmasft_withdpo_4iters_bs256_555lr_iter_1 • Text Generation • Updated May 6
DUAL-GPO/phi-2-gpo-renew2-b0.001-0.05ultrafeedback-rank128-i1 • Updated May 6 • 3
DUAL-GPO/zephyr-7b-gpo-log-test-i0 • Updated May 6
DUAL-GPO/zephyr-7b-gpo-test-i0 • Updated May 6
DUAL-GPO/zephyr-7b-lgpo-v1-i1 • Updated May 7
DUAL-GPO-2/zephyr-7b-gpo-v6-i1 • Updated May 7 • 1
DUAL-GPO/zephyr-7b-gpo-v5-i1 • Updated May 7 • 2
ShenaoZ/0.0_withdpo_4iters_bs256_5102lr_iter_1 • Text Generation • Updated May 7
Minbyul/biomistral-7b-wo-kqa_golden-iter-sft-dpo-step1 • Text Generation • Updated May 7
Felladrin/gguf-TinyLlama-1.1B-Chat-v1.0 • Updated May 7 • 1
DUAL-GPO-2/phi-2-gpo-renew2-b0.001-vllm-merge-20k-i1 • Updated May 7
DUAL-GPO/phi-2-gpo-renew2-b0.001-vllm-merge-20k-log-i1 • Updated May 7
ShenaoZ/0.01_withdpo_4iters_bs256_531lr_iter_1 • Text Generation • Updated May 7 • 2
ShenaoZ/0.001_withdpo_4iters_bs256_531lr_iter_1 • Text Generation • Updated May 7 • 2
ale-bay/zephyr-7b-dpo-full • Text Generation • Updated May 7 • 1
lole25/zephyr-7b-gpo-v6-i1 • Updated May 7 • 2
lole25/zephyr-7b-gpo-v7-i1 • Updated May 8 • 4
DUAL-GPO/zephyr-7b-gpo-v8-i1 • Updated May 8 • 10
ShenaoZ/0.0001_withdpo_3iters_bs256_5102lr_iter_1 • Text Generation • Updated May 8 • 1
ShenaoZ/0.0001_withdpo_5iters_bs256_5102lr_iter_1 • Text Generation • Updated May 7
AmberYifan/zephyr-7b-sft-safeDPO3 • Text Generation • Updated May 9
DUAL-GPO-2/zephyr-7b-gpo-v2-i0 • Updated May 8 • 3
lole25/zephyr-7b-gpo-v9-i1 • Updated May 8 • 10