Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
online-dpo
Inference Endpoints
AutoTrain Compatible
text-generation-inference
Misc with no match
Eval Results
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
53
Full-text search
Edit filters
Sort: Trending
Active filters:
online-dpo
Clear all
XueyingJia/qwen-1.5b-HH-online-dpo-ground-truth-lead-xs-batch
Updated
Dec 11, 2024
XueyingJia/qwen-1.5b-HH-online-dpo-xs-batch
Updated
Dec 11, 2024
XueyingJia/qwen-1.5b-sft-HH-online-dpo-ground-truth-lead
Updated
Dec 11, 2024
XueyingJia/qwen-1.5b-sft-HH-online-dpo
Updated
Dec 11, 2024
XueyingJia/qwen-0.5b-sft-HH-online-dpo-ground-truth-lead
Updated
Dec 11, 2024
XueyingJia/qwen-0.5b-sft-HH-online-dpo
Updated
Dec 11, 2024
XueyingJia/qwen2.5-0.5b-oaif
Updated
about 1 month ago
XueyingJia/qwen2.5-1.5b-oaif
Updated
about 1 month ago
XueyingJia/test_h100
Updated
about 1 month ago
XueyingJia/qwen2.5-7b-oaif
Updated
about 1 month ago
XueyingJia/qwen2.5-7b-ours
Updated
about 1 month ago
XueyingJia/qwen2.5-14b-oaif
Updated
about 1 month ago
XueyingJia/qwen2.5-14b-ours
Updated
about 1 month ago
XueyingJia/qwen2.5-1.5b-ours
Updated
about 1 month ago
XueyingJia/qwen2.5-0.5b-ours
Updated
about 1 month ago
XueyingJia/Qwen2-1.5B-Instruct-oaif-4-epoch
Updated
about 1 month ago
XueyingJia/Qwen2-1.5B-Instruct-oaif-2-epoch
Updated
about 1 month ago
XueyingJia/Qwen2-1.5B-Instruct-ours-2-epoch
Updated
about 1 month ago
XueyingJia/Qwen2-1.5B-Instruct-ours-2-epoch-duplicate
Updated
about 1 month ago
XueyingJia/Qwen2-1.5B-Instruct-ours-4-epoch
Updated
about 1 month ago
XueyingJia/qwen2.5-1.5B-Instruct-tldr-ours
Updated
29 days ago
XueyingJia/qwen2.5-1.5B-Instruct-Mistral-reward-ours
Updated
29 days ago
XueyingJia/qwen2.5-1.5B-Instruct-Mistral-reward-oaif
Updated
29 days ago
Previous
1
2
Next