Edit Models filters

Inference status

Misc

Inference Endpoints

AutoTrain Compatible

text-generation-inference

Misc with no match

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

53

Full-text search

Active filters: online-dpo

XueyingJia/qwen-1.5b-HH-online-dpo-ground-truth-lead-xs-batch

Updated Dec 11, 2024

XueyingJia/qwen-1.5b-HH-online-dpo-xs-batch

Updated Dec 11, 2024

XueyingJia/qwen-1.5b-sft-HH-online-dpo-ground-truth-lead

Updated Dec 11, 2024

XueyingJia/qwen-1.5b-sft-HH-online-dpo

Updated Dec 11, 2024

XueyingJia/qwen-0.5b-sft-HH-online-dpo-ground-truth-lead

Updated Dec 11, 2024

XueyingJia/qwen-0.5b-sft-HH-online-dpo

Updated Dec 11, 2024

XueyingJia/qwen2.5-0.5b-oaif

Updated about 1 month ago

XueyingJia/qwen2.5-1.5b-oaif

Updated about 1 month ago

XueyingJia/test_h100

Updated about 1 month ago

XueyingJia/qwen2.5-7b-oaif

Updated about 1 month ago

XueyingJia/qwen2.5-7b-ours

Updated about 1 month ago

XueyingJia/qwen2.5-14b-oaif

Updated about 1 month ago

XueyingJia/qwen2.5-14b-ours

Updated about 1 month ago

XueyingJia/qwen2.5-1.5b-ours

Updated about 1 month ago

XueyingJia/qwen2.5-0.5b-ours

Updated about 1 month ago

XueyingJia/Qwen2-1.5B-Instruct-oaif-4-epoch

Updated about 1 month ago

XueyingJia/Qwen2-1.5B-Instruct-oaif-2-epoch

Updated about 1 month ago

XueyingJia/Qwen2-1.5B-Instruct-ours-2-epoch

Updated about 1 month ago

XueyingJia/Qwen2-1.5B-Instruct-ours-2-epoch-duplicate

Updated about 1 month ago

XueyingJia/Qwen2-1.5B-Instruct-ours-4-epoch

Updated about 1 month ago

XueyingJia/qwen2.5-1.5B-Instruct-tldr-ours

Updated 29 days ago

XueyingJia/qwen2.5-1.5B-Instruct-Mistral-reward-ours

Updated 29 days ago

XueyingJia/qwen2.5-1.5B-Instruct-Mistral-reward-oaif

Updated 29 days ago