-
-
-
-
-
-
Inference status
Active filters:
rlhf
merve/peft-copy-test
Text Generation
•
Updated
•
5
PKU-Alignment/beaver-7b-v1.0
Reinforcement Learning
•
Updated
•
26
•
10
lyogavin/Anima33B-DPO-Belle-1k
Text Generation
•
Updated
•
1
lyogavin/Anima33B-DPO-Belle-1k-merged
Text Generation
•
Updated
•
12
•
12
PKU-Alignment/beaver-7b-v1.0-reward
Reinforcement Learning
•
Updated
•
492
•
16
PKU-Alignment/beaver-dam-7b
Updated
•
582
•
6
PKU-Alignment/beaver-7b-v1.0-cost
Reinforcement Learning
•
Updated
•
433
•
9
Ablustrund/moss-rlhf-reward-model-7B-zh
Updated
•
1
•
23
fnlp/moss-rlhf-reward-model-7B-en
fnlp/moss-rlhf-sft-model-7B-en
fnlp/moss-rlhf-policy-model-7B-en
lightonai/alfred-40b-0723
Text Generation
•
Updated
•
31
•
45
kashif/stack-llama-2
Text Generation
•
Updated
•
778
•
15
barnybug/stack-llama-2-ggml
vwxyzjn/starcoderbase-triviaqa
Text Generation
•
Updated
•
25
lvwerra/starcoderbase-gsm8k
Text Generation
•
Updated
•
14
ContextualAI/archangel_sft_pythia1-4b
Text Generation
•
Updated
•
47
ContextualAI/archangel_sft_pythia2-8b
Text Generation
•
Updated
•
17
•
1
ContextualAI/archangel_sft_pythia6-9b
Text Generation
•
Updated
•
22
ContextualAI/archangel_sft_pythia12-0b
Text Generation
•
Updated
•
20
ContextualAI/archangel_sft_llama7b
Text Generation
•
Updated
•
238
•
1
ContextualAI/archangel_sft_llama13b
Text Generation
•
Updated
•
286
ContextualAI/archangel_sft_llama30b
Text Generation
•
Updated
•
19
ContextualAI/archangel_slic_llama30b
Text Generation
•
Updated
•
21
ContextualAI/archangel_slic_pythia1-4b
Text Generation
•
Updated
•
17
ContextualAI/archangel_slic_pythia2-8b
Text Generation
•
Updated
•
15
ContextualAI/archangel_slic_pythia6-9b
Text Generation
•
Updated
•
16
ContextualAI/archangel_slic_pythia12-0b
Text Generation
•
Updated
•
15
ContextualAI/archangel_slic_llama7b
Text Generation
•
Updated
•
16
•
1
ContextualAI/archangel_slic_llama13b
Text Generation
•
Updated
•
16