-
-
-
-
-
-
Inference status
Active filters:
rlhf
sileod/deberta-v3-large-tasksource-nli
Zero-Shot Classification
•
Updated
•
571
•
34
sileod/deberta-v3-base-tasksource-nli
Zero-Shot Classification
•
Updated
•
20.5k
•
120
mlabonne/NeuralHermes-2.5-Mistral-7B
Text Generation
•
Updated
•
154
•
152
simonveitner/MathHermes-2.5-Mistral-7B
Text Generation
•
Updated
•
18
•
1
joey00072/ToxicHermes-2.5-Mistral-7B
Text Generation
•
Updated
•
45
•
19
argilla/distilabeled-OpenHermes-2.5-Mistral-7B
Text Generation
•
Updated
•
35
•
31
argilla/distilabeled-Marcoro14-7B-slerp-full
Text Generation
•
Updated
•
719
•
2
TheBloke/NeuralBeagle14-7B-GGUF
Updated
•
475
•
26
argilla/CapybaraHermes-2.5-Mistral-7B
Updated
•
33
•
68
tasksource/deberta-small-long-nli
Zero-Shot Classification
•
Updated
•
23.1k
•
40
TheBloke/CapybaraHermes-2.5-Mistral-7B-GGUF
Updated
•
9.09k
•
102
TheBloke/CapybaraHermes-2.5-Mistral-7B-GPTQ
Updated
•
4.61k
•
56
mlabonne/AlphaMonarch-7B
Text Generation
•
Updated
•
12.6k
•
149
mradermacher/distilabeled-Marcoro14-7B-slerp-full-GGUF
Updated
•
141
•
1
mradermacher/distilabeled-Marcoro14-7B-slerp-full-i1-GGUF
Updated
•
234
•
1
stanfordnlp/SteamSHP-flan-t5-xl
Text2Text Generation
•
Updated
•
72
•
43
stanfordnlp/SteamSHP-flan-t5-large
Text2Text Generation
•
Updated
•
40
•
33
trl-lib/llama-7b-se-peft
sileod/deberta-v3-large-tasksource-rlhf-reward-model
Text Classification
•
Updated
•
42
•
11
trl-lib/llama-7b-se-rl-peft
Updated
•
103
trl-lib/llama-7b-se-rm-peft
toloka/gpt2-large-rl-prompt-writing
Text Generation
•
Updated
•
21
•
3
AdamG012/chat-opt-1.3b-rlhf-actor-deepspeed
Text Generation
•
Updated
•
15
•
5
AdamG012/chat-opt-1.3b-rlhf-critic-deepspeed
Text Generation
•
Updated
•
13
•
3
AdamG012/chat-opt-1.3b-rlhf-actor-ema-deepspeed
Text Generation
•
Updated
•
11
•
8
sileod/mdeberta-v3-base-tasksource-nli
Zero-Shot Classification
•
Updated
•
113
•
15
agi-css/socially-good-lm
Text Generation
•
Updated
•
12
•
5
agi-css/hh-rlhf-sft
Text Generation
•
Updated
•
16
•
3
agi-css/better-base
Text Generation
•
Updated
•
12
•
5
argilla/roberta-base-reward-model-falcon-dolly
Text Classification
•
Updated
•
16
•
4