Edit Models filters

Inference status

Misc

Inference Endpoints

AutoTrain Compatible

text-generation-inference

4-bit precision

8-bit precision

Misc with no match

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

321

Full-text search

Active filters: rlhf

merve/peft-copy-test

Text Generation • Updated Jun 14, 2023 • 5

PKU-Alignment/beaver-7b-v1.0

Reinforcement Learning • Updated May 9, 2024 • 26 • 10

lyogavin/Anima33B-DPO-Belle-1k

Text Generation • Updated Jul 2, 2023 • 1

lyogavin/Anima33B-DPO-Belle-1k-merged

Text Generation • Updated Jul 2, 2023 • 12 • 12

PKU-Alignment/beaver-7b-v1.0-reward

Reinforcement Learning • Updated Apr 20, 2024 • 492 • 16

PKU-Alignment/beaver-dam-7b

Updated Jul 10, 2023 • 582 • 6

PKU-Alignment/beaver-7b-v1.0-cost

Reinforcement Learning • Updated Apr 20, 2024 • 433 • 9

Ablustrund/moss-rlhf-reward-model-7B-zh

Updated Jul 13, 2023 • 1 • 23

fnlp/moss-rlhf-reward-model-7B-en

Updated Jul 13, 2023 • 9

fnlp/moss-rlhf-sft-model-7B-en

Updated Jul 14, 2023 • 2

fnlp/moss-rlhf-policy-model-7B-en

Updated Jul 17, 2023 • 1

lightonai/alfred-40b-0723

Text Generation • Updated Aug 11, 2023 • 31 • 45

kashif/stack-llama-2

Text Generation • Updated Aug 8, 2023 • 778 • 15

barnybug/stack-llama-2-ggml

Updated Aug 10, 2023 • 4

vwxyzjn/starcoderbase-triviaqa

Text Generation • Updated Aug 29, 2023 • 25

lvwerra/starcoderbase-gsm8k

Text Generation • Updated Aug 30, 2023 • 14

ContextualAI/archangel_sft_pythia1-4b

Text Generation • Updated Jan 11, 2024 • 47

ContextualAI/archangel_sft_pythia2-8b

Text Generation • Updated Jan 11, 2024 • 17 • 1

ContextualAI/archangel_sft_pythia6-9b

Text Generation • Updated Jan 11, 2024 • 22

ContextualAI/archangel_sft_pythia12-0b

Text Generation • Updated Jan 11, 2024 • 20

ContextualAI/archangel_sft_llama7b

Text Generation • Updated Jan 11, 2024 • 238 • 1

ContextualAI/archangel_sft_llama13b

Text Generation • Updated Jan 11, 2024 • 286

ContextualAI/archangel_sft_llama30b

Text Generation • Updated Jan 11, 2024 • 19

ContextualAI/archangel_slic_llama30b

Text Generation • Updated Jan 11, 2024 • 21

ContextualAI/archangel_slic_pythia1-4b

Text Generation • Updated Jan 11, 2024 • 17

ContextualAI/archangel_slic_pythia2-8b

Text Generation • Updated Jan 11, 2024 • 15

ContextualAI/archangel_slic_pythia6-9b

Text Generation • Updated Jan 11, 2024 • 16

ContextualAI/archangel_slic_pythia12-0b

Text Generation • Updated Jan 11, 2024 • 15

ContextualAI/archangel_slic_llama7b

Text Generation • Updated Jan 11, 2024 • 16 • 1

ContextualAI/archangel_slic_llama13b

Text Generation • Updated Jan 11, 2024 • 16