Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ContextualAI
/
Contextual_KTO_Mistral_PairRM
like
31
Follow
ContextualAI
65
Text Generation
Transformers
Safetensors
snorkelai/Snorkel-Mistral-PairRM-DPO-Dataset
English
mistral
human feedback
rlhf
preferences
alignment
HALO
halos
dpo
rl
conversational
text-generation-inference
Inference Endpoints
arxiv:
2402.01306
License:
apache-2.0
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
main
Contextual_KTO_Mistral_PairRM
Commit History
Update README.md
98bee13
verified
xwinxu
commited on
Apr 26, 2024
Update README.md
bdf7fe0
verified
xwinxu
commited on
Mar 7, 2024
Update README.md
31efc9a
verified
xwinxu
commited on
Mar 7, 2024
Update README.md
8d0fec9
verified
xwinxu
commited on
Mar 7, 2024
Update README.md
8b7e5cc
verified
xwinxu
commited on
Mar 7, 2024
Fix tokenizer chat template
8cffcfe
verified
shikib
commited on
Mar 6, 2024
Update README.md
06fc6e3
verified
xwinxu
commited on
Mar 5, 2024
Upload MistralForCausalLM
d8380f4
verified
Muennighoff
commited on
Mar 5, 2024
Upload tokenizer
eb151d5
verified
Muennighoff
commited on
Mar 5, 2024
Upload README.md with huggingface_hub
66b1fa9
verified
Muennighoff
commited on
Mar 5, 2024
Upload tokenizer
e531f7b
verified
Muennighoff
commited on
Mar 5, 2024
Upload README.md with huggingface_hub
0d81ff6
verified
Muennighoff
commited on
Mar 5, 2024
Upload tokenizer
231bafb
verified
Muennighoff
commited on
Mar 5, 2024
Upload README.md with huggingface_hub
dba0d32
verified
Muennighoff
commited on
Mar 5, 2024
Upload tokenizer
257fdd0
verified
Muennighoff
commited on
Mar 5, 2024
Upload README.md with huggingface_hub
c96f499
verified
Muennighoff
commited on
Mar 5, 2024
Upload tokenizer
c47c194
verified
Muennighoff
commited on
Mar 5, 2024
Upload README.md with huggingface_hub
0b9cba1
verified
Muennighoff
commited on
Mar 5, 2024
Upload tokenizer
f652cba
verified
Muennighoff
commited on
Mar 5, 2024
Upload README.md with huggingface_hub
cbf882a
verified
Muennighoff
commited on
Mar 5, 2024
Upload tokenizer
45df619
verified
Muennighoff
commited on
Mar 5, 2024
Upload README.md with huggingface_hub
2c7e3b1
verified
Muennighoff
commited on
Mar 5, 2024
Upload tokenizer
8922fc2
verified
Muennighoff
commited on
Mar 5, 2024
Upload README.md with huggingface_hub
927e33a
verified
Muennighoff
commited on
Mar 5, 2024
initial commit
4564abd
verified
xwinxu
commited on
Mar 5, 2024