RoleBeagle-11B

img

A DPO-finetune from vicgalle/CarbonBeagle-11B-truthy over a subset of OpenHermesPreferences containting RP conversations. It keeps most of the intelligence from CarbonBeagle-11B, and hopefuly can role-play better.

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 76.06
AI2 Reasoning Challenge (25-Shot) 72.35
HellaSwag (10-Shot) 89.77
MMLU (5-Shot) 66.35
TruthfulQA (0-shot) 77.92
Winogrande (5-shot) 84.06
GSM8k (5-shot) 65.88
Downloads last month
12
Safetensors
Model size
10.7B params
Tensor type
FP16
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for vicgalle/RoleBeagle-11B

Merges
1 model
Quantizations
2 models

Dataset used to train vicgalle/RoleBeagle-11B

Spaces using vicgalle/RoleBeagle-11B 6

Evaluation results