Llama 3 8B Instruct no refusal

This model applies orthogonal feature ablation, as described in this paper.
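
To give a rough idea of what ablating a feature direction can look like, here is a minimal NumPy sketch. It is not the code used to produce this model. It assumes that "orthogonal feature ablation" means projecting a single direction out of the output space of a weight matrix. The names `ablate_direction`, `W` and `r` are hypothetical, and the random `r` stands in for a direction that would in practice be estimated from calibration data.

```python
import numpy as np

def ablate_direction(weight: np.ndarray, direction: np.ndarray) -> np.ndarray:
    """Project `direction` out of the output space of `weight`.

    weight:    (d_out, d_in) matrix (assumed to write into the hidden state).
    direction: (d_out,) vector to ablate; it is normalized here.
    """
    r = direction / np.linalg.norm(direction)
    # W' = (I - r r^T) W, so every output of W' is orthogonal to r.
    return weight - np.outer(r, r) @ weight

# Toy data only: random matrix and random direction (illustrative assumption).
rng = np.random.default_rng(0)
W = rng.normal(size=(8, 4))
r = rng.normal(size=8)

W_ablated = ablate_direction(W, r)
x = rng.normal(size=4)
# Component of the ablated output along r (numerically close to zero).
print(float(r @ (W_ablated @ x)))
```

Applying the projection to the weights rather than to activations at inference time means the edited model can be saved and loaded like any other checkpoint, which is one plausible reason to do it this way.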

Calibration data:

The model still refuses some instructions related to violence. I suspect that a full fine-tune might be needed to remove the remaining refusals. Use this model responsibly; I decline any liability resulting from its use.

I will post the code later.

Format: Safetensors
Model size: 8.03B params
Tensor type: FP16
