vikarti-anatra's Collections
Interesting ones
LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B
Paper
•
2310.20624
•
Published
•
13
Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning
Paper
•
2310.20587
•
Published
•
18
BadLlama: cheaply removing safety fine-tuning from Llama 2-Chat 13B
Paper
•
2311.00117
•
Published
VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation
Paper
•
2303.08320
•
Published
•
3
Vikhrmodels/Vikhr-7B-instruct_0.4
Text Generation
•
8B
•
Updated
•
1.53k
•
34
IlyaGusev/saiga_llama3_8b
Text Generation
•
8B
•
Updated
•
3.95k
•
126
QuixiAI/wizard_vicuna_70k_unfiltered
Viewer
•
Updated
•
34.6k
•
214
•
169
failspy/llama-3-70B-Instruct-abliterated
Text Generation
•
71B
•
Updated
•
10.1k
•
110
Zoyd/Sao10K_L3-8B-Stheno-v3.1-8_0bpw_exl2
Text Generation
•
Updated
•
4
•
3
Zoyd/Sao10K_L3-8B-Stheno-v3.1-6_5bpw_exl2
Text Generation
•
Updated
•
12
•
1
sophosympatheia/Aurora-Nights-70B-v1.0
Text Generation
•
69B
•
Updated
•
1.44k
•
22
PygmalionAI/mythalion-13b
Text Generation
•
13B
•
Updated
•
2.37k
•
157
Nitral-AI/Poppy_Porpoise-1.0-L3-8B
Text Generation
•
8B
•
Updated
•
36
•
24
NeverSleep/Noromaid-v0.4-Mixtral-Instruct-8x7b-Zloss
Text Generation
•
47B
•
Updated
•
11
•
37
microsoft/Phi-3-medium-128k-instruct
Text Generation
•
14B
•
Updated
•
10k
•
383
Azazelle/L3-RP_io
Text Generation
•
8B
•
Updated
•
5
•
3
Lewdiculous/Poppy_Porpoise-1.0-L3-8B-GGUF-IQ-Imatrix
8B
•
Updated
•
223
•
15
ACECODER: Acing Coder RL via Automated Test-Case Synthesis
Paper
•
2502.01718
•
Published
•
29
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens
Paper
•
2504.07096
•
Published
•
76