Technical question: Is Abliteration still an effective method for uncensoring LLMs? Generally, what are the most effective methods to uncensor LLMs?
An effective uncensoring method would ideally be low-cost and data-efficient and, above all, would actually remove refusals with minimal benchmark regressions.
"Tiananmen Square", "Winnie-the-Pooh", etc and more broadly "China influence/censorship" are some common criticisms leveled at DeepSeek.
I am vaguely aware of "Abliteration", a technique coined by @failspy (apologies if that attribution is incorrect) and based on the directional-ablation method described in the mid-2024 paper "Refusal in Language Models Is Mediated by a Single Direction": https://arxiv.org/abs/2406.11717
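For anyone unfamiliar, here is a minimal sketch of the directional-ablation idea from that paper, as I understand it: estimate a "refusal direction" as the difference in mean residual-stream activations between harmful and harmless prompts, then project that direction out of the activations (or bake the projection into the weights). Note the shapes and names below (`harmful_acts`, `d_model`, etc.) are my own illustrative assumptions, not code from the paper or from @failspy's implementation.

```python
import torch

def refusal_direction(harmful_acts: torch.Tensor,
                      harmless_acts: torch.Tensor) -> torch.Tensor:
    """Difference-in-means "refusal direction", unit-normalized.

    Both inputs are assumed to be residual-stream activations captured at
    some layer, shape (n_prompts, d_model).
    """
    d = harmful_acts.mean(dim=0) - harmless_acts.mean(dim=0)
    return d / d.norm()

def ablate(acts: torch.Tensor, direction: torch.Tensor) -> torch.Tensor:
    """Inference-time ablation: x <- x - (x . r) r, removing the component
    of each activation along the (unit) refusal direction."""
    return acts - (acts @ direction).unsqueeze(-1) * direction

def orthogonalize(W: torch.Tensor, direction: torch.Tensor) -> torch.Tensor:
    """Weight-orthogonalization variant: given a matrix W of shape
    (d_model, d_in) that writes into the residual stream (output = x @ W.T),
    return W' = W - r r^T W, so the layer can never write along r."""
    return W - torch.outer(direction, direction @ W)

if __name__ == "__main__":
    # Toy demonstration with random stand-in activations.
    d_model = 8
    harmful = torch.randn(32, d_model) + 1.0   # fake "harmful prompt" acts
    harmless = torch.randn(32, d_model)        # fake "harmless prompt" acts
    r = refusal_direction(harmful, harmless)

    x = torch.randn(4, d_model)
    assert torch.allclose(ablate(x, r) @ r, torch.zeros(4), atol=1e-5)
```

The weight-orthogonalization variant is what makes the edit permanent: once every matrix that writes into the residual stream (embeddings, attention output projections, MLP down-projections) has been orthogonalized against the refusal direction, no inference-time hooks are needed and the model can be saved and shared as a normal checkpoint.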
Ok, my 14B DeepSeek R1 merge with Qwen2.5 1M is really hot right now: it's got 2.6k downloads! It's sitting pretty as the top trending model on the third page. 🔥