Xi's picture

Xi

xi0v

AI & ML interests

Diffusion Model Merging, LLM Merging, Model Editing and Vision/Multimodal Model Fine-tuning.

Recent Activity

liked a model about 6 hours ago
John6666/noobai-cyberfix-v2-10vpred-perp-sdxl
liked a model about 6 hours ago
John6666/rouwei-v07vpred-sdxl
liked a model about 8 hours ago
Sao10K/70B-L3.3-mhnnn-x1
View all activity

Organizations

GEM benchmark's profile picture OpenGVLab's profile picture BigScience Biomedical Datasets's profile picture fast.ai community's profile picture LLMs's profile picture ONNXConfig for all's profile picture Gradio-Blocks-Party's profile picture scikit-learn's profile picture lora concepts library's profile picture DeepGHS's profile picture Open-Source AI Meetup's profile picture Arabic Machine Learning 's profile picture DataScienceGuild's profile picture Literally Me FRFR Research Society's profile picture Tune a video concepts library's profile picture Stable Diffusion Dreambooth Concepts Library's profile picture The Waifu Research Department's profile picture Blog-explorers's profile picture OpenSky's profile picture CyberHarem's profile picture ICCV2023's profile picture Tensor Diffusion's profile picture ICML2023's profile picture huggingPartyParis's profile picture MultiπŸ€–Transformers's profile picture AI Hobbyist's profile picture Team Tonic's profile picture That Time I got Reincarnated as a Hugging Face Organization's profile picture ZeroGPU Explorers's profile picture Project Fluently's profile picture LocalLLaMA's profile picture MLX Community's profile picture INNOVA AI's profile picture Narra's profile picture AstraLLMs's profile picture 0ai's profile picture C4AI Community's profile picture Stable Diffusion Community (Unofficial, Non-profit)'s profile picture Hugging Face for Legal's profile picture Raye's profile picture open/ acc's profile picture Data Is Better Together Contributor's profile picture

xi0v's activity

reacted to hexgrad's post with βž•πŸ‘€ 2 days ago
view post
Post
1868
Technical question: Is Abliteration still an effective method for uncensoring LLMs? Generally, what are the most effective methods to uncensor LLMs?

An effective uncensoring method would ideally be low-cost, data-efficient, and above all, successfully uncensor an LLM with minimal benchmark regressions.

"Tiananmen Square", "Winnie-the-Pooh", etc and more broadly "China influence/censorship" are some common criticisms leveled at DeepSeek.

I am vaguely aware of "Abliteration", a technique coined by @failspy (apologies if that attribution is incorrect) and originally described in a mid-2024 paper titled "Refusal in Language Models Is Mediated by a Single Direction" https://arxiv.org/abs/2406.11717

Abliteration is proposed as a relatively cheap and effective way to bypass censorship in models. However, it is not without criticism: https://www.reddit.com/r/LocalLLaMA/comments/1f07b4b/abliteration_fails_to_uncensor_models_while_it/

Curious to hear people's takes on Abliteration or other uncensoring methods, especially as it relates to DeepSeek.
Β·
upvoted an article 4 days ago
view article
Article

Accelerating Stable Diffusion XL Inference with JAX on Cloud TPU v5e

β€’ 8
reacted to mkurman's post with πŸ”₯ 4 days ago
view post
Post
2695
Ok, my 14B DeepSeek R1 merge with Qwen2.5 1M is really hot right nowβ€”it's got 2.6k downloads! It's sitting pretty as the top trending model on the third page. πŸ”₯

Check it out if you haven't already!
mkurman/Qwen2.5-14B-DeepSeek-R1-1M
Β·