Great to hear LoRA extraction is fairly easy, I've been looking for it just yesterday, believing the paper linked in my post that its authors are the first such inventors..: https://www.reddit.com/r/LocalLLaMA/comments/1iejts2/call_for_qlora_extractor_development_cooperation/
AI Safety Research
AISafety
AI & ML interests
LLMs, planning, EA
Recent Activity
liked
a model
4 days ago
mistralai/Mistral-Small-24B-Instruct-2501
Organizations
None yet
AISafety's activity
reacted to
sayakpaul's
post with ๐ฅ
3 days ago
Post
1696
We have been cooking a couple of fine-tuning runs on CogVideoX with finetrainers, smol datasets, and LoRA to generate cool video effects like crushing, dissolving, etc.
We are also releasing a LoRA extraction utility from a fully fine-tuned checkpoint. I know that kind of stuff has existed since eternity, but the quality on video models was nothing short of spectacular. Below are some links:
* Models and datasets: https://huggingface.co/finetrainers
* finetrainers: https://github.com/a-r-r-o-w/finetrainers
* LoRA extraction: https://github.com/huggingface/diffusers/blob/main/scripts/extract_lora_from_model.py
We are also releasing a LoRA extraction utility from a fully fine-tuned checkpoint. I know that kind of stuff has existed since eternity, but the quality on video models was nothing short of spectacular. Below are some links:
* Models and datasets: https://huggingface.co/finetrainers
* finetrainers: https://github.com/a-r-r-o-w/finetrainers
* LoRA extraction: https://github.com/huggingface/diffusers/blob/main/scripts/extract_lora_from_model.py
mistralai/Mistral-Small-24B-Instruct-2501
Text Generation
โข
Updated
โข
18.4k
โข
โข
558
Qwen/Qwen2.5-7B-Instruct-1M
Text Generation
โข
Updated
โข
25k
โข
174
Qwen/Qwen2.5-14B-Instruct-1M
Text Generation
โข
Updated
โข
10.7k
โข
215
ggml-org/LoRA-Qwen2.5-32B-Instruct-abliterated-F16-GGUF
Updated
โข
220
โข
3
upvoted
an
article
7 days ago
Article
Open-R1: a fully open reproduction of DeepSeek-R1
โข
607
Synthetic Data Collection PAUSED Jan 16
29
#21 opened 29 days ago
by
hexgrad
reacted to
hexgrad's
post with ๐
27 days ago
Post
19479
๐ฃ Looking for labeled, high-quality synthetic audio/TTS data ๐ฃ Have you been or are you currently calling API endpoints from OpenAI, ElevenLabs, etc? Do you have labeled audio data sitting around gathering dust? Let's talk! Join https://discord.gg/QuGxSWBfQy or comment down below.
If your data exceeds quantity & quality thresholds and is approved into the next hexgrad/Kokoro-82M training mix, and you permissively DM me the data under an effective Apache license, then I will DM back the corresponding voicepacks for YOUR data if/when the next Apache-licensed Kokoro base model drops.
What does this mean? If you've been calling closed-source TTS or audio API endpoints to:
- Build voice agents
- Make long-form audio, like audiobooks or podcasts
- Handle customer support, etc
Then YOU can contribute to the training mix and get useful artifacts in return. โค๏ธ
More details at hexgrad/Kokoro-82M#21
If your data exceeds quantity & quality thresholds and is approved into the next hexgrad/Kokoro-82M training mix, and you permissively DM me the data under an effective Apache license, then I will DM back the corresponding voicepacks for YOUR data if/when the next Apache-licensed Kokoro base model drops.
What does this mean? If you've been calling closed-source TTS or audio API endpoints to:
- Build voice agents
- Make long-form audio, like audiobooks or podcasts
- Handle customer support, etc
Then YOU can contribute to the training mix and get useful artifacts in return. โค๏ธ
More details at hexgrad/Kokoro-82M#21