Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
12.7
TFLOPS
11
12
ll
PRO
Etherll
Follow
21world's profile picture
HazemE's profile picture
JelaMiraj's profile picture
13 followers
·
11 following
AI & ML interests
None yet
Recent Activity
liked
a model
1 day ago
open-r1/OlympicCoder-7B
liked
a Space
22 days ago
Reality123b/XylariaDeepReason
reacted
to
s-emanuilov
's
post
with 🔥
about 1 month ago
Tutorial 💥 Training a non-English reasoning model with GRPO and Unsloth I wanted to share my experiment with training reasoning models in languages other than English/Chinese. Using Llama 3.1 8B as base, GRPO trainer from trl, and Unsloth optimizations, I got a working prototype in Bulgarian after ~5 hours on an L40S GPU. The approach should work for any language where the base model has some pre-training coverage. Full code and tutorial here: https://unfoldai.com/reasoning-in-a-non-english-language/ The model itself: https://huggingface.co/s-emanuilov/LLMBG-Llama-3.1-8B-BG-Reasoning-v0.1 I hope this helps anyone looking to build reasoning models in their language.
View all activity
Organizations
models
24
Sort: Recently updated
Etherll/Qwen2.5-CodeFIM-1.5B-v2
Text Generation
•
Updated
Nov 11, 2024
•
49
•
3
Etherll/Qwen2.5-CodeFIM-1.5B-v2-Q8_0-GGUF
Updated
Nov 11, 2024
•
10
Etherll/Qwen2.5-7B-della-test
Text Generation
•
Updated
Nov 8, 2024
•
53
•
1
Etherll/SuperHermes
Text Generation
•
Updated
Oct 27, 2024
•
17
•
1
Etherll/Herplete-LLM-Llama-3.1-8b-Ties-Q5_K_M-GGUF
Updated
Oct 18, 2024
•
12
Etherll/Qwen2.5-Coder-1.5B-CodeFIM
Text Generation
•
Updated
Oct 16, 2024
•
66
•
3
Etherll/Qwen2.5-Coder-1.5B-CodeFIM-Q8_0-GGUF
Updated
Oct 16, 2024
•
5
Etherll/Herplete-LLM-Llama-3.1-8b-Ties
Text Generation
•
Updated
Oct 3, 2024
•
11
Etherll/Qwen2.5-Coder-7B-Instruct-Ties
Text Generation
•
Updated
Sep 30, 2024
•
17
•
1
Etherll/Qwen2.5-14b-web-Q6_K-GGUF
Updated
Sep 23, 2024
•
8
•
1
Expand 24 models
datasets
7
Sort: Recently updated
Etherll/code-fim-v2
Viewer
•
Updated
Oct 13, 2024
•
64k
•
101
•
3
Etherll/python-code-infill-test
Viewer
•
Updated
Oct 8, 2024
•
25.5k
•
79
Etherll/jokes
Viewer
•
Updated
Aug 18, 2024
•
244
•
19
•
3
Etherll/GatedLinearAttention_Transformers_Q_and_A
Viewer
•
Updated
Aug 16, 2024
•
149
•
40
•
1
Etherll/code-FIM-test4
Viewer
•
Updated
Aug 5, 2024
•
69.5k
•
61
Etherll/lady_nagant_v2
Updated
Apr 25, 2023
•
3
Etherll/lady_nagant
Updated
Mar 29, 2023
•
3