Hugging Face
PRO
Etherll
13 followers · 11 following
AI & ML interests: None yet
Recent Activity
liked a model 1 day ago
open-r1/OlympicCoder-7B
liked a Space 22 days ago
Reality123b/XylariaDeepReason
reacted to s-emanuilov's post with 🔥 about 1 month ago
Tutorial 💥 Training a non-English reasoning model with GRPO and Unsloth
I wanted to share my experiment with training reasoning models in languages other than English/Chinese. Using Llama 3.1 8B as base, GRPO trainer from trl, and Unsloth optimizations, I got a working prototype in Bulgarian after ~5 hours on an L40S GPU. The approach should work for any language where the base model has some pre-training coverage.
Full code and tutorial here: https://unfoldai.com/reasoning-in-a-non-english-language/
The model itself: https://huggingface.co/s-emanuilov/LLMBG-Llama-3.1-8B-BG-Reasoning-v0.1
I hope this helps anyone looking to build reasoning models in their language.
Etherll's activity
liked a model 1 day ago
open-r1/OlympicCoder-7B
Text Generation • Updated 1 day ago • 641 • 71
liked a Space 22 days ago
XylariaDeepReason 💬
Sleeping • 2
Generate detailed research summaries from queries
liked a dataset about 1 month ago
fawazahmed0/hadith-data
Viewer • Updated Oct 30, 2024 • 300k • 108 • 3
liked a dataset 4 months ago
Dreamslol/svelte-5-sveltekit-2
Viewer • Updated Nov 15, 2024 • 4.93k • 101 • 6
liked 2 models 6 months ago
mradermacher/Herplete-LLM-Llama-3.1-8b-i1-GGUF
Updated Sep 26, 2024 • 711 • 1
mradermacher/Herplete-LLM-Llama-3.1-8b-GGUF
Updated Sep 26, 2024 • 363 • 2
liked a model 7 months ago
Etherll/Herplete-LLM-Llama-3.1-8b
Updated Sep 3, 2024 • 154 • 5
liked a dataset 7 months ago
AIRRC/Eudaimonic
Viewer • Updated Aug 21, 2024 • 5.72k • 138 • 2
liked a Space 7 months ago
Ghost 8b Beta Coder (Etherll) 🦀
Paused • 2
liked a dataset 7 months ago
Sephfox/A.I.R.RConsciousnessDataset
Viewer • Updated Aug 19, 2024 • 1.32k • 82 • 3
liked a model 10 months ago
refuelai/Llama-3-Refueled
Text Generation • Updated May 9, 2024 • 1.9k • 192
liked a Space about 1 year ago
DynamiCrafter 🐨
Running on Zero • 275
Generate videos from images and text prompts