Definitely NotJ
CrappyREincarnation
AI & ML interests
None yet
Recent Activity
reacted
to
kristaller486's
post
with ๐
1 day ago
Nebo-T1-Russian
(Probably) the first "longCoT" dataset for the Russian language created via Deeseek-R1.
- Prompts taken from the Sky-T1 dataset and translated via Llama3.3-70B.
- Answers and reasoning generated by Deepseek-R1 (685B).
- 16.4K samples in total, โ12.4K Russian-only (in the rest, either the answer or reasoning is in English).
- Languages in the answers and reasoning are labeled using fasttext.
https://huggingface.co/datasets/kristaller486/Nebo-T1-Russian
Organizations
None yet
models
None public yet
datasets
None public yet