Kristaller486's picture

Kristaller486

kristaller486

AI & ML interests

NLP, Machine Translation

Recent Activity

liked a dataset 1 day ago
HuggingFaceH4/no_robots
liked a model 2 days ago
secretmoon/YankaGPT-8B-v0.1
liked a dataset 3 days ago
Aniemore/resd_annotated
View all activity

Organizations

Social Post Explorers's profile picture

Posts 1

view post
Post
1434
Nebo-T1-Russian

(Probably) the first "longCoT" dataset for the Russian language created via Deeseek-R1.

- Prompts taken from the Sky-T1 dataset and translated via Llama3.3-70B.
- Answers and reasoning generated by Deepseek-R1 (685B).
- 16.4K samples in total, ≈12.4K Russian-only (in the rest, either the answer or reasoning is in English).
- Languages in the answers and reasoning are labeled using fasttext.

kristaller486/Nebo-T1-Russian