KaraKaraWitch's picture

KaraKaraWitch

KaraKaraWitch

AI & ML interests

Making Text Datasets because it's fun. Ready to use datasets / Polished datasets available at "WitchesSocialStream" Org. (@karakarawitch on discord!)

Recent Activity

Organizations

Nothing Here's profile picture Ryoko AI's profile picture RyokoAI Extra's profile picture Tamakoma's profile picture Blog-explorers's profile picture AlppAI's profile picture recursal's profile picture Witches Social Stream's profile picture JunjouEmotional's profile picture Featherless Serverless LLM's profile picture KaraKaraWarehouse's profile picture

KaraKaraWitch's activity

reacted to nyuuzyou's post with 🔥👍 8 days ago
view post
Post
5472
🇷🇺 Russian Forum Messages Dataset - nyuuzyou/ruforum

Collection of approximately 58 million Russian forum messages featuring:

- Complete message content from Russian online forums spanning 2010-2025
- Comprehensive metadata including unique message IDs and timestamps
- Full text content preserving original user discussions and interactions
- Monolingual dataset focused exclusively on Russian language content

This dataset offers a unique textual archive of Russian online conversations suitable for text generation, sentiment analysis, and language modeling research. Released to the public domain under CC0 1.0 license.
updated a Space 23 days ago
published a model about 1 month ago