Continuing my streak by releasing the Wikireading dataset: a large collection of scraped non-fiction books, predominantly in Russian.
its5Q/wikireading
Here are the highlights:
- ~7B tokens, or ~28B characters, making it a great candidate for use in pretraining
- Contains non-fiction works from many knowledge domains
- Includes both the original HTML and extracted text of book chapters
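If you want to poke at the data, here's a minimal loading sketch using the Hugging Face `datasets` library. It streams the dataset to avoid pulling the full ~28B-character dump at once; the split and field names are assumptions, so inspect the actual schema before relying on them.

```python
from datasets import load_dataset

# Stream the dataset instead of downloading it in full (assumed "train" split)
ds = load_dataset("its5Q/wikireading", split="train", streaming=True)

# Peek at the first record to see which fields are available
# (e.g. extracted text vs. original chapter HTML; exact names are assumptions)
first = next(iter(ds))
print(first.keys())
```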