Starlento
AI & ML interests
None yet
Recent Activity
liked
a model
21 days ago
cognitivecomputations/DeepSeek-R1-AWQ
liked
a model
27 days ago
Qwen/Qwen2.5-Omni-7B
liked
a model
about 2 months ago
ByteDance-Seed/UI-TARS-7B-DPO
Organizations
None yet
Starlento's activity
Slow inference speed and high VRAM using huggingface transformers
2
#2 opened about 1 year ago
by
Starlento

what is the difference between this model and 01-ai/Yi-1.5-34B?
3
#2 opened 11 months ago
by
muziyongshixin
Error loading the model
1
#2 opened 11 months ago
by
Starlento

function/tool calling capability still in place?
1
#10 opened 11 months ago
by
aleclaza
can you share the dataset?
2
#7 opened 11 months ago
by
adeebDkheel
Did eos_token get fixed?
3
#1 opened 12 months ago
by
Starlento

long repetitions
7
#2 opened 12 months ago
by
subbur
[bot] Conversion to Parquet
#1 opened 12 months ago
by
parquet-converter

[bot] Conversion to Parquet
#1 opened 12 months ago
by
parquet-converter

generating extremely slow, compared to 4k length model
2
#4 opened 12 months ago
by
CHNtentes
error loading model: create_tensor: tensor 'output.weight' not found ?
6
#1 opened about 1 year ago
by
wukongai
Missing weights for example code
4
#1 opened over 1 year ago
by
Starlento

Update README.md
1
1
#2 opened over 1 year ago
by
sayakpaul

ValueError: Tokenizer class YiTokenizer does not exist or is not currently imported.
1
#1 opened over 1 year ago
by
Starlento

Performance Sharing
2
#10 opened over 1 year ago
by
Starlento

Unable to load checkpoint shards
8
#21 opened over 1 year ago
by
Tilakraj0308
Strange Chinese answer for RWKV-4-Raven-7B-v12-Eng49%-Chn49%-Jpn1%-Other1%-20230530-ctx8192.pth
3
#23 opened almost 2 years ago
by
Starlento

vicuna-13b-delta-v1.1 output a bunch of meaningless text
1
7
#5 opened almost 2 years ago
by
resley

The model is broken
7
#1 opened about 2 years ago
by
Fenfel

Add Preview for Stable Diffusion Webui
2
#14 opened over 2 years ago
by
Starlento
