Starlento's picture

26 62

Starlento

Starlento

·

AI & ML interests

None yet

Recent Activity

liked a model 21 days ago

cognitivecomputations/DeepSeek-R1-AWQ

liked a model 27 days ago

Qwen/Qwen2.5-Omni-7B

liked a model about 2 months ago

ByteDance-Seed/UI-TARS-7B-DPO

View all activity

Organizations

None yet

Starlento's activity

New activity in openbmb/MiniCPM-2B-128k 11 months ago

Slow inference speed and high VRAM using huggingface transformers

#2 opened about 1 year ago by

New activity in 01-ai/Yi-1.5-34B-32K 11 months ago

what is the difference between this model and 01-ai/Yi-1.5-34B?

#2 opened 11 months ago by

New activity in m-a-p/neo_7b 11 months ago

Error loading the model

#2 opened 11 months ago by

New activity in alpindale/WizardLM-2-8x22B 11 months ago

function/tool calling capability still in place?

#10 opened 11 months ago by

New activity in 01-ai/Yi-1.5-34B-Chat 11 months ago

can you share the dataset?

#7 opened 11 months ago by

New activity in lmstudio-community/Meta-Llama-3-8B-Instruct-BPE-fix-GGUF 12 months ago

Is eos_token got fixed?

#1 opened 12 months ago by

New activity in crusoeai/Llama-3-8B-Instruct-262k-GGUF 12 months ago

long repeatitions

#2 opened 12 months ago by

New activity in Starlento/DPO-En-Zh-20k-handbook 12 months ago

[bot] Conversion to Parquet

#1 opened 12 months ago by

parquet-converter

New activity in Starlento/SFT-COIG-CQIA-handbook 12 months ago

[bot] Conversion to Parquet

#1 opened 12 months ago by

parquet-converter

New activity in openbmb/MiniCPM-2B-128k 12 months ago

generating extremely slow, compared to 4k length model

#4 opened 12 months ago by

New activity in s3nh/MiniCPM-2B-dpo-fp32-GGUF about 1 year ago

error loading model: create_tensor: tensor 'output.weight' not found ?

#1 opened about 1 year ago by

New activity in pbelcak/UltraFastBERT-1x11-long over 1 year ago

Missing weights for example code

#1 opened over 1 year ago by

New activity in openai/consistency-decoder over 1 year ago

Update README.md

#2 opened over 1 year ago by

New activity in TheBloke/Yi-34B-GPTQ over 1 year ago

ValueError: Tokenizer class YiTokenizer does not exist or is not currently imported.

#1 opened over 1 year ago by

New activity in adept/fuyu-8b over 1 year ago

Performance Sharing

#10 opened over 1 year ago by

New activity in mistralai/Mistral-7B-Instruct-v0.1 over 1 year ago

Unable to load checkpoint shards

#21 opened over 1 year ago by

New activity in BlinkDL/rwkv-4-raven almost 2 years ago

Strange Chinese answer for RWKV-4-Raven-7B-v12-Eng49%-Chn49%-Jpn1%-Other1%-20230530-ctx8192.pth

#23 opened almost 2 years ago by

New activity in lmsys/vicuna-13b-delta-v1.1 almost 2 years ago

vicuna-13b-delta-v1.1 output a bunch of meaningless text

#5 opened almost 2 years ago by

New activity in TheBloke/Vicuna-13B-1.1-GPTQ about 2 years ago

The model is broken

#1 opened about 2 years ago by

New activity in YoungMasterFromSect/Trauter_LoRAs about 2 years ago

Add Preview for Stable Diffusion Webui

#14 opened over 2 years ago by