Messages about new models and report
pinned
1
#37 opened 16 days ago
by
infgrad
Error deploying model
#38 opened 4 days ago
by
jim-bo
How to cite or reference this work?
#36 opened 24 days ago
by
Lux1997
Can I run this model effectively in Google Colab?
1
#35 opened 26 days ago
by
k0rruptt
Updated model card with instructions to quantize the model
#34 opened 27 days ago
by
andrewqian123
Why doesn't the VRAM go down when I quantize to 4 GB (and other issues)
#33 opened 28 days ago
by
andrewqian123
Add Infinity example to the README
4
#32 opened about 1 month ago
by
michaelfeil
How can I fine-tune using LoRA? Is there sample code?
2
#27 opened about 2 months ago
by
sinchir0
Multilingual or bilingual?
#25 opened 3 months ago
by
MeanBean-05
Remote code execution risk
4
#24 opened 3 months ago
by
srivishnuceg
The output size when deployed in GCP is 1536 instead of 1024
7
#23 opened 3 months ago
by
bennegeek
Is this multilingual or bilingual? English and Chinese
#22 opened 3 months ago
by
taowang1993
Flash attention
#21 opened 4 months ago
by
Disassemblern
Model loading size on GPU
#20 opened 5 months ago
by
divrajnd
MRL and linear layers
1
#19 opened 5 months ago
by
bobox
Can it output sparse vectors?
1
#18 opened 5 months ago
by
kk3dmax
Getting different results for the same examples provided in the sample
4
#17 opened 5 months ago
by
sramakintel
Does this model only work on GPU?
1
#16 opened 5 months ago
by
xPurity
About quantized models
#14 opened 5 months ago
by
infgrad
Error when loading model KeyError: 'qwen2'
1
#11 opened 5 months ago
by
longluu
Any multilingual variant?
1
#10 opened 5 months ago
by
prophet123
Parameters for peak performance
3
#8 opened 5 months ago
by
cvdbdo
Difference between dunzhang/stella_en_1.5B_v5 and infgrad/stella_en_1.5B_v5?
1
#7 opened 5 months ago
by
gokturkDev
Model max_seq_length
7
#6 opened 5 months ago
by
shuyuej
Could you provide the training data list?
#5 opened 5 months ago
by
Mengyao00
Fix prompt_name typo
1
#4 opened 5 months ago
by
mber
Upload ONNX weights
2
#3 opened 6 months ago
by
Xenova