much emojies , such wow..
Phyo Arkar Lwin
v3ss0n
ยท
AI & ML interests
None yet
Recent Activity
new activity
14 days ago
unsloth/README:I can't run any of the dynamic bnb-4bit quants with TextGenerationInference
new activity
14 days ago
Qwen/Qwen2.5-Max-Demo:Request to Release Qwen2.5-Max as Open Source Model
new activity
about 2 months ago
mistralai/Mistral-Small-24B-Instruct-2501:Remove gated access?
Organizations
None yet
v3ss0n's activity
I can't run any of the dynamic bnb-4bit quants with TextGenerationInference
2
#6 opened about 2 months ago
by
v3ss0n
Request to Release Qwen2.5-Max as Open Source Model
4
#8 opened 2 months ago
by
quantflex

Remove gated access?
2
#25 opened 2 months ago
by
davidmezzetti

fix: strftime_now is unknown (in <string>:1)
8
#17 opened 2 months ago
by
v3ss0n
Why increase censorship?
21
#20 opened 2 months ago
by
notafraud

Request access to the model
1
#22 opened 2 months ago
by
klydekushy
Adding tool call support in chat template
26
#13 opened 2 months ago
by
Navanit-AI

Commit #e969dcf155adde0b0654770948d93d1b2646d3f4 Introduced `strftime_now` and it is unknown in TGI.
3
#8 opened 2 months ago
by
v3ss0n
chat template doesn't include tools
10
#3 opened 2 months ago
by
copasseron
Add system message to chat template
1
#6 opened 2 months ago
by
Rocketknight1

chat template
1
#9 opened 2 months ago
by
lucyknada

Getting error when trying to infernce using example , or lmdeploy.
2
#7 opened 9 months ago
by
v3ss0n
llama.cpp / gguf?
3
#3 opened 10 months ago
by
nacs
How much VRam does it need?
1
#6 opened 9 months ago
by
v3ss0n
Run inference in CPU
3
#1 opened 10 months ago
by
hythythyt3
Quantized model coming?
8
#3 opened 12 months ago
by
dnhkng
