Arthur Zucker
ArthurZ
AI & ML interests
None yet
Recent Activity
liked
a model
3 days ago
google/gemma-3-12b-it-qat-q4_0-gguf
liked
a model
4 days ago
hugging-quants/Llama-4-Scout-17B-16E-Instruct-fbgemm-unfused
liked
a model
5 days ago
nvidia/Llama-3_1-Nemotron-Ultra-253B-v1
Organizations
ArthurZ's activity
remove <|finetune_right_pad_id|> and change pad_token to <|finetune_right_pad|>
1
#25 opened 6 days ago
by
wukaixingxp

pad error
8
#25 opened 8 days ago
by
bobber
Bug in AutoModel
3
#26 opened 8 days ago
by
random-checkin

Cannot generate with BS > 1
1
#25 opened 7 days ago
by
chenjiel
change to spda
2
#14 opened 8 days ago
by
wukaixingxp

Fastest way for inference?
3
#28 opened 2 months ago
by
psycy
model-00078-of-000163.safetensors not marked safe?
2
#80 opened 2 months ago
by
aborst

Update tokenizer_config.json
#1 opened 3 months ago
by
ArthurZ

Upload transformers version
10
#3 opened 5 months ago
by
ArthurZ

Upload Meta-Llama-3-8B-Instruct, seqlen = 512, python, w_ compile.png
1
#392 opened 5 months ago
by
kwen2501
Update model weight
8
#13 opened 6 months ago
by
nguyen-brat
Update hidden_act to silu
2
#14 opened 6 months ago
by
ArthurZ

llama.cpp support
9
#1 opened 6 months ago
by
ayyylol

tokenizer_config.json is different from gemma-2-2b-it
2
#8 opened 6 months ago
by
dahara1
How can i use the full 24GB model instead of this separated safetensors files?
1
#8 opened 7 months ago
by
Valadaro
hidden_activation vs hidden_act in config.json
2
#10 opened 7 months ago
by
heheda
How to use safetensors?
2
#13 opened 7 months ago
by
prathi1729
lamma cpp ht to gguf not working
4
#2 opened 7 months ago
by
RameshRajamani
8-kv-heads
8
#14 opened 8 months ago
by
ArthurZ

Update config.json
#17 opened 8 months ago
by
ArthurZ
