Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
144.5
TFLOPS
667
14
182
Arthur Zucker
ArthurZ
Follow
KvrParaskevi's profile picture
on1onmangoes's profile picture
Bruno's profile picture
262 followers
·
17 following
art_zucker
ArthurZucker
AI & ML interests
None yet
Articles
Improving Hugging Face Training Efficiency Through Packing with Flash Attention
Aug 21
•
20
Fine-Tuning Gemma Models in Hugging Face
Feb 23
•
22
Code Llama: Llama 2 learns to code
Aug 25, 2023
•
5
Organizations
ArthurZ
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
google/gemma-2-2b-jpn-it
about 6 hours ago
tokenizer_config.json is different from gemma-2-2b-it
1
#8 opened about 11 hours ago by
dahara1
New activity in
mistral-community/pixtral-12b
8 days ago
How can i use the full 24GB model instead of this separated safetensors files?
1
#8 opened 10 days ago by
Valadaro
New activity in
meta-llama/Llama-3.2-11B-Vision-Instruct
12 days ago
hidden_activation vs hidden_act in config.json
2
#10 opened 13 days ago by
heheda
New activity in
mistral-community/pixtral-12b-240910
13 days ago
How to use safetensors?
2
#13 opened 13 days ago by
prathi1729
New activity in
mistral-community/pixtral-12b
14 days ago
lamma cpp ht to gguf not working
4
#2 opened 16 days ago by
RameshRajamani
New activity in
meta-llama/Llama-3.1-405B-Instruct-FP8
about 2 months ago
8-kv-heads
8
#14 opened about 2 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B-FP8
about 2 months ago
Update config.json
#17 opened about 2 months ago by
ArthurZ
Config KV Heads should be 8 now?
1
#16 opened about 2 months ago by
tanmaylaud
New activity in
meta-llama/Llama-3.1-405B-Instruct-FP8
about 2 months ago
8 kv heads
2
#13 opened 2 months ago by
kkokkie2360
New activity in
meta-llama/Llama-3.1-405B-FP8
about 2 months ago
8-kv-heads
#15 opened 2 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B
about 2 months ago
8-kv-heads
3
#21 opened 2 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B-Instruct
about 2 months ago
8-kv-heads
4
#17 opened 2 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B-FP8
2 months ago
Updated eos_token to include multiple IDs
1
#14 opened 2 months ago by
vontimitta
Update tokenizer to prepend special token
#12 opened 2 months ago by
lysandre
New activity in
meta-llama/Llama-3.1-70B
2 months ago
Update tokenizer to prepend special token
1
#11 opened 2 months ago by
lysandre
New activity in
meta-llama/Llama-3.1-8B-Instruct
2 months ago
Upload tokenizer
2
#29 opened 2 months ago by
ArthurZ
Upload tokenizer
#28 opened 2 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B-Instruct-FP8
2 months ago
Upload tokenizer
1
#9 opened 2 months ago by
ArthurZ
Update `_name_or_path` to the HF model id
#8 opened 2 months ago by
davidthomas426
New activity in
meta-llama/Llama-3.1-8B
2 months ago
Update tokenizer to prepend special token
1
#12 opened 2 months ago by
lysandre
New activity in
meta-llama/Llama-3.1-405B-Instruct
2 months ago
Upload tokenizer
1
#9 opened 2 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-70B-Instruct
2 months ago
Upload tokenizer
1
#12 opened 2 months ago by
ArthurZ
New activity in
ArthurZ/new-t5-base
2 months ago
Upload tokenizer
#1 opened 2 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-8B-Instruct
2 months ago
Upload tokenizer
#27 opened 2 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-70B-Instruct
2 months ago
Upload tokenizer
#11 opened 2 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B-FP8
2 months ago
Fix quantization_config to work with vLLM v0.5.3.post1
1
#11 opened 2 months ago by
davidthomas426
New activity in
meta-llama/Llama-3.1-8B-Instruct
2 months ago
DO NOT MERGE v2 make sure vllm and transformers work
#12 opened 2 months ago by
ArthurZ
DO NOT MERGE test for vllm
2
#11 opened 2 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-70B
2 months ago
Can we add `use_scaled_rope` in the config.json?
4
#2 opened 3 months ago by
lanking
New activity in
meta-llama/Llama-Guard-3-8B-INT8
2 months ago
Update config.json
#6 opened 2 months ago by
ArthurZ
New activity in
meta-llama/Llama-Guard-3-8B
2 months ago
Update config.json
#9 opened 2 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-70B
2 months ago
Update config.json
#9 opened 2 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-70B-Instruct
2 months ago
Update config.json
#6 opened 2 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-70B
2 months ago
Update config.json
#8 opened 2 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-8B
2 months ago
Update config.json
#10 opened 2 months ago by
ArthurZ
New activity in
google/gemma-2-27b-it
3 months ago
Model repeating information and "spitting out" random characters
3
#12 opened 3 months ago by
brazilianslib
Hallucinations, misspellings etc. Something seems broken?
21
#10 opened 3 months ago by
sam-paech
transformers load fails?
7
#6 opened 3 months ago by
bdambrosio
New activity in
google/gemma-2-9b
3 months ago
Runtime autograd error due to inplace operations
1
#4 opened 3 months ago by
xianbin
New activity in
microsoft/Florence-2-large
3 months ago
Please add to llama.cpp and ollama
3
#21 opened 3 months ago by
KeilahElla
New activity in
meta-llama/Meta-Llama-3-8B
4 months ago
Why are "add_bos_token" and "add_eos_token" missing in tokenizer_config.json ?
1
#140 opened 5 months ago by
ekurtic
New activity in
mistralai/Mistral-7B-Instruct-v0.3
4 months ago
Slow tokenizer problem.
4
#22 opened 4 months ago by
bradhutchings
New activity in
meta-llama/Meta-Llama-3-8B
4 months ago
LlamaTokenizerFast.from_pretrained gives incorrect number of tokens for Llama3
3
#156 opened 4 months ago by
farzadab
New activity in
mistralai/Mistral-7B-Instruct-v0.3
5 months ago
Add minor reference to transformers
4
#7 opened 5 months ago by
osanseviero
Upload tokenizer
#6 opened 5 months ago by
ArthurZ
Upload tokenizer
#5 opened 5 months ago by
ArthurZ
New activity in
mistralai/Mistral-7B-v0.3
5 months ago
Update README.md
#4 opened 5 months ago by
ArthurZ
Update README.md
#3 opened 5 months ago by
ArthurZ
New activity in
mistralai/Mistral-7B-Instruct-v0.3
5 months ago
Update README.md
#4 opened 5 months ago by
ArthurZ
Update config.json
1
#3 opened 5 months ago by
ArthurZ
New activity in
mistralai/Mistral-7B-v0.3
5 months ago
Upload MistralForCausalLM
#2 opened 5 months ago by
ArthurZ
New activity in
mistralai/Mistral-7B-Instruct-v0.3
5 months ago
Upload MistralForCausalLM
#2 opened 5 months ago by
ArthurZ
New activity in
mistralai/Mistral-7B-v0.3
5 months ago
Upload tokenizer
1
#1 opened 5 months ago by
ArthurZ
New activity in
mistralai/Mistral-7B-Instruct-v0.3
5 months ago
Upload tokenizer
#1 opened 5 months ago by
ArthurZ
New activity in
01-ai/Yi-9B
5 months ago
Tokenizer inconsistencies related to HTML tags
4
#11 opened 6 months ago by
sanderland
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
5 months ago
Update config.json
1
#105 opened 5 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3-70B-Instruct
5 months ago
Update config.json
3
#49 opened 5 months ago by
ArthurZ
The sample code for usage with Transformers is incorrect.
2
#45 opened 5 months ago by
endNone
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
5 months ago
How to use EOT_ID
4
#54 opened 5 months ago by
saksham-lamini
New activity in
meta-llama/Meta-Llama-3-8B
5 months ago
Setting `pad_token_id` to `eos_token_id`:128001 for open-end generation.
9
#72 opened 5 months ago by
tianke0711
Load more