Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
144.5
TFLOPS
673
15
191
Arthur Zucker
ArthurZ
Follow
Mephistopheles-0's profile picture
samuelobrien's profile picture
yuanpeig's profile picture
307 followers
ยท
17 following
art_zucker
ArthurZucker
AI & ML interests
None yet
Recent Activity
liked
a model
15 days ago
meta-llama/Llama-3.2-1B-Instruct
liked
a Space
15 days ago
m-ric/llm-race-to-the-top
reacted
to
MonsterMMORPG
's
post
with ๐
27 days ago
FLUX Redux is a hidden Gem I am still doing huge research to publish an amazing fully Public - no paywalled Tutorial, but this is generated via SwarmUI Style Model Merge Strength : 0.5 FLUX Guidance Scale is : 6 Used base model is my FLUX fine tuned model with 256 images via Kohya SS GUI as shown in tutorial ( https://youtu.be/FvpWy1x5etM ) - 70 epoch Prompt : anime ohwx man walking in a jungle <segment:yolo-face_yolov9c.pt-1,0.7,0.5> ohwx man, anime
View all activity
Articles
Fixing Gradient Accumulation
Oct 16
โข
43
Improving Hugging Face Training Efficiency Through Packing with Flash Attention
Aug 21
โข
25
Fine-Tuning Gemma Models in Hugging Face
Feb 23
โข
27
Code Llama: Llama 2 learns to code
Aug 25, 2023
โข
9
Organizations
ArthurZ
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
mistralai/Pixtral-Large-Instruct-2411
about 1 month ago
Upload transformers version
7
#3 opened about 1 month ago by
ArthurZ
New activity in
huggingface/documentation-images
about 1 month ago
Upload Meta-Llama-3-8B-Instruct, seqlen = 512, python, w_ compile.png
1
#392 opened about 1 month ago by
kwen2501
New activity in
mistral-community/pixtral-12b
2 months ago
Update model weight
8
#13 opened 2 months ago by
nguyen-brat
Update hidden_act to silu
2
#14 opened 2 months ago by
ArthurZ
New activity in
rhymes-ai/Aria
3 months ago
llama.cpp support
9
#1 opened 3 months ago by
ayyylol
New activity in
google/gemma-2-2b-jpn-it
3 months ago
tokenizer_config.json is different from gemma-2-2b-it
2
#8 opened 3 months ago by
dahara1
New activity in
mistral-community/pixtral-12b
3 months ago
How can i use the full 24GB model instead of this separated safetensors files?
1
#8 opened 3 months ago by
Valadaro
New activity in
meta-llama/Llama-3.2-11B-Vision-Instruct
3 months ago
hidden_activation vs hidden_act in config.json
2
#10 opened 3 months ago by
heheda
New activity in
mistral-community/pixtral-12b-240910
3 months ago
How to use safetensors?
2
#13 opened 3 months ago by
prathi1729
New activity in
mistral-community/pixtral-12b
3 months ago
lamma cpp ht to gguf not working
4
#2 opened 3 months ago by
RameshRajamani
New activity in
meta-llama/Llama-3.1-405B-Instruct-FP8
4 months ago
8-kv-heads
8
#14 opened 5 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B-FP8
4 months ago
Update config.json
#17 opened 4 months ago by
ArthurZ
Config KV Heads should be 8 now?
1
#16 opened 5 months ago by
tanmaylaud
New activity in
meta-llama/Llama-3.1-405B-Instruct-FP8
5 months ago
8 kv heads
2
#13 opened 5 months ago by
kkokkie2360
New activity in
meta-llama/Llama-3.1-405B-FP8
5 months ago
8-kv-heads
#15 opened 5 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B
5 months ago
8-kv-heads
3
#21 opened 5 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B-Instruct
5 months ago
8-kv-heads
4
#17 opened 5 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B-FP8
5 months ago
Updated eos_token to include multiple IDs
1
#14 opened 5 months ago by
vontimitta
Update tokenizer to prepend special token
#12 opened 5 months ago by
lysandre
New activity in
meta-llama/Llama-3.1-70B
5 months ago
Update tokenizer to prepend special token
1
#11 opened 5 months ago by
lysandre
Load more