Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
4990.1
TFLOPS
3
1
1
WU Junyan
SupercarryNg
Follow
21world's profile picture
1 follower
·
0 following
AI & ML interests
None yet
Recent Activity
new
activity
10 months ago
deepseek-ai/DeepSeek-V2-Chat:
Can you provide a sample code for training with DeepSpeed ZeRO3?
liked
a model
12 months ago
meta-llama/Meta-Llama-3-70B-Instruct
new
activity
about 1 year ago
codellama/CodeLlama-70b-Instruct-hf:
Respect the add_generation_prompt parameter of apply_chat_template
View all activity
Organizations
None yet
SupercarryNg
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
deepseek-ai/DeepSeek-V2-Chat
10 months ago
Can you provide a sample code for training with DeepSpeed ZeRO3?
2
#10 opened 11 months ago by
SupercarryNg
liked
a model
12 months ago
meta-llama/Meta-Llama-3-70B-Instruct
Text Generation
•
Updated
Dec 15, 2024
•
491k
•
•
1.47k
New activity in
codellama/CodeLlama-70b-Instruct-hf
about 1 year ago
Respect the add_generation_prompt parameter of apply_chat_template
3
#17 opened about 1 year ago by
noamwies
New activity in
deepseek-ai/deepseek-moe-16b-base
about 1 year ago
Wrong Special Token
2
#1 opened about 1 year ago by
SupercarryNg