WU Junyan
SupercarryNg
AI & ML interests
None yet
Recent Activity
new activity
10 months ago
deepseek-ai/DeepSeek-V2-Chat:Can you provide a sample code for training with DeepSpeed ZeRO3?
liked
a model
12 months ago
meta-llama/Meta-Llama-3-70B-Instruct
Organizations
None yet
SupercarryNg's activity
Can you provide a sample code for training with DeepSpeed ZeRO3?
2
#10 opened 11 months ago
by
SupercarryNg

Respect the add_generation_prompt parameter of apply_chat_template
3
#17 opened about 1 year ago
by
noamwies
Wrong Special Token
2
#1 opened about 1 year ago
by
SupercarryNg
