Shubham Toshniwal

stoshniwal

AI & ML interests

NLP, LLM

Recent Activity

Organizations

NVIDIA

stoshniwal's activity

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-32B 3 months ago

Tokenizer config is wrong

8
#10 opened 3 months ago by stoshniwal
upvoted an article 6 months ago
Article

Fixing Gradient Accumulation

53
New activity in nvidia/OpenMathInstruct-2 6 months ago

Upload scaling_plot.jpg

#4 opened 6 months ago by shtoshni