Nilabhra Roy Chowdhury
nilabhra
AI & ML interests
None yet
Recent Activity
liked a model
7 days ago
nvidia/Llama-3_1-Nemotron-Ultra-253B-v1
authored a paper
12 days ago
ZClip: Adaptive Spike Mitigation for LLM Pre-Training
upvoted a paper
12 days ago
ZClip: Adaptive Spike Mitigation for LLM Pre-Training
Organizations
nilabhra's activity
add chat template to tokenizer_config.json
1
#111 opened over 1 year ago by epignatelli
Update README.md
#1 opened 10 months ago by reach-vb

Update blogpost link
1
#1 opened 11 months ago by ZennyKenny

Template
1
1
#7 opened 11 months ago by rhaymison

KeyError: 'transformer.h.26.input_layernorm.weight'
2
#5 opened 11 months ago by Mabeck
Update README.md
#6 opened 11 months ago by rcojocaru

Update README.md
1
#3 opened 11 months ago by eltociear

Create README.md
#1 opened 11 months ago by rcojocaru

Difference in performance compared to the original Yi-34B
2
#8 opened over 1 year ago by nilabhra