ShelterW
ShelterW
·
AI & ML interests
None yet
Recent Activity
updated
a model
13 days ago
ShelterW/TinyR1-32B-Preview-AWQ
published
a model
13 days ago
ShelterW/TinyR1-32B-Preview-AWQ
liked
a dataset
19 days ago
Congliu/Chinese-DeepSeek-R1-Distill-data-110k
Organizations
None yet
ShelterW's activity
If the response length exceeds 4096, is a sliding window used, or is it simply truncated?
#6 opened about 2 months ago
by
ShelterW
"<extra_0>" is not special token ? I got 5 token_ids ,is it right?
5
#4 opened about 2 months ago
by
ShelterW
What is the accuracy of the Skywork/Skywork-Reward-Gemma-2-27B-v0.2? How much is the correct sample of 273K?
1
#5 opened about 2 months ago
by
ShelterW
reward is None
1
#3 opened 2 months ago
by
ShelterW
hidden state is nan
1
#2 opened 7 months ago
by
ShelterW
Update README.md
#1 opened almost 2 years ago
by
ShelterW
Update README.md
#1 opened almost 2 years ago
by
ShelterW
Update README.md
#1 opened almost 2 years ago
by
ShelterW