Hugging Face
ShelterW
0 followers · 2 following
AI & ML interests
None yet
Organizations
None yet
ShelterW's activity
liked
a model
8 days ago
deepseek-ai/DeepSeek-R1
Text Generation
•
Updated
1 day ago
•
149k
•
3.5k
New activity in
Qwen/Qwen2.5-Math-PRM-7B
11 days ago
If the response length exceeds 4096, is a sliding window used, or is it simply truncated?
#6 opened 11 days ago by
ShelterW
New activity in
Qwen/Qwen2.5-Math-PRM-7B
12 days ago
"<extra_0>" is not special token ? I got 5 token_ids ,is it right?
5
#4 opened 12 days ago by
ShelterW
New activity in
OpenLeecher/lmsys_chat_1m_clean
15 days ago
What is the accuracy of the Skywork/Skywork-Reward-Gemma-2-27B-v0.2? How much is the correct sample of 273K?
#5 opened 15 days ago by
ShelterW
New activity in
OpenLeecher/lmsys_chat_1m_clean
20 days ago
reward is None
1
#3 opened 20 days ago by
ShelterW
liked
a Space
26 days ago
Running on CPU Upgrade
604
🏆
Open ASR Leaderboard
liked
a model
26 days ago
hexgrad/Kokoro-82M
Text-to-Speech
•
Updated
3 days ago
•
38.9k
•
2.47k
liked
a model
about 1 month ago
unsloth/Llama-3.3-70B-Instruct-bnb-4bit
Text Generation
•
Updated
21 days ago
•
236k
•
29
updated
a model
about 2 months ago
ShelterW/Qwen2.5-Math-72B-Instruct-AWQ
Updated
Dec 10, 2024
liked
a model
about 2 months ago
Qwen/QwQ-32B-Preview
Text Generation
•
Updated
16 days ago
•
178k
•
1.6k
updated
2 datasets
about 2 months ago
ShelterW/chinese_common_ner
Viewer
•
Updated
Dec 6, 2024
•
110k
•
61
ShelterW/chinese_medical_ner
Viewer
•
Updated
Dec 6, 2024
•
251k
•
79
liked
a Space
2 months ago
Running
870
🔍
QwQ-32B-Preview
liked
a model
3 months ago
2Noise/ChatTTS
Text-to-Audio
•
Updated
Oct 22, 2024
•
27.8k
•
1.45k
liked
a dataset
5 months ago
BAAI/Infinity-Instruct
Viewer
•
Updated
12 days ago
•
20.4M
•
5.47k
•
585
liked
a dataset
6 months ago
lmsys/lmsys-chat-1m
Viewer
•
Updated
Jul 27, 2024
•
1M
•
2.2k
•
627
New activity in
unsloth/gemma-2-27b-it-bnb-4bit
6 months ago
hidden state is nan
1
#2 opened 6 months ago by
ShelterW
liked
3 models
6 months ago
mistralai/Mistral-Nemo-Instruct-2407
Text Generation
•
Updated
Nov 6, 2024
•
1.51M
•
1.42k
unsloth/Mistral-Nemo-Instruct-2407-bnb-4bit
Text Generation
•
Updated
Sep 11, 2024
•
15.4k
•
26
unsloth/gemma-2-27b-it-bnb-4bit
Text Generation
•
Updated
Sep 3, 2024
•
5.53k
•
11