TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning
Paper • 2505.14625 • Published • 13
TinyV
💬 Verify model answers against ground truth
zhangchenxu/TinyV-Qwen3-1.7B
Text Generation • 2B • Updated • 5
zhangchenxu/TinyV-Qwen3-1.7B-Think
Text Generation • 2B • Updated • 15 • 3
Zhangchen Xu PRO
zhangchenxu
AI & ML interests
LLM Data, Alignment, Post-Training, Safety
Recent Activity
new activity
about 22 hours ago
Agent-Ark/Toucan-1.5M:[bot] Conversion to Parquet
new activity
about 22 hours ago
Agent-Ark/Toucan-1.5M:Can we add information about which models are used to generate the question?
upvoted a paper
1 day ago
CoDA: Agentic Systems for Collaborative Data Visualization