Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains'
Yi Su
virtuoussy
Recent Activity
Updated a model 4 days ago: virtuoussy/Qwen2.5-7B-Instruct-RLVR
Updated a dataset 4 days ago: virtuoussy/Math-RLVR
Updated a dataset 4 days ago: virtuoussy/Multi-subject-RLVR