Learning Adaptive Parallel Reasoning with Language Models Paper • 2504.15466 • Published 3 days ago • 35
Describe Anything: Detailed Localized Image and Video Captioning Paper • 2504.16072 • Published 1 day ago • 43
LM-Parallel/grpo_llama-hs-v3_bs64_rollout5-lr1e-5-seq-weighted-kl0.01-20250319052012 Updated 10 days ago
LM-Parallel/grpo_llama-hsp-v3_bs64_rollout5-lr1e-5-sw-t1.0-kl0.001-sc10-bm10sbm15-20250411103359 Updated 10 days ago • 2
LM-Parallel/grpo_llama-hsp-v3_bs64_rollout5-lr1e-5-sw-t1.0-kl0.001-bm10-sbm15-nc-20250411054109 Updated 10 days ago
LM-Parallel/grpo_llama-hsp-v3_bs64_rollout5-lr1e-5-sw-t1.0-kl0.01-sc10-bm10sbm15-20250325133311 Updated 10 days ago