CL-From-Nothing/rlve-rollouts-nemotron-cascade-8b-qwen3-1.7b Viewer • Updated about 17 hours ago • 1.04k
CL-From-Nothing/rlve-rollouts-nemotron-cascade-8b-qwen3-1.7b Viewer • Updated about 17 hours ago • 1.04k
CL-From-Nothing/DeepSeek-R1-0528-Qwen3-8B_sudoku_level_3_stitch_train_half_mask_student Viewer • Updated 4 days ago • 14.1k • 27
CL-From-Nothing/DeepSeek-R1-0528-Qwen3-8B_sudoku_level_3_stitch_train_half_mask_student Viewer • Updated 4 days ago • 14.1k • 27
CL-From-Nothing/Qwen3-4B-Thinking_sudoku_level_3_stitch_train_half_mask_student Viewer • Updated 7 days ago • 14.1k • 19
CL-From-Nothing/Qwen3-4B-Thinking_sudoku_level_3_stitch_train_half_mask_student Viewer • Updated 7 days ago • 14.1k • 19
CL-From-Nothing/DeepSeek-R1-0528-Qwen3-8B_kukurasu_train_2samples Viewer • Updated 16 days ago • 20k • 11
CL-From-Nothing/DeepSeek-R1-0528-Qwen3-8B_kukurasu_train_2samples Viewer • Updated 16 days ago • 20k • 11
CL-From-Nothing/Nemotron-Cascade-8B_minesweeper_offline_5K_rewrite_offline_rewrite Viewer • Updated 20 days ago • 5.01k • 10
CL-From-Nothing/Nemotron-Cascade-8B_sudoku_reasoning_offline_rewrite Viewer • Updated 20 days ago • 14.1k • 13
CL-From-Nothing/Nemotron-Cascade-8B_minesweeper_offline_5K_rewrite_offline_rewrite Viewer • Updated 20 days ago • 5.01k • 10
CL-From-Nothing/Nemotron-Cascade-8B_sudoku_reasoning_offline_rewrite Viewer • Updated 20 days ago • 14.1k • 13
SocialVeil: Probing Social Intelligence of Language Agents under Communication Barriers Paper • 2602.05115 • Published Feb 4 • 19