bradhiltonendercorp commited on
Commit
23a6144
·
verified ·
1 Parent(s): c967046

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -19,7 +19,7 @@ Deductive Reasoning Qwen 32B is a reinforcement fine-tune of [Qwen 2.5 32B Instr
19
 
20
  Here are some additional resources to check out:
21
 
22
- - Blog Post
23
  - [Training Recipe](https://github.com/openpipe/deductive-reasoning)
24
  - [RL Experiments](https://github.com/openpipe/rl-experiments)
25
  - [Deductive Reasoning Qwen 14B](https://huggingface.co/OpenPipe/Deductive-Reasoning-Qwen-14B)
 
19
 
20
  Here are some additional resources to check out:
21
 
22
+ - [Blog Post](https://openpipe.ai/blog/using-grpo-to-beat-o1-o3-mini-and-r1-on-temporal-clue)
23
  - [Training Recipe](https://github.com/openpipe/deductive-reasoning)
24
  - [RL Experiments](https://github.com/openpipe/rl-experiments)
25
  - [Deductive Reasoning Qwen 14B](https://huggingface.co/OpenPipe/Deductive-Reasoning-Qwen-14B)