Spaces:
Running
Running
title: README | |
emoji: π | |
colorFrom: yellow | |
colorTo: indigo | |
sdk: static | |
pinned: false | |
The first version of LLaMA-O1 has been uploaded to HF now!Here We Come! | |
Supervised: | |
https://huggingface.co/SimpleBerry/LLaMA-O1-Supervised-1129 | |
Base(Pretrain): | |
https://huggingface.co/SimpleBerry/LLaMA-O1-Base-1127 | |
Supervised Finetune Dataset: | |
https://huggingface.co/datasets/SimpleBerry/OpenLongCoT-SFT | |
Pretraining Dataset: | |
https://huggingface.co/datasets/SimpleBerry/OpenLongCoT-Pretrain-1202 | |
RLHF is on the way! View our GitHub Repo: | |
https://github.com/SimpleBerry/LLaMA-O1 | |
Our ongoing related researches: | |
https://huggingface.co/papers/2406.07394 | |
https://huggingface.co/papers/2410.02884 | |
https://huggingface.co/papers/2411.18203 | |