metadata
title: README
emoji: π
colorFrom: yellow
colorTo: indigo
sdk: static
pinned: false
The first version of LLaMA-O1 has been uploaded to Hugging Face. Here we come!
Supervised: https://huggingface.co/SimpleBerry/LLaMA-O1-Supervised-1129
Base(Pretrain): https://huggingface.co/SimpleBerry/LLaMA-O1-Base-1127
Supervised Finetune Dataset: https://huggingface.co/datasets/SimpleBerry/OpenLongCoT-SFT
Pretraining Dataset: https://huggingface.co/datasets/SimpleBerry/OpenLongCoT-Pretrain-1202
RLHF is on the way! See our GitHub repo: https://github.com/SimpleBerry/LLaMA-O1
Our ongoing related research: https://huggingface.co/papers/2406.07394 https://huggingface.co/papers/2410.02884 https://huggingface.co/papers/2411.18203