title: README
emoji: 👀
colorFrom: yellow
colorTo: indigo
sdk: static
pinned: false

The first version of LLaMA-O1 is now available on Hugging Face. Here we come!

Supervised: https://huggingface.co/SimpleBerry/LLaMA-O1-Supervised-1129

Base (Pretrain): https://huggingface.co/SimpleBerry/LLaMA-O1-Base-1127

Supervised Fine-tuning Dataset: https://huggingface.co/datasets/SimpleBerry/OpenLongCoT-SFT

Pretraining Dataset: https://huggingface.co/datasets/SimpleBerry/OpenLongCoT-Pretrain-1202

RLHF is on the way! See our GitHub repo: https://github.com/SimpleBerry/LLaMA-O1

Our ongoing related research: https://huggingface.co/papers/2406.07394 https://huggingface.co/papers/2410.02884 https://huggingface.co/papers/2411.18203