This is a GPT-NeoX model trained on 50 billion tokens from The Pile, using the Online Data Mixing method.

The OpenLLM leaderboard won't let me submit my model because the description is too short, so I'm adding more characters to the description in hopes that it will be evaluated.

Downloads last month
73
Safetensors
Model size
976M params
Tensor type
F32
·
U8
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.