LLaDA-8B-Base

We introduce LLaDA, a diffusion model trained entirely from scratch at an unprecedented 8B-parameter scale, with performance rivaling LLaMA3 8B.

Project Page

Code

Model size: 8.02B params
Tensor type: BF16
Format: Safetensors
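As a rough guide to what the listed size and tensor type imply for storage, the sketch below estimates the size of the weights alone. It assumes every one of the 8.02B parameters is stored in BF16 (2 bytes each) and ignores activations, caches, and framework overhead. The constant and function names are illustrative and do not come from the model's code.

```python
# Back-of-the-envelope estimate of weight storage only.
# Assumption: all parameters are stored as BF16 (16 bits = 2 bytes each).
PARAMS = 8_020_000_000   # 8.02B parameters, as listed on the model page
BYTES_PER_BF16 = 2       # bfloat16 is a 16-bit floating-point format


def weight_footprint_bytes(n_params: int, bytes_per_param: int) -> int:
    """Return the raw size of the weights in bytes."""
    return n_params * bytes_per_param


total = weight_footprint_bytes(PARAMS, BYTES_PER_BF16)
print(f"Weights: {total / 1e9:.2f} GB ({total / 2**30:.2f} GiB)")
```

Actual memory use during inference will be higher than this figure, since it covers only the stored weights.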
