A Based model that normalizes the attention output with LayerNorm instead of dividing by `QK.sum(-1)`, for better hardware efficiency.
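To make the difference concrete, here is a minimal NumPy sketch of the two normalization choices. It is not this model's actual code: the function names, the elementwise `feature_map`, and the quadratic-time masked formulation are all assumptions chosen for readability, and the LayerNorm here has no learnable scale or bias.

```python
import numpy as np

def feature_map(x):
    # Hypothetical positive feature map (elementwise 2nd-order Taylor
    # expansion of exp). A stand-in, not necessarily the one this model uses.
    return 1.0 + x + 0.5 * x**2

def causal_scores(q, k):
    # (T, T) matrix of feature-mapped query/key products,
    # masked so position i only sees positions j <= i.
    return np.tril(feature_map(q) @ feature_map(k).T)

def attn_sum_norm(q, k, v, eps=1e-6):
    # Baseline: divide each output row by the sum of its scores, QK.sum(-1).
    qk = causal_scores(q, k)
    return (qk @ v) / (qk.sum(-1, keepdims=True) + eps)

def attn_layernorm(q, k, v, eps=1e-6):
    # Variant: skip the score sum and apply LayerNorm (no affine parameters)
    # over the feature dimension of the unnormalized output.
    y = causal_scores(q, k) @ v
    mean = y.mean(-1, keepdims=True)
    var = y.var(-1, keepdims=True)
    return (y - mean) / np.sqrt(var + eps)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    q, k = rng.standard_normal((2, 5, 4))  # T=5 tokens, head dim 4
    v = rng.standard_normal((5, 3))        # value dim 3
    print(attn_sum_norm(q, k, v).shape, attn_layernorm(q, k, v).shape)
```

In this sketch the LayerNorm variant needs only the unnormalized output, so the per-row score sum never has to be computed or carried alongside it. The two variants do not produce the same values.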

