license: apache-2.0 | |
datasets: | |
- Fishfishfishfishfish/Synthetic_text.txt | |
language: | |
- en | |
Each safetensors file represents a different hidden dim value. | |
Each trained for 1 epoch. | |
inference.py must be edited for each safetensors. | |
sequence_length = 64 | |
batch_size = 16 | |
learning_rate = 0.0001 | |
embedding_dim = 256 | |
num_layers = 4 |