--- license: apache-2.0 datasets: - Fishfishfishfishfish/Synthetic_text.txt language: - en --- Each safetensors file represents a different hidden dim value. Each trained for 1 epoch. inference.py must be edited for each safetensors. sequence_length = 64 batch_size = 16 learning_rate = 0.0001 embedding_dim = 256 num_layers = 4