datasets:
- togethercomputer/RedPajama-Data-1T-Sample
tags:
- llama2
- llama
Similar to llama2-22b, but merged with BLOCK_DIAGONAL=false and fine-tuned on twice as many tokens.
As with llama2-22b, this model is not intended for direct use. It is meant as a base for further tuning and merging.