Running 99 99 TxT360: Trillion Extracted Text ๐ Create a large, deduplicated dataset for LLM pre-training
facebook/seamless-m4t-v2-large Automatic Speech Recognition โข Updated Jan 4, 2024 โข 178k โข โข 770