๐ From instruction-following to creative storytelling, dive into 2024's most impactful AI datasets! These gems are shaping everything from scientific research to video understanding.
Iโve published a new dataset to simplify model merging ๐ค
This dataset facilitates the search for compatible architectures for model merging with @arcee_aiโs mergekit, streamlining the automation of high-performance merge searches ๐
My latest project is the outcome of the last 2+ years working with TPUs from the amazing TPU Research Cloud (TRC) program and training Encoder-only LMs with the TensorFlow Model Garden library.
- Cheatsheet for setting-up a TPU VM Pod (with all necessary dependencies) to pretrain LMs with TF Model Garden - Conversion scripts that convert TF Model Garden weights to Hugging Face Transformers-compatible models - Supported architectures include BERT, BERT with Token Dropping and TEAMS
I also released BERT-based models pretrained on the great Hugging Face FineWeb and FineWeb-Edu datasets (10BT subset). With more to come!
Image Prompt Engineering Guide: โก๏ธ Artistic styling for Image generation โก๏ธ Prompt weighting using the parentheses method to generate realistic images. โก๏ธ Advanced features like style and positioning control[experimental]. โก๏ธ Image placement on the generated AI image using Recraft V3 Mockup.