Gemma 2 JPN Release Collection A Gemma 2 2B model fine-tuned on Japanese text. It supports the Japanese language the same level of performance of EN only queries on Gemma 2. • 3 items • Updated 2 days ago • 15
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated 23 days ago • 76
Reranking Model Collection A collection of Korean-specific reranking models • 2 items • Updated Aug 16 • 2
Minitron Collection A family of compressed models obtained via pruning and knowledge distillation • 9 items • Updated 2 days ago • 54
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems Paper • 2407.01370 • Published Jul 1 • 85
Awesome feedback datasets Collection A curated list of datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO. • 19 items • Updated Apr 12 • 65
view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28 • 148
Piccolo2: General Text Embedding with Multi-task Hybrid Loss Training Paper • 2405.06932 • Published May 11 • 16
view article Article Expanding Model Context and Creating Chat Models with a Single Click By maywell • Apr 28 • 37