I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token Paper • 2412.06676 • Published 16 days ago • 9
konstantindobler/mistral7b-de-tokenizer-swap-pure-bf16-v2-anneal-ablation Text Generation • Updated Aug 23 • 12
konstantindobler/mistral7b-ar-tokenizer-swap-pure-bf16-anneal-ablation Text Generation • Updated Aug 23 • 17
kd-shared/culturax-ar-spbpe32k-focus-embs-anneal-bf16-mixed-xassy-final Text Generation • Updated Jun 25 • 11
Restarting on CPU Upgrade 117 🏆 Open Arabic LLM Leaderboard Track, rank and evaluate open Arabic LLMs and chatbots