Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated 16 days ago • 106
Running 534 Scaling test-time compute 📈 Enhance math problem solving by scaling test-time compute
Ferret: Refer and Ground Anything Anywhere at Any Granularity Paper • 2310.07704 • Published Oct 11, 2023 • 11
Memory Augmented Language Models through Mixture of Word Experts Paper • 2311.10768 • Published Nov 15, 2023 • 18
CodeFusion: A Pre-trained Diffusion Model for Code Generation Paper • 2310.17680 • Published Oct 26, 2023 • 69
Table-GPT: Table-tuned GPT for Diverse Table Tasks Paper • 2310.09263 • Published Oct 13, 2023 • 40
Aligning Text-to-Image Diffusion Models with Reward Backpropagation Paper • 2310.03739 • Published Oct 5, 2023 • 22
A Survey on Evaluation of Large Language Models Paper • 2307.03109 • Published Jul 6, 2023 • 42
sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity • Updated 7 days ago • 101M • 3.11k
humarin/chatgpt_paraphraser_on_T5_base Text2Text Generation • Updated Aug 1, 2024 • 17.8k • 179