Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs Paper β’ 2504.07866 β’ Published 5 days ago β’ 7 β’ 3
Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs Paper β’ 2504.07866 β’ Published 5 days ago β’ 7 β’ 3
Boost Your Own Human Image Generation Model via Direct Preference Optimization with AI Feedback Paper β’ 2405.20216 β’ Published May 30, 2024 β’ 20 β’ 3
MoBA: Mixture of Block Attention for Long-Context LLMs Paper β’ 2502.13189 β’ Published Feb 18 β’ 16 β’ 2
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Paper β’ 2411.14405 β’ Published Nov 21, 2024 β’ 62 β’ 4
Zero-shot Model-based Reinforcement Learning using Large Language Models Paper β’ 2410.11711 β’ Published Oct 15, 2024 β’ 9 β’ 4
Context is Key(NMF): Modelling Topical Information Dynamics in Chinese Diaspora Media Paper β’ 2410.12791 β’ Published Oct 16, 2024 β’ 5 β’ 3
Named Clinical Entity Recognition Benchmark Paper β’ 2410.05046 β’ Published Oct 7, 2024 β’ 17 β’ 3
Training Language Models on Synthetic Edit Sequences Improves Code Synthesis Paper β’ 2410.02749 β’ Published Oct 3, 2024 β’ 12 β’ 3
LLaVA-Critic: Learning to Evaluate Multimodal Models Paper β’ 2410.02712 β’ Published Oct 3, 2024 β’ 36 β’ 3
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning Paper β’ 2409.12568 β’ Published Sep 19, 2024 β’ 51 β’ 4
Insights from Benchmarking Frontier Language Models on Web App Code Generation Paper β’ 2409.05177 β’ Published Sep 8, 2024 β’ 7 β’ 3
Open Language Data Initiative: Advancing Low-Resource Machine Translation for Karakalpak Paper β’ 2409.04269 β’ Published Sep 6, 2024 β’ 11 β’ 3
Open Language Data Initiative: Advancing Low-Resource Machine Translation for Karakalpak Paper β’ 2409.04269 β’ Published Sep 6, 2024 β’ 11 β’ 3
Paper Copilot: A Self-Evolving and Efficient LLM System for Personalized Academic Assistance Paper β’ 2409.04593 β’ Published Sep 6, 2024 β’ 27 β’ 2