Proof or Bluff? Evaluating LLMs on 2025 USA Math Olympiad Paper • 2503.21934 • Published Mar 27, 2025 • 1
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published Aug 8, 2025 • 206
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search Paper • 2509.25454 • Published Sep 29, 2025 • 146
Typhoon ASR Real-time: FastConformer-Transducer for Thai Automatic Speech Recognition Paper • 2601.13044 • Published 22 days ago • 12
On the Robustness of Answer Formats in Medical Reasoning Models Paper • 2509.20866 • Published Sep 25, 2025 • 1
Typhoon OCR: Open Vision-Language Model For Thai Document Extraction Paper • 2601.14722 • Published 20 days ago • 15
PRefLexOR: Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning and Agentic Thinking Paper • 2410.12375 • Published Oct 16, 2024 • 5
Reliable Fine-Grained Evaluation of Natural Language Math Proofs Paper • 2510.13888 • Published Oct 14, 2025 • 2
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 149
Typhoon Isan Collection An ASR and a language technology artifact for Thailand’s Isan dialect • 5 items • Updated 13 days ago • 3
ThaiOCRBench: A Task-Diverse Benchmark for Vision-Language Understanding in Thai Paper • 2511.04479 • Published Nov 6, 2025 • 1
FinCoT: Grounding Chain-of-Thought in Expert Financial Reasoning Paper • 2506.16123 • Published Jun 19, 2025 • 8
Phi-4 Collection Phi-4 family of small language, multi-modal and reasoning models. • 17 items • Updated Jul 10, 2025 • 194
Prior Prompt Engineering for Reinforcement Fine-Tuning Paper • 2505.14157 • Published May 20, 2025 • 7
Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging -- An Open Recipe Paper • 2502.09056 • Published Feb 13, 2025 • 31