LLMs - a netzkontrast Collection

netzkontrast 's Collections

music

LLMs

Speech

Lora

Video

Image

LLMs

updated 4 days ago

Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback

Paper • 2501.03916 • Published 17 days ago • 14
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published 16 days ago • 89
Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 17 days ago • 80
Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published 15 days ago • 79
Entropy-Guided Attention for Private LLMs

Paper • 2501.03489 • Published 18 days ago • 14
Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published 15 days ago • 69
Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains

Paper • 2501.05707 • Published 15 days ago • 19
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training

Paper • 2501.06842 • Published 12 days ago • 15
Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published Dec 23, 2024 • 42
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Paper • 2412.17739 • Published Dec 23, 2024 • 40
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing

Paper • 2412.14711 • Published Dec 19, 2024 • 16
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published 25 days ago • 36
ProgCo: Program Helps Self-Correction of Large Language Models

Paper • 2501.01264 • Published 22 days ago • 25
Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published 14 days ago • 74
Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published 16 days ago • 50
MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 10 days ago • 268
Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 8 days ago • 95