- Scaling Latent Reasoning via Looped Language Models
  Paper • 2510.25741 • Published • 231
- Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
  Paper • 2410.20672 • Published • 7
- The Y-Combinator for LLMs: Solving Long-Context Rot with λ-Calculus
  Paper • 2603.20105 • Published • 37
J C
dark-pen
AI & ML interests
None yet
Recent Activity
liked a model about 2 hours ago
kernels-community/quantization-bitsandbytes
liked a model about 2 hours ago
llm-semantic-router/mmbert32k-modality-router-lora
liked a model about 2 hours ago
nvidia/llama-nemotron-embed-vl-1b-v2