Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning • arXiv:2503.07572
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL • arXiv:2503.07536
Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts • arXiv:2503.05447
Forgetting Transformer: Softmax Attention with a Forget Gate • arXiv:2503.02130
HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization • arXiv:2503.04598
PLDR-LLMs Learn A Generalizable Tensor Operator That Can Replace Its Own Deep Neural Net At Inference • arXiv:2502.13502
Liger: Linearizing Large Language Models to Gated Recurrent Structures • arXiv:2503.01496
Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs • arXiv:2503.01307
Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models • arXiv:2502.15499
Stable-SPAM: How to Train in 4-Bit More Stably than 16-Bit Adam • arXiv:2502.17055
AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO • arXiv:2502.14669
Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities? • arXiv:2502.12215