WORLDMEM: Long-term Consistent World Simulation with Memory • Paper • 2504.12369 • Published 4 days ago • 28
DataDecide • Collection • A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale. • 358 items • Updated 4 days ago • 11
Hogwild! Inference: Parallel LLM Generation via Concurrent Attention • Paper • 2504.06261 • Published 12 days ago • 101
TransMamba: Flexibly Switching between Transformer and Mamba • Paper • 2503.24067 • Published 21 days ago • 17