rubbyninja

AI & ML interests

None yet

Recent Activity

updated a collection about 1 month ago

advancing research

upvoted a paper about 1 month ago

VideoWorld: Exploring Knowledge Learning from Unlabeled Videos

Organizations

None yet

rubbyninja's activity

updated a collection about 1 month ago

advancing research

Collection

32 items • Updated Mar 11

upvoted a paper about 1 month ago

VideoWorld: Exploring Knowledge Learning from Unlabeled Videos

Paper • 2501.09781 • Published Jan 16 • 29

upvoted a paper about 1 month ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 118

upvoted a paper about 2 months ago

Better & Faster Large Language Models via Multi-token Prediction

Paper • 2404.19737 • Published Apr 30, 2024 • 78

updated a collection 3 months ago

advancing research

Collection

32 items • Updated Mar 11

upvoted a paper 3 months ago

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 56

upvoted a paper 3 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 382

upvoted 2 papers 3 months ago

Titans: Learning to Memorize at Test Time

Paper • 2501.00663 • Published Dec 31, 2024 • 22

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 143

upvoted a paper 3 months ago

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17, 2024 • 98

upvoted a paper 3 months ago

Large Concept Models: Language Modeling in a Sentence Representation Space

Paper • 2412.08821 • Published Dec 11, 2024 • 14

upvoted a paper 3 months ago

Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context

Paper • 1901.02860 • Published Jan 9, 2019 • 3

upvoted a paper 3 months ago

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Paper • 2404.07143 • Published Apr 10, 2024 • 110